
N C RANA • P S JOAG 


Copyrighted material 












CLASSICAL MECHANICS 


Narayan Chandra Rana 

Tata Institute of Fundamental Research, Bombay 

Pramod Sharadchandra Joag 

University of Poona, Pune 



Tata McGraw-Hill Publishing Company Limited 

NEW DELHI 


McGraw-Hill Offices 

New Delhi New York St Louis San Francisco Auckland Bogota Guatemala 
Hamburg Lisbon London Madrid Mexico Milan Montreal Panama 
Paris San Juan São Paulo Singapore Sydney Tokyo Toronto 





Tata McGraw-Hill 

© 1991. Tata McGraw-Hill Publishing Company Limited 

24th reprint 
RLXDRDDXRYKXR 

No part of this publication may be reproduced or distributed in any form or by 
any means, electronic, mechanical, photocopying, recording, or otherwise or 
stored in a database or retrieval system, without the prior written permission of 
the publishers. The program listings (if any) may be entered, stored and executed 
in a computer system, but they may not be reproduced for publication. 

This edition can be exported from India only by the publishers. 

Tata McGraw-Hill Publishing Company Limited. 

ISBN-13: 978-0-07-460315-4 
ISBN-10: 0-07-460315-9 

Published by Tata McGraw-Hill Publishing Company Limited, 

7 West Patel Nagar, New Delhi 110 008, and printed at 
Pushp Print Services, Delhi 110 053 


The McGraw-Hill Companies 




Contents 


Foreword 

Preface 


1.1 What is this chapter about 1 

1.2 What is classical mechanics 2 

1.3 The place of classical mechanics in physics and some definitions 2 

1.4 A brief history of the development of mechanics up to Newton 6 

1.5 Newton’s laws of motion 8 

1.6 Limitations of Newton’s programme 20 

1.7 Summary 22 

Problems 23 


1. Constrained Motions in Cartesian Coordinates 31 

1.0 Introduction 31 

1.1 Constraints and their classification 32 

1.2 Examples of constraints 35 

1.3 Principle of virtual work 38 

1.4 The basic problem with the constraint forces 40 

1.5 Lagrange’s equations of motion of the first kind 41 

1.6 Gibbs-Appell’s principle of least constraint 45 

1.7 D’Alembert’s principle 46 

1.8 Some additional remarks 50 

1.9 Work energy relation for constraint forces of sliding friction 50 

1.10 Summary 52 

Problems 52 


2. Lagrangian Formulation in Generalised Coordinates 55 

2.0 Introduction 55 

2.1 Change of notation 56 

2.2 Degrees of freedom 57 

2.4 Lagrange’s equations of motion of the second kind 61 

2.5 Properties of kinetic energy function T 63 

2.6 Theorem on total energy 66 

2.7 Some remarks about the Lagrangian 69 




2.8 Linear generalised potentials 69 

2.9 Generalised momenta and energy 70 

2.10 Gauge function for Lagrangian 72 

2.11 Invariance of the Euler-Lagrange equations of motion under generalised coordinate transformations 73 

2.12 Cyclic or ignorable coordinates 76 

2.13 Integrals of motion 77 

2.14 Concept of symmetry: homogeneity and isotropy 77 

2.15 Invariance under Galilean transformations 81 

2.16 Lagrangian for free particle motion 83 

2.17 Lagrange’s equations of motion for nonholonomic systems 85 

2.18 Lagrange’s equations of motion for impulsive forces 88 

2.19 Summary 91 


3. Rotating Frames of Reference 96 

3.1 Inertial forces in the rotating frame 96 

3.2 Electromagnetic analogy of the inertial forces 100 

3.3 Effects of Coriolis force 101 

3.4 Foucault’s pendulum 108 

3.5 Velocity and acceleration of a particle with respect to a system having two independent rotations about a common point 110 

3.6 More general case of two rotations separated by one translation 112 

3.7 Summary 114 

Problems 115 

4. Central Force 118 

4.0 Introduction 118 

4.1 Definition and properties of the central force 118 

4.2 Two-body central force problem 120 

4.3 Stability of orbits 124 

4.4 Conditions for closure 125 

4.5 Integrable power laws of the central force 126 

4.6 Derivation of force laws from kinematical laws of motion 127 

4.7 Kepler’s problem 131 

4.8 Actual Geometry of orbits and orbital elements 135 

4.9 Kepler’s equation 137 

4.10 Construction of an orbit from given set of initial conditions 139 

4.11 Kepler’s problem in velocity space 140 

4.12 Orbits of artificial satellites 142 

4.13 Precession of the perihelia of planetary orbits due to small perturbing noninverse square law of force 144 

4.14 The basic physics of tides 150 




4.15 Scattering in a conservative central force field 168 

4.16 Virial theorem 171 

4.17 Summary 175 


Problems 176 


5. Hamilton's Equations of Motion 


180 



5.1 Legendre’s dual transformation 181 

5.2 Hamilton’s function and Hamilton’s equations of motion 183 

5.3 Properties of the Hamiltonian and of Hamilton's equations of motion 184 

5.4 Routhian 185 

5.5 Configuration space, phase space and state space 187 

5.6 Lagrangian and Hamiltonian of relativistic particles and light rays 189 

5.7 Relativistic mass tensors 192 

5.8 Summary 195 
Problems 196 

6. Principle of Least Action and Hamilton’s Principle 198 

6.0 Introduction 198 

6.1 Principle of least action 199 

6.2 Hamilton’s principle 206 

6.3 Comparison between Fermat’s principle of least action in optics and Maupertuis’ principle of least action in mechanics 208 

6.4 Derivation of Euler-Lagrange equations of motion from Hamilton’s principle 209 

6.5 Derivation of Hamilton’s equations of motion for holonomic systems from Hamilton’s principle 210 

6.6 Invariance of Hamilton’s principle under generalised coordinate 



6.7 Hamilton’s principle and characteristic functions 212 

6.8 Noether’s theorem 215 

6.9 Lorentz invariance of Hamilton’s principal function for the relativistic motion of a free particle 217 

6.10 Significance of Hamilton’s principal 218 

6.11 Summary 219 


7. Brachistochrones, Tautochrones and the Cycloid Family 222 

7.0 Introduction 222 

7.1 The ‘chrone’ family of curves 223 

7.2 Brachistochrone for uniform force field 223 

7.3 Cycloid as a tautochrone 225 




7.4 Brachistochrone for spherically symmetric potential field V(r) 228 

7.5 Brachistochrones and Tautochrones inside a gravitating homogeneous sphere 230 

7.6 Tautochronous motion in a centrifugal force field and epicycloids 232 

7.7 Summary 233 

Problems 234 

8. Canonical Transformations 236 

8.0 Introduction 236 

8.1 Background and definition 237 

8.2 Generating functions 238 

8.3 Properties of canonical transformations 245 

8.4 Some examples of canonical transformations 248 

8.5 Canonical transformation of the free particle Hamiltonian 252 

8.6 Liouville’s theorem 254 

8.7 Area conservation property of Hamiltonian flows 255 

8.8 Summary 257 
Problems 258 

9. The Poisson Bracket 262 

9.0 Introduction 262 

9.1 Definition 262 

9.2 Some useful identities 263 

9.3 Elementary PBs 264 

9.4 Poisson’s theorem 264 

9.5 Jacobi-Poisson theorem (or Poisson’s second theorem) on PBs 265 

9.6 Invariance of PB under canonical transformations 267 

9.7 PBs involving angular momentum 268 

9.8 Dirac’s formulation of the generalised Hamiltonian 270 

9.9 Lagrange bracket (LB) 271 

9.10 Summary 273 

Problems 273 

10. Hamilton-Jacobi Theory 276 

10.0 Introduction 276 

10.1 Solution to the time dependent Hamilton-Jacobi equation and Jacobi’s theorem 276 

10.2 Connection with canonical transformation 279 

10.3 How to find the complete integral of the HJ equation 281 

10.4 Worked-out examples 283 

10.5 Action-Angle variables 292 

10.6 Adiabatic invariants 299 

10.7 Classical-quantum analogies 302 

10.8 Summary 309 

Problems 309 




11. Small Oscillations 311 

11.0 Introduction 311 

11.1 Types of equilibria and the potential at equilibrium 311 

11.2 Study of small oscillations using generalised coordinates 317 

11.3 Forced vibrations and resonance 327 

11.4 Summary 332 
Problems 333 

12. Rigid Body Dynamics 335 

12.0 Introduction 335 

12.1 Degrees of freedom of a free rigid body 336 

12.2 Euler’s and Chasles’ theorems 338 

12.3 Frames of reference used to describe the motion of a rigid body 345 

12.4 Kinetic energy of a rotating rigid body 347 

12.5 Angular momentum 349 

12.6 Transformations of and theorems on the moment of inertia tensor 350 

12.7 Examples of the calculation and the experimental measurement of the moment of inertia tensor 358 

12.8 Angular momentum in laboratory and centre of mass frames 366 

12.9 Torque and its relation to angular momentum 368 

12.10 Euler’s equation of motion for rigid body 371 

12.11 Time variation of rotational kinetic energy 372 

12.12 Rotation of a free rigid body 372 

12.13 Poinsot’s method of geometrical construction 373 

12.14 Analytical method of Euler for free rotation and the third integral of motion 377 

12.15 Chandler wobbling of the earth 379 

12.16 Motion of $\omega$ in space for free rotation 381 

12.17 Why should a freely rotating body precess at all? 384 

12.18 Steady precession of a uniaxial body (symmetric top) under the action of an external torque 386 

12.19 The case of arbitrary rotations 390 

12.20 Addition of two angular velocities 391 

12.21 Eulerian angles 392 

12.22 Motion of a heavy symmetric top rotating about fixed point in the body under the action of gravity 396 

12.23 Detailed study of the motion of a symmetric top 398 

12.24 Examples of tops and their analogues 411 

12.25 Forced precession of the earth’s axis of rotation 415 

12.26 Foucault’s gyroscope 420 

12.27 Stability conditions for motions of rigid bodies in rotating frames 423 

12.28 Dynamics of some games and sports 425 

12.29 Summary 440 
Problems 441 




13. Elasticity 447 

13.0 Introduction 447 

13.1 Displacement vector and the strain tensor 448 

13.2 Stress tensor 455 

13.3 Strain energy 459 

13.4 Possible forms of free energy and stress tensor for isotropic solids 461 

13.5 Elastic moduli for isotropic solids 462 

13.6 Elastic properties of general solids: Hooke’s law and stiffness constants 464 

13.7 Elastic properties of isotropic solids 466 

13.8 Propagation of elastic waves in isotropic elastic media 469 

13.9 Summary 473 

Problems 474 


14. Fluid Dynamics 476 

14.0 Introduction 476 

14.1 A few basic definitions 477 

14.2 The central problem of fluid dynamics 478 

14.3 Equation of state 478 

14.4 Types of time rates of change of quantities 478 

14.5 Equation of continuity 480 

14.6 Application to Liouville’s theorem 482 

14.7 Equations of motion 482 

14.8 Pressure potential 483 

14.9 External force field 484 

14.10 Cases of equilibrium fluid distribution in presence of external fields 485 

14.11 Bernoulli’s theorem 486 

14.12 Applications of Bernoulli’s theorem 491 

14.13 Gravity waves and ripples 496 

14.14 Two-dimensional steady irrotational flow of incompressible fluids 502 

14.15 Kelvin’s and Helmholtz’s theorems 510 

14.16 Representation of vortices by complex functions 514 

14.17 Flow of imperfect fluids 516 

14.18 Summary 521 

Problems 521 


Appendix A1 Coordinate Frames 525 

A1.1 Orthogonal coordinate frames 525 

A1.2 Nonorthogonal or oblique coordinate frames 531 

Appendix A2 Vector Calculus 534 

A2.1 Introduction to Kronecker delta and Levi-Civita symbols 534 




A2.2 Partial differentiation of vectors and scalars 536 
A2.3 Ordinary differentiation of vectors 537 
A2.4 Vector integration 538 

A2.5 Tangent, principal normal and binormal of orbits 540 
A2.6 Kinematics of particle motion 543 

A2.7 Kinematics in spherical polar and other coordinate frames 545 
A2.8 Vectors in orthogonal curvilinear coordinate systems 547 
A2.9 Vectors in general curvilinear coordinates 551 


Appendix A3 Tensors 554 

A3.1 Formal concepts of scalars and vectors 554 

A3.2 Tensors 558 

Appendix B Sample of Short Questions 564 

Class test I 564 

Class test II 565 

Class test III 567 

Class test IV 568 

Final examination 569 

Appendix C Hints and Answers to Selected Problems 572 

Introduction 572 

Chapter 1 575 

Chapter 2 575 

Chapter 3 577 

Chapter 4 577 

Chapter 5 579 

Chapter 6 580 

Chapter 7 580 

Chapter 8 581 

Chapter 9 581 

Chapter 10 582 

Chapter 11 583 

Chapter 12 584 

Chapter 13 585 

Chapter 14 586 

Appendix D Physical Constants 588 

Bibliography 590 

Index 594 


Values of Fundamental 
Physical Constants 


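Universal Constants 

| Quantity | Symbol | Value | Unit |
|---|---|---|---|
| speed of light in vacuum | $c$ | 299 792 458 | m s$^{-1}$ |
| permeability of vacuum | $\mu_0$ | $4\pi \times 10^{-7}$ = 12.566 370 614... $\times 10^{-7}$ | N A$^{-2}$ |
| permittivity of vacuum, $1/\mu_0 c^2$ | $\epsilon_0$ | 8.854 187 817... $\times 10^{-12}$ | F m$^{-1}$ |
| Newtonian constant of gravitation | $G$ | 6.672 59(85) $\times 10^{-11}$ | m$^3$ kg$^{-1}$ s$^{-2}$ |
| Planck constant | $h$ | 6.626 075 5(40) $\times 10^{-34}$ | J s |
| in electron volts, $h/e$ | | 4.135 669 2(12) $\times 10^{-15}$ | eV s |
| $h/2\pi$ | $\hbar$ | 1.054 572 66(63) $\times 10^{-34}$ | J s |
| in electron volts, $\hbar/e$ | | 6.582 122 0(20) $\times 10^{-16}$ | eV s |

Atomic and Nuclear Constants 

| Quantity | Symbol | Value | Unit |
|---|---|---|---|
| elementary charge | $e$ | 1.602 177 33(49) $\times 10^{-19}$ | C |
| fine-structure constant, $\mu_0 c e^2/2h$ | $\alpha$ | 7.297 353 08(33) $\times 10^{-3}$ | |
| Bohr radius, $\alpha/4\pi R_\infty$ | $a_0$ | 0.529 177 249(24) $\times 10^{-10}$ | m |
| electron mass | $m_e$ | 9.109 389 7(54) $\times 10^{-31}$ | kg |
| | | 5.485 799 03(13) $\times 10^{-4}$ | amu |
| in electron volts, $m_e c^2/e$ | | 0.510 999 06(15) | MeV |
| Compton wavelength, $h/m_e c$ | $\lambda_C$ | 2.426 310 58(22) $\times 10^{-12}$ | m |
| $\lambda_C/2\pi = \alpha a_0 = \alpha^2/4\pi R_\infty$ | | 3.861 593 23(35) $\times 10^{-13}$ | m |
| classical electron radius, $\alpha^2 a_0$ | $r_e$ | 2.817 940 92(38) $\times 10^{-15}$ | m |
| Thomson cross-section, $(8\pi/3) r_e^2$ | $\sigma_e$ | 0.665 246 16(18) $\times 10^{-28}$ | m$^2$ |
| electron magnetic moment | $\mu_e$ | 928.477 01(31) $\times 10^{-26}$ | J T$^{-1}$ |
| in Bohr magnetons | $\mu_e/\mu_B$ | 1.001 159 652 193(10) | |
| proton mass | $m_p$ | 1.672 623 1(10) $\times 10^{-27}$ | kg |
| | | 1.007 276 470(12) | amu |
| in electron volts, $m_p c^2/e$ | | 938.272 31(28) | MeV |
| neutron mass | $m_n$ | 1.674 928 6(10) $\times 10^{-27}$ | kg |
| | | 1.008 664 904(14) | amu |
| in electron volts, $m_n c^2/e$ | | 939.565 63(28) | MeV |
| Avogadro constant | $N_A$, $L$ | 6.022 136 7(36) $\times 10^{23}$ | mol$^{-1}$ |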




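| Quantity | Symbol | Value | Unit |
|---|---|---|---|
| atomic mass constant, $m_u = \frac{1}{12} m(^{12}\mathrm{C})$ | $m_u$ | 1.660 540 2(10) $\times 10^{-27}$ | kg |
| in electron volts, $m_u c^2/e$ | | 931.494 32(28) | MeV |
| molar gas constant | $R$ | 8.314 510(70) | J mol$^{-1}$ K$^{-1}$ |
| Boltzmann constant, $R/N_A$ | $k$ | 1.380 658(12) $\times 10^{-23}$ | J K$^{-1}$ |
| in electron volts, $k/e$ | | 8.617 385(73) $\times 10^{-5}$ | eV K$^{-1}$ |
| in hertz, $k/h$ | | 2.083 674(18) $\times 10^{10}$ | Hz K$^{-1}$ |
| in wavenumbers, $k/hc$ | | 69.503 87(59) | m$^{-1}$ K$^{-1}$ |
| Stefan-Boltzmann constant, $(\pi^2/60)\,k^4/\hbar^3 c^2$ | $\sigma$ | 5.670 51(19) $\times 10^{-8}$ | W m$^{-2}$ K$^{-4}$ |

Astronomical Constants 

| Quantity | Symbol | Value | Unit |
|---|---|---|---|
| heliocentric gravitational constant | $GM_\odot$ | 1.327 124 38 $\times 10^{20}$ | m$^3$ s$^{-2}$ |
| geocentric gravitational constant | $GM_\oplus$ | 3.986 004 48 $\times 10^{14}$ | m$^3$ s$^{-2}$ |
| astronomical unit | 1 AU | 1.495 978 706 6 $\times 10^{11}$ | m |
| equatorial radius of sun | $R_\odot$ | 6.959 9 $\times 10^{8}$ | m |
| equatorial radius of earth | $R_\oplus$ | 6.378 137 $\times 10^{6}$ | m |
| angular velocity of earth | | 7.292 115 146 7 $\times 10^{-5}$ | s$^{-1}$ |
| ratio of earth’s mass to moon’s mass | $M_\oplus/M_{\mathrm{moon}}$ | 81.300 813 | |
| radius of moon | | 1.738 2 $\times 10^{6}$ | m |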





Introduction 


1.1 WHAT IS THIS CHAPTER ABOUT? 

The present chapter is devoted to briefly recapitulating the topics that are usually covered 
at the lower levels. The first two sections are quite formal, and at places may even appear 
to be pedantic. However, the reader need not lose heart if lost in too many definitions or 
concepts. The reader may as well omit these two sections in the first reading. In the third 
section one would find a brief historical note on the development of ideas, mostly related to 
mechanics up to the time of Newton. Even though it is quite brief, reading it may be found 
amusing. We really come to the business from the fourth section onwards. It is presumed 
that a student would have already spent a number of years studying various aspects of 
Newton’s laws of motion. Even writing a summary of all that would surely take many more 
pages than we have spent. Nevertheless, we have tried to emphasize the significant aspects 
of Newton’s laws of motion, and some of their applications. Should any more important 
items be included, the authors would appreciate receiving specific suggestions. 

We plan to give in the introduction to every chapter, a brief note about the most eminent 
person, if any, marked for the development of material that is covered in that chapter. In 
this chapter, it is obviously Newton. 

Born prematurely, a physical weakling, Sir Isaac Newton (1642 - 1727) had to wear 
a bolster to support his neck during his first months; and no one expected him to live. 
Newton’s father died three months before he was born, his mother remarried and he was 
left with his aged grandmother. He entered Trinity College in 1661. Not so distinguished 
as a student, he failed in 1663 in a scholarship examination due to awful inadequacy in 
geometry. The fateful Plague years of 1665 - 1666, he spent away from Cambridge and 
these were the most fruitful years of his life, laying the foundation for his future greatness in 
optics, dynamics and mathematics. Returning to Cambridge, he first became a minor fellow 
at Trinity in 1667 and a major fellow the next year. His mathematics professor, Isaac Barrow 
recognised his genius, and in a rare act of self-abnegation, Barrow resigned his professorship 
so that the young and more promising Newton could have it. So Newton was offered the 
Lucasian Chair of Mathematics in 1669. At Cambridge, Newton became the very model of 
an absent-minded professor. He was never known to indulge in any recreation or pastime, 
either in riding out or taking the air, walking, bowling, or any other exercise whatever, 
thinking that hours lost in such activities were better spent in studies. He often worked until 
two to three o’clock in the morning, ate sparingly and sometimes forgot to eat altogether. 

He presented his first paper, in optics, in 1672 to the Royal Society, and was immediately 
elected a fellow of the Society. However, his theory of light and colour brought him into great 
controversy with Christiaan Huygens and Robert Hooke, and he vowed not to publish any 
of his discoveries. During the Plague years, he completely solved the problem of colliding 
bodies, discovered the law of centrifugal force and got the idea of gravitation. He solved 
the Kepler problem (actually the inverse of it) in 1679, and the wonderful theorem proving 
that a homogeneous gravitating sphere attracts all points outside it as if its mass were 
concentrated at its centre in 1685. It was only at the insistence of Edmond Halley that he 
finally decided to publish his works. He composed his magnum opus the Principia sometime 
between autumn 1684 and spring 1686, which was finally published in July 1687 in three 
volumes. Since then Principia has been regarded as the Bible of mechanics. 

In 1696, he abandoned the academic life for the position of Warden, later Master, 
of the Mint. In 1705, he was knighted by Queen Anne, and he served for many years as the President 
of the Royal Society. His book on optics was published in 1704. 

The plan of the present book is given in the sixth section with specific reference to the 
limitations of Newton’s program. The concept of the whole book is to present most of the 
important post-Newtonian developments of classical mechanics. So please do not miss this 
section. The chapter ends with a set of fifty problems; hints for some of the solutions are given at 
the end of the book. We expect that an average student would be able to solve half of the 
problems even without looking at the hints. 


1.2 WHAT IS CLASSICAL MECHANICS? 

Classical mechanics is that branch of physics which deals with the description and explanation 
of the motion of point-like as well as extended, rigid as well as deformable objects 
embedded in a three-dimensional Euclidean space. The part of mechanics which deals only 
with the geometrical description of the motion, such as Galileo’s laws of falling bodies, or 
Kepler’s laws of planetary motion, is called kinematics. The part which offers causal explanation 
of the motion along with its description, such as the application of Newton’s laws of 
motion, Newton’s law of gravitation, etc., is called dynamics. The present book is primarily 
concerned with classical dynamics of Newton and his followers. 


1.3 THE PLACE OF CLASSICAL MECHANICS IN PHYSICS AND SOME 
BASIC DEFINITIONS 

Classical mechanics is the oldest branch of physics. Some of the greatest minds of all 
times, such as Sir Isaac Newton, Joseph Lagrange, Leonhard Euler, Simon Laplace, Henri 
Poincaré, Sir William Hamilton and Carl Jacobi, laid the foundation and built the theoretical 
structure of the subject. The well formulated structure of classical mechanics has in fact 
provided an ideal paradigm for the structural development of the relatively new branches of 
physics, such as electrodynamics, relativistic mechanics, quantum mechanics and statistical 
mechanics. In this section we would like to explain briefly what is meant by classical in 
classical mechanics, in reference to other modern branches of mechanics and, while doing so, 
introduce a few related concepts and definitions. 

A particle is ideally defined to be any point-like object or entity; however, it can have any 
finite extension provided its extension is irrelevant to the study of the motion of the object 
as a whole. For example, the motion of the earth around the sun can be studied assuming 
the whole earth to be a particle, while the study of the motion of a billiard ball or a disc requires a 
knowledge of its shape and the distribution of matter within it, so such a body cannot be regarded 
as a particle. Even an atom or a molecule may or may not be regarded as a particle; it 
all depends on the requirement whether its extension is relevant to the study of its motion 
or not. Classically, a particle is endowed with some mass and electric charge (if it is not 
electrically neutral), whereas an extended object is characterised by not only its total mass 
and charge, but also their distributions throughout the body, giving rise to concepts such 
as moment of inertia, electric dipole and multipole moments, etc. Classically, the spin of a 
body can be imagined only if it has got an extension and therefore the concepts of moment 
of inertia and spin are inseparable in classical mechanics. However, in quantum mechanics a 
particle can have mass, charge and also spin; that is why the spin of an elementary particle 
is said to be a purely quantum concept. This is one of the major differences between the 
concepts of particle in classical and quantum mechanics. 

The concept of a classical particle endowed with mass and electric charge represents its 
ontological (existential) status. Next comes the concept of motion of the particle, which 
requires introduction of two more fundamental concepts, namely space and time. The concept 
of space is inherent in the idea of the extension of any object and also in the idea of 
separation between two particles. Essentially, the mere allowance for the discreteness of an 
entity in the form of more than one particle in the universe necessitates the introduction of 
the concept of space. Up to this point space and matter together give only a static organisation 
of matter in space. If this organisation of matter is not found to be static, that is, if a 
change in the total organisation is perceived, one can retain the same space as the common 
substratum for the description of the changes that occur continuously and these continuous 
changes in the organisation of matter constitute the motion of the constituent objects or 
entities. Time is the parameter that characterises the changes in (or, equivalently, the motion 
of) the organisation of matter in space. Space and time do not have any ontological 
status. They are merely concepts in the mind of a conscious observer. These concepts allow 
motion to be possible and entities to have both extensions and coexistensibility in plural 
forms. However, in classical mechanics there is hardly any place for mind and consciousness 
and therefore, the observer is most conveniently replaced by the idea of a frame of reference. 
But in quantum mechanics, the role of a conscious observer in the process of measurement 
has been one of the central issues right from its inception. 

Now with respect to a given reference frame $S_0$, the so called impersonated observer of 
classical mechanics, one postulates the existence of a three dimensional Euclidean space $E^3$, 
such that the position of any particle with respect to $S_0$ can be represented by a point in 
$E^3$. The motion of a particle is defined to be the change of its position relative to $S_0$. In 
classical mechanics, motion is assumed to be continuous and hence each particle describes a 
continuous orbit or trajectory in $E^3$. Since time is to be a measure of motion, a physical clock 
that measures time has to measure the motion of some particle in it. An instant is defined 
to be the cursory position of this chosen particle of the physical clock, such as the tip of the 
‘second’ hand of a clock, or the position of the sun’s centre in the sky, or the state of an 
electrically charged particle in a piece of suitably-cut quartz crystal, etc. Now, by making a 
one-to-one correspondence between the continuous orbit of this particle of the clock and the 
orbit of any other given particle in the universe we can ascertain that at every instant, each 
particle in the universe has a unique position with respect to any given frame of reference. 
This continuous nature of space, time and the orbit or trajectory of any classical particle is 
often implied by the use of the phrase classical, as opposed to quantum which necessarily 
allows indeterminacy in position and time at the cost of continuity and sharpness of the 
orbit of so called quantum particles, the latter behaving like packets of waves diffused in 
space and time. Thus in classical mechanics every real object, be it a particle or an extended 
body which is rigid, elastic or fluid, has a continuous history in space and time. 

So far we have defined space, time and motion in terms of geometry, but physics is also 
concerned with the quantitative description of nature. This means that one has to associate 
geometry with numbers. It was a German mathematician, Georg Cantor, who proved in 
1873 that there is a one-to-one correspondence between the set of all real numbers and the 
set of all points in any finite or infinite line, surface area, 3-D or higher (up to countable 
infinite) dimensional volumes. The whole is then equal to a part of itself! For example, 
for every possible real number $x$ that lies between 0 and $\infty$ the value of $e^{-x}$ always lies 
between 0 and 1. Again, whatever numbers you choose between 0 and 1, you can multiply 
all of them by any arbitrarily chosen small number $\epsilon$, so that the number of real numbers 
between 0 and $\epsilon$ is exactly the same as those between 0 and 1, which is also precisely the 
same as the number of real numbers between 0 and $\infty$. The same is true for the total 
number of points in any infinitesimally small but finite or infinitely large segment of a line 
or of an area or of a volume. This exact equivalence between geometry and real numbers 
has made the geometrical representation of algebraic equations possible, together with the 
freedom to arbitrarily choose the scale, that is, the length of the segment of any axis one 
chooses to represent the intervals between 0,1,2,..., etc. Because of the above freedom, 
two persons never draw a graph identically even though the two graphs representing the 
same equations are mathematically equivalent. To a physicist, this freedom has, first of all, 
given an opportunity to express space, time, mass and charge in a quantitative fashion and 
second, an absolute freedom to choose the units of measurements. The International Bureau 
of Weights and Measures (BIPM) near Paris has adopted some arbitrary definitions of the SI units. The four 
primary units are the metre, kilogram, second and ampere (the M, K, S and A of the MKSA system), namely, to quantify matter (mass and charge or electric 
current) and motion (space and time). There are three other primary units, kelvin, mole and 
candela for measuring absolute temperature, amount of substance and luminous intensity of any 
radiating source, respectively. All other concepts in physics and their corresponding units 
are defined by prescribing implicit or explicit relations among the already defined ones. All 
laws of physics are merely some statements of these inter-relations among various concepts, 
each of which is quantifiable with some uniquely defined units. If any of the above seven 
primary units is redefined by the BIPM, for the sake of consistency, all the related derived 
units will have to be accordingly redefined. 
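
The correspondence between intervals described above can be tried out numerically. The following Python sketch is only an illustration under stated assumptions: the function names are hypothetical, and floating-point numbers are a finite stand-in for the real numbers, so the code shows that the maps are invertible on sample points rather than proving Cantor's result.

```python
import math


def to_unit_interval(x: float) -> float:
    """Map a positive real x in (0, inf) to a point of (0, 1) via exp(-x)."""
    return math.exp(-x)


def from_unit_interval(y: float) -> float:
    """Inverse map: recover x in (0, inf) from y in (0, 1)."""
    return -math.log(y)


def shrink(y: float, eps: float) -> float:
    """Rescale a point of (0, 1) into (0, eps) by multiplying with eps."""
    return eps * y


def unshrink(z: float, eps: float) -> float:
    """Inverse of shrink: map a point of (0, eps) back into (0, 1)."""
    return z / eps


if __name__ == "__main__":
    eps = 1e-3  # arbitrary small scale, chosen only for illustration
    for x in (0.01, 1.0, 3.7, 25.0):
        y = to_unit_interval(x)   # lands in (0, 1)
        z = shrink(y, eps)        # lands in (0, eps)
        x_back = from_unit_interval(unshrink(z, eps))
        print(x, y, z, x_back)
```

Each sample point can be carried from $(0, \infty)$ to $(0, 1)$ to $(0, \epsilon)$ and back again, which is the sense in which the three intervals contain "the same number" of points.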

Once the units are defined, position, time, etc. can be exactly quantified. At any instant, 
the position of any particle in any $n$ dimensional Euclidean space $E^n$ with respect to 
any given reference frame is expressed by an ordered set of $n$ independent real numbers 
$(x_1, x_2, \ldots, x_n)$, say, each being called a position coordinate. The very fact that we can 
draw at most three mutually perpendicular axes at any given point in our real world, and 
that the transverse electromagnetic (elastic) waves exist and can freely propagate in space 
(in the elastic medium), show that the dimensions of the space embedding classical and 
electromagnetic phenomena are to be three, or more precisely, at least three. Similarly, 
the time $t$ at a given instant, as read by a clock, is expressed as a real number. Thus 
the equation of the orbit of any classical particle in $E^3$ is given by the explicit functions 
$x_1(t)$, $x_2(t)$ and $x_3(t)$. Now, what happens when we go to another frame which has got its 
own clock and meter stick, calibrated in the same units as those in the original frame? We 
make the following two assumptions in classical mechanics while going from one frame of 
reference to another: 

(i) All instantaneous readings of the clocks identically calibrated but located in different 
frames are absolutely identical. 

(ii) The individual position coordinates of any given particle at any given instant may 
be different in different frames, but must satisfy the basic Euclidean condition, that is, the 
distance between any pair of particles at a given instant is the same in every frame. 
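
As a rough illustration of assumptions (i) and (ii), the sketch below uses one simple family of frame changes that satisfies both: a frame translating with constant velocity relative to the original one (a Galilean boost). The choice of this transformation, the function names and the sample numbers are all assumptions made for the example; rotations and shifts of origin, which also preserve distances, are left out for brevity.

```python
import math


def galilean_boost(position, t, frame_velocity):
    """Coordinates of a point as seen from a frame moving with constant
    velocity frame_velocity relative to the original frame.

    The clock reading t is returned unchanged, as in assumption (i).
    """
    new_position = tuple(x - v * t for x, v in zip(position, frame_velocity))
    return new_position, t


def distance(a, b):
    """Euclidean distance between two points given as coordinate tuples."""
    return math.dist(a, b)


if __name__ == "__main__":
    t = 2.0
    p, q = (1.0, 2.0, 3.0), (4.0, 6.0, 3.0)   # two particles at instant t
    v = (10.0, -2.0, 0.5)                      # velocity of the second frame

    p2, t2 = galilean_boost(p, t, v)
    q2, _ = galilean_boost(q, t, v)

    # The coordinates change, but the clock reading and the separation do not.
    print(p2, q2, t2)
    print(distance(p, q), distance(p2, q2))
```

The individual coordinates differ between the two frames, while the separation of the pair and the time reading come out the same in both, as assumption (ii) and assumption (i) require.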

We know that these assumptions are violated in frames which have got bodies moving 
with extremely high velocities, comparable to that of light in vacuum (c = 299792.458 
km/s, exact by definition). At such high speeds, the time differences between any two given 
instants are measured differently in different reference frames, and these deviating time 
differences are related to their Euclidean distance measurements by certain rules. These 
rules were prescribed by a new theory, the special theory of relativity, advanced by Albert 
Einstein in 1905. The measurements of space, time and mass thus become frame-dependent 
quantities in this new theory, and classical mechanics is replaced by relativistic mechanics. 
However, at speeds less than, say, 1000 km/s, the results obtained using the laws of classical 
mechanics differ from their relativistic counterparts at the fifth significant digit at the most, 
and hence are not significantly different. Usually, relativistic mechanics is treated separately, 
but we shall occasionally pick up some interesting problems or examples from relativistic 
mechanics as illustrations. 
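
The size of the relativistic correction at a given speed can be estimated with a short calculation. The sketch below assumes that the deviation is measured by $\gamma - 1$, where $\gamma = 1/\sqrt{1 - v^2/c^2}$ is the Lorentz factor; the function names are illustrative and the sample speeds are arbitrary choices.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s (exact by definition)

def lorentz_factor(v_km_s):
    """Return gamma = 1/sqrt(1 - v^2/c^2) for a speed given in km/s."""
    beta = v_km_s / C_KM_S
    return 1.0 / math.sqrt(1.0 - beta * beta)

def relative_correction(v_km_s):
    """Fractional size of the leading relativistic correction, gamma - 1."""
    return lorentz_factor(v_km_s) - 1.0

# Illustrative speeds (km/s); 1000 km/s is the threshold quoted in the text.
for v in (1.0, 30.0, 1000.0):
    print(f"v = {v:7.1f} km/s   gamma - 1 = {relative_correction(v):.3e}")
```

At 1000 km/s the correction comes out at a few parts in a million, which is in line with the statement that classical and relativistic results agree to about five significant digits.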

The most important point about the theory of relativity is the union of space and time
forming a single continuum. (Someone found it in the very name of Einstein, whose break-up
is suggested to be EIN+ST+EIN, ‘ein’ meaning ‘one’ in German and ST being the
abbreviation for space and time!) In his general theory of relativity, Einstein has gone one
step further and combined space-time and matter, one being able to influence the other. As 
a result, a local fluctuation in space-time can propagate as a gravitational wave, which is 
an entity as real as the electromagnetic waves. So in Einstein’s hand, not only have space 
and time united to form a single continuum, but they have also been promoted to having an 
ontological status, which, in the premise of classical mechanics is enjoyed by matter alone 
and in electrodynamics by charged particles and their fields. Furthermore, the question of 
the physical reality of the above trinity of space, time and matter, as Einstein had perceived 
it, is now undergoing the acid tests of quantum mechanical experiments. It is doubtful 
whether quantum reality can be at all independent of any conscious observer, in which case
quantum mechanics may not be all that mechanical! In this respect, classical mechanics is
absolutely mechanical. With this mechanical view of the world in mind, we shall proceed 
for the rest of the book. 


1.4 A BRIEF HISTORY OF THE DEVELOPMENT OF MECHANICS UP TO 
NEWTON 

The Greek philosopher-scientist Aristotle (384 - 322 BC) was the first to suggest in his
book Physics a quantitative law of motion, which states that the velocity of any object ($v$)
is proportional to the applied force ($F$) and inversely proportional to the resistance ($R$),
that is,

$$v = k\,\frac{F}{R}$$

where $k$ is a constant. Till the beginning of the fourteenth century, Aristotelian ideas
prevailed throughout Europe, though not without criticisms — particularly, noting that 
some minimum force is normally needed in order to impart motion to any body that rests 
on some other body. One of the early medieval scholars, Avempace (1106 - 1138) gave an 
alternative law to replace Aristotle’s, which states that 

v = k(F - R) 

according to which a minimum of force F = R is required to initiate motion. It is now 
known that the same law was proposed by Johannes Philoponus at the end of the sixth 
century AD. 

From AD 1300 onwards, two medieval schools, one centred around Merton College, Oxford,
and the other centred in Paris, began to contribute substantially to the development of
mechanics. Richard Swineshead was the first to define uniform local motion to be ‘one in
which in every equal part of the time an equal distance is described’. William Heytesbury 
correctly conceived the idea of acceleration: ‘any motion whatsoever is uniformly accelerated 
if in each of any equal parts of time, it acquires an equal increment of velocity’. He was 
also the first to define instantaneous velocity. By 1350 the Mertonian school arrived at the 
correct mean speed theorem, which simply states that the mean speed over any interval of 
time for a uniformly accelerated motion starting from rest is exactly half of the final
speed. The proof of this theorem readily came from Nicole de Oresme (1325 - 1382) of the 
Parisian school. 

Meanwhile Thomas Bradwardine (1290 - 1349) in his Tractatus de Proportionibus gave in
1328 another law of motion, which reads as ‘The proportion of the proportions of motive to
resistive powers is equal to the proportion of their respective speeds of motion, and conversely.’
In effect this means

$$v = k \log\frac{F}{R}$$

It should however be remembered that the idea of the logarithm was introduced only in 1614,
by John Napier. Jean Buridan of the Parisian school introduced a term impetus ($I$) to represent
the impressed force of moving bodies, which was defined as the product of velocity and the
quantity of matter. He also explained the cause of free fall as the increase in the impetus.
Subsequently, Marsilius of Inghen (1340 - 1396) distinguished between rectilinear and the 
circular impetus. William of Ockham (1300 - 1350) was the first to separate the problem of 
kinematics from that of dynamics. According to him, kinematics deals with the definition 
and measurement of motion, whereas dynamics, with the measurement of forces and their 
effects. 

The next stage of major developments took place in the arena of celestial mechanics. The 
Polish astronomer Nicolaus Copernicus (1473 - 1543) put forward the heliocentric theory
of the solar system, replacing the astronomer Claudius Ptolemy’s geocentric one (150 AD). 
Not long after Copernicus’ death, a Danish nobleman Tycho de Brahe (1546 - 1601) began 
a series of observations on Mars and the other planets but died before he could properly 
analyse the data. It was one of his young assistants, Johannes Kepler (1571 - 1630) who, 
having got access to these data could finally formulate his three celebrated laws of planetary 
motion (the first two were published in 1609 and the third in 1618) which can be stated as 
follows. 

(i) The planets orbit the sun in ellipses with the sun at one focus;

(ii) The line joining the sun and the planet sweeps equal areas in equal intervals of time; 
and 

(iii) The squares of the orbital periods of the planets are directly proportional to the 
cubes of the mean distances of the planets from the sun. 
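
Kepler's third law is easy to check numerically. The sketch below assumes rounded modern values for the mean distances (in astronomical units) and orbital periods (in years); these figures are not taken from the text and are only approximate. In these units the law predicts that $T^2/a^3$ is close to 1 for every planet.

```python
# (name, mean distance a in AU, orbital period T in years); rounded modern values,
# supplied here only for illustration.
PLANETS = [
    ("Mercury", 0.387, 0.241),
    ("Venus", 0.723, 0.615),
    ("Earth", 1.000, 1.000),
    ("Mars", 1.524, 1.881),
    ("Jupiter", 5.203, 11.862),
    ("Saturn", 9.537, 29.457),
]

def kepler_ratio(a_au, t_years):
    """T^2 / a^3, which the third law says is the same for every planet."""
    return t_years**2 / a_au**3

ratios = {name: kepler_ratio(a, t) for name, a, t in PLANETS}
for name, ratio in ratios.items():
    print(f"{name:8s} T^2/a^3 = {ratio:.4f}")
```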

While Kepler was busy in formulating these laws, Galileo Galilei (1564 - 1642), the 
famous Italian scientist, made a telescope in 1610. By observing Jupiter’s moons and the 
phases of Venus, he confirmed the Copernican heliocentric theory of the solar system. He 
also performed the famous ‘Tower of Pisa’ experiment during 1589 - 1632, but in the beginning
supported Aristotle’s view, namely that the downward movement of any body endowed
with weight is quicker in inverse proportion to its size. In doing so he was in fact giving due
respect to his raw observational results, viz., due to air resistance, one actually finds that 
heavy and dense bodies descend faster than the bulky and lighter ones. It was around 1638 
that he finally formulated the three laws of falling bodies which go by his name, basically 
asserting that if one completely removes the resistance of air, all materials would descend 
with equal acceleration. In 1632 he published the principle of conservation of motion on 
any frictionless horizontal plane. This law is often referred to as the Galilean law of inertia. 
Galileo was also the first to note the isochronous motion of simple pendula, that is the period 
of a simple pendulum is independent of its amplitude of oscillation. He is regarded as the 
true father of physical science since he totally broke the age old tradition of accepting the 
supremacy of pedagogical arguments. He put every scientific assertion to direct experimental
or observational test as a necessary condition for its viability as a scientific statement.
Following this spirit of scientific investigation, Europe soon became the birthplace of practically
all the scientific and technological developments that were to follow. The Italian
Academy of Sciences was established in 1607, the Royal Society of London in 1660 and the 
French Academy of Sciences in 1666. 

The French philosopher René Descartes (1596 - 1650) strongly opposed any possibility
of action at a distance. Every action, according to him, has to be transmitted necessarily
through the physical contact of material bodies. However, he is chiefly remembered for his
idea of Cartesian coordinates. The Dutch mathematician Willebrord Snell (1591 - 1626)
gave the laws of refraction of light (1621) and also determined the radius of the earth quite 
accurately (1625). The Italian physicist Evangelista Torricelli (1608 - 1647) developed 
parabolic ballistics (1640) as a consequence of Galileo’s laws of falling bodies. He also 
invented the mercury barometer (1643). Another French amateur mathematician Pierre de 
Fermat (1601 - 1665) gave the principle of least time for the propagation of light rays in any 
medium (1657), based on Snell’s law of refraction. Blaise Pascal (1623 - 1662), a French 
mathematician, developed the theory of hydrostatics in 1637. Sir Christopher Wren (1632 - 
1723), the famous English architect and mathematician, formulated some laws of collision 
of elastic bodies, introduced parabolic mirrors in telescopes (1669), and fixed the standard 
length of oscillation of a pendulum clock (1671). The Dutch physicist Christiaan Huygens 
(1629 - 1695) gave the correct laws of colliding bodies (1656) and the kinematics of circular 
and the isochronal motion along a cycloidal track (1673) and, of course, his theory of light 
propagation in the form of undulatory waves (1678). The English physicist Robert Hooke 
(1635 - 1703), proposed the so-called Hooke’s law of elasticity (1675) and correctly guessed 
the inverse square law of force between the sun and the planets (1679). The Italian physicist 
Giovanni Borelli (1608 - 1679) talked about the centrifugal forces acting on bodies moving 
in circular orbits. Most of these people were in fact aware of Kepler’s laws of planetary 
motion and wanted to understand the dynamics behind them. However, they were so much 
influenced by the Cartesian antithesis of any kind of action-at-a-distance, that they found 
it difficult to imagine any force operative between the sun and the planets across the cosmic 
void. 

Sir Isaac Newton (1642 - 1727), the greatest scientific genius of all time, was the first to 
successfully explain not only Kepler’s laws of motion but also myriads of other phenomena 
and problems of that time. He was one of the co-inventors of the calculus. He formulated 
the laws of motion, the law of gravitation, studied the motion of particles — both in free 
space and in presence of resistive media, just to name a few topics which now form the basis 
of classical mechanics. The book he wrote, in three volumes, The Philosophiae Naturalis 
Principia Mathematica (in short, Principia) was published by the Royal Society of London 
on July 5, 1687, on the insistence and kind patronage of Sir Edmond Halley, and is said to 
be a mark of the most original creativity ever produced by a single person in the history of 
mankind. 


1.5 NEWTON’S LAWS OF MOTION 

According to Newton, any change in the motion of an object, described with respect to a 
given frame of reference, is the result of the mutual interaction between the object and its 
environment. The central problem of mechanics is to understand and quantify the connection
between these interactions and the resulting motion. Regarding these interactions as
the cause, and the motion as the effect, one is to quantify the relation between cause and 
effect. It is natural to expect that the interactions causing the motion can be quantified in 
terms of the measurable physical properties of the body and its environment, e.g., mass, 
electric charge, magnetic dipole moment, etc. 




Newton gave a programme to attack this problem which comprises two steps: 

(i) A vector quantity called force (F) is regarded as the cause of change in the state of
motion of a body, or in other words, the vehicle of interaction between the moving object 
and its environment. The force acting on the body can cause acceleration (a) which is a 
vector quantity, like the force. (Vectors are briefly dealt with in appendix A2.) 

(ii) The forces acting on the body are calculated on the basis of the properties of the 
body and its environment, requiring determination of the appropriate force laws. 

Newton’s formulation of step (i) above forms the basis of his laws of motion. These laws 
of motion are valid in a class of reference frames called inertial frames. In fact, we can turn 
around and define the inertial frames to be those frames of reference in which Newton’s laws 
of motion are valid. Newton’s laws can be stated as follows: 

(i) Law of Inertia 

In an inertial frame, every free particle (that is, a particle not acted upon by a net external 
force) has a constant velocity. 

The original version of this law written in Latin was slightly different. When translated 
into English, it reads: ‘every body free of impressed forces either preserves a state of rest or 
continues in uniform rectilinear motion ad infinitum .’ 

In an inertial system a free particle undergoes equal displacements in equal intervals of 
time. This fact defines a time scale or a clock for inertial frames called inertial time scale. 

Motion of free particles in inertial frames will be in straight lines. For, if this motion 
were on a curve with non-vanishing curvature, the velocity of this free particle, which is a 
vector tangent to the path of the particle, would change with time, contradicting the first 
law. Thus, a path traced by a free particle in an inertial frame defines a straight line in that 
frame. 

Since a free particle covers an equal measure of space in equal measure of time, ad 
infinitum , it implies that along the straight line of the path, space is uniform or homogeneous, 
and so also time. Again, since the direction of the straight line path could have been any, 
it also implies that space is isotropic. So an inertial frame is also interpreted to stand for 
the homogeneity and isotropy of space, and homogeneity of time. 

Since free particles travel in straight lines, an ideal inertial frame would be the one that
has all the axes as straight lines. Only the oblique and rectangular Cartesian coordinate
systems satisfy this requirement. Other coordinate systems do not: a spherical polar
coordinate system, for example, has as coordinates the polar angle $\theta$ and the azimuthal
angle $\phi$, which cannot change without violating the rectilinear property of inertial motion.

We see that the first law requires the notion of a free particle, which depends on the 
definition of force given by the second law. 

(ii) Law of Causality 

If the total force exerted on a particle by other objects at any specified time is represented 
by a vector F , then 


$$\mathbf{F} = m\mathbf{a} = \frac{d\mathbf{p}}{dt} \tag{1.1}$$

where a = dv/dt is the acceleration of the particle at the given instant, m is the mass
of the particle, v is the velocity of the particle at that instant and p = mv is the linear
momentum. The vector quantity F is called force and Eq. (1.1) above, is taken to be its 
definition. This law is a complete law. 

Newton’s original version of the second law also reads somewhat differently: ‘The change
of motion $\Delta\mathbf{v}$ is proportional to the motive force $\Delta\mathbf{I}$ impressed, and is made in the
direction of the right line in which that force is impressed.’ Thus if we consider all these
changes to take place in time $\Delta t$, and take the constant of proportionality to be $1/m$, in
the limit of $\Delta t \to 0$, we get the usual form. It should be noted that vector notations
were formally introduced in physics by Willard Gibbs in 1901; Newton and many others
did not use vector quantities as such. Every time, they wrote out all the Cartesian or polar 
components explicitly, depending on the coordinate system that they were using. 

Through this law, the study of the motions of bodies became part of a new branch of 
science called dynamics. By itself it is not a verifiable law, to start with. For the first time 
it defined the notion of force through a directly measurable quantity called acceleration and 
another quantity called mass. Measurement of mass, or the quantity of matter, in a given 
body would have been extremely difficult, had we lived on a planet where uniformity of 
gravity could not be assumed as an approximation. Again, since Newton had to develop 
the science of mechanics from almost nothing other than his three laws of motion, he could 
not initially check the validity of his laws of motion. He derived the first universal law of 
force, that is, the inverse distance-square law of the gravitational force, from a combination 
of Kepler’s laws of planetary motion and his own three laws of motion. Having obtained 
a law of force, he could then have many situations where he would know the values of 
mass, acceleration and force, independent of his second law of motion, and therefore test 
its validity. Actually he had been working on these ideas since 1665 or so, but published 
with great confidence only after 22 years, when he became sure that these were the laws of 
nature.

The second law is a prescription for formulating the dynamical equations of motion in 
inertial frames. The first law has already defined what inertial frames are. They are rectangular
Cartesian frames in which a free particle either stays at rest or continues with uniform
rectilinear motion ad infinitum. 

It is now important to note that the force of gravitation is all pervading. It is called 
a body force, since this force acts at each point of the body. It is also called external or 
applied force for obvious reasons. However, in real life situations we very often come across 
various kinds of contact or surface forces, forces produced due to collision, or hindrance to 
natural motion. Since Newton’s second law demands the a priori knowledge of the total 
force that a body experiences, this total force must also include all the forces of reactions 
that it experiences; in the third law, Newton prescribes the general nature of the forces of 
reaction in relation to the forces of action. 

(iii) Law of Reciprocity

To the force exerted by every object on a particle, there corresponds an equal and opposite 
force exerted by the particle on that object. 

For two interacting particles, if $\mathbf{F}_{21}$ is the force exerted by the first particle on the second,
and $\mathbf{F}_{12}$ is the force exerted by the second particle on the first, we must have

$$\mathbf{F}_{12} = -\mathbf{F}_{21}$$

Using the second law, we have, then

$$\frac{d}{dt}(\mathbf{p}_1 + \mathbf{p}_2) = 0$$

where $\mathbf{p}_1$ and $\mathbf{p}_2$ are the linear momenta of the two particles 1 and 2 respectively. This
means that the total linear momentum $\mathbf{p}_1 + \mathbf{p}_2$ is a constant of motion. In other words,
the total linear momentum of any isolated pair of mutually interacting particles, expressed
as a vector sum of quantities $\mathbf{p}_1$ and $\mathbf{p}_2$, is conserved.
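
The conservation of total momentum for an interacting pair can be illustrated numerically. The following is a minimal sketch under stated assumptions: two particles on a line joined by a Hooke spring (a hypothetical setup chosen for simplicity), advanced with a simple explicit step. The function name, masses and initial values are illustrative only.

```python
def simulate_pair(m1, m2, x1, x2, v1, v2, k=1.0, dt=1e-3, steps=5000):
    """Two particles on a line joined by a Hooke spring (hypothetical setup).

    f12 is the force on particle 1 due to 2; the third law fixes f21 = -f12.
    Returns the list of total momenta p1 + p2 recorded after each step.
    """
    totals = []
    for _ in range(steps):
        f12 = -k * (x1 - x2)      # force on 1 exerted by 2
        f21 = -f12                # force on 2 exerted by 1 (third law)
        v1 += f12 / m1 * dt       # second law, dp/dt = F, for each particle
        v2 += f21 / m2 * dt
        x1 += v1 * dt
        x2 += v2 * dt
        totals.append(m1 * v1 + m2 * v2)
    return totals

totals = simulate_pair(m1=1.0, m2=3.0, x1=0.0, x2=1.0, v1=0.5, v2=-0.2)
print("spread in total momentum:", max(totals) - min(totals))
```

The individual momenta oscillate, but their sum stays at its initial value up to rounding error, whatever the step size, because the two impulses cancel at every step.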

If all the possible actions and reactions are found to be totally confined within a system, 
such a system is, by definition, called a closed system. By the third law, such a system 
must conserve the total linear momentum. Again, a closed system is not acted upon by any 
externally applied forces. Hence, by the first law, a closed system as a whole must act as 
an inertial frame. Thus, we can have even a time bomb at rest as a closed system which, 
on explosion, can disintegrate apart, but it would happen in such a way that its centre of 
mass still continues to remain stationary and the system as a whole can still be regarded as 
a closed system. However in the process we have released some potent chemical energy in 
the form of systemic motions of the splinters. 

The third law says that if there exists some action on some particle, then the rest of the 
universe, or the remaining part of the closed system under consideration must experience 
the reaction. So if you raise your hand, the rest of the universe is going to share its reaction. 
Therefore all possible motions in the universe are constantly getting modified due to some or 
other ongoing actions and reactions that take place here and there. But then how to begin 
or trigger an action? By nature a trigger is always a kind of spontaneous action , be it in the 
form of spontaneous decay of particle, or a nuclear or chemical reaction, always releasing 
some potent energy in the form of kinetic energy. Even today, trigger or spontaneous actions 
are still considered to be mysterious processes of nature. However, if this is the only ultimate 
process of generating motion, then the third law is the most fundamental law of nature. 

(iv) Law of Superposition 

The total force $\mathbf{F}$ due to several objects acting simultaneously on a particle is equal to the
vector sum of the forces $\mathbf{F}_k$ due to each object acting independently, that is,

$$\mathbf{F} = \sum_k \mathbf{F}_k \tag{1.2}$$

This is a ‘divide and conquer’ rule for solving mechanical problems involving complex
forces. There is no unique way of dividing the total $\mathbf{F}$ into a number of components. In
other words, for a given $\mathbf{F}$ there is, in general, an infinity of solutions of Eq. (1.2), though, of
course, all the $\mathbf{F}_k$'s cannot be mutually orthogonal. Newton did not write this law as a separate
one, but it is independent of the first three laws, and was first explicitly mentioned by Daniel
Bernoulli in 1738.

During the past three hundred years, Newton’s laws of motion have been critically examined
over and again, particularly to see whether all the three laws are independent or
not. The most widely argued point is that the first law is a special case of the second law, 
because as we put F = 0 in Eq. (1.1), it implies that the acceleration a is zero and 
therefore guarantees rectilinear motion with constant velocity. Why then had Newton put 
it as a separate law of nature? 

At the time of Newton, there was perhaps an intellectual tradition following the divine 
idea of the Trinity in the theological framework of Christianity, that any law must have three 
aspects or three components for its perfection and completeness. There were three laws of 
the falling bodies, due to Galileo, three laws of Kepler, for the motion of planets, three laws 
of motion and three components of the law of universal gravitation, due to Newton, three 
laws of motion of the moon, due to Cassini, and so on. People were so obsessed with the 
number three, a book had to have three volumes; for example, we have three volumes of 
Principia , three volumes of the work of Copernicus, and so on. Nevertheless, Newton would 
have had sufficient arguments in favour of the first law as a law independent of the other 
two. 

Basically, there exist four different pictures or viewpoints forwarded by different people 
at different times: 

(a) Gustav Kirchhoff’s picture (1876): The second law is simply a definition of force. The 
first law is a special case of the second law and therefore is not an independent law. The 
whole of Newtonian mechanics is regarded as an axiomatic formulation from the definition 
of force as given by the second law alone. 

(b) Isaac Newton’s picture (1687): Newton does not consider the first law derivable from 
the second law. The first law gives the phenomenological definition of an inertial frame. A 
reference frame, attached to a free particle, whose phenomenological behaviour would be to 
maintain a constant velocity vector for all time, is called an inertial frame. However, inside 
a freely falling lift, which is acted upon by an external force, all objects seem to behave 
as free particles. But Newton would then argue, since these particles do not continue their 
uniform rectilinear motion for ever, the freely falling lift cannot be regarded as an inertial
frame. The second law represents the behaviour of the real world with respect to an ‘inertial 
frame’. If the observed acceleration is not explained in terms of all the known real forces, 
or in other words the second law is found to be not valid, Newton would declare that the 
frame with respect to which accelerations are measured is noninertial. And he would not 
agree that his laws of motion are not universal. So he suggests that one should, along with 
the real forces, include some fictitious forces such as the centrifugal forces or the Coriolis 
forces and so on, depending upon the particular type of the noninertial behaviour, and use 
the second law for the noninertial frames also. The equations of motion (see below) can be 
solved and their detailed predictions can be tested by doing experiments. So the second law 
is a law of nature. 

So far as the limiting behaviour of the first law to the second is concerned, the limit is, in 
this case, asymptotic in nature. The asymptotic limits are something that a correct theory 
must satisfy, but the limits themselves must be provided from some other independent 
sources. For example, without the prior knowledge of Newtonian physics, Einstein’s field 
theory would not have anything to test under asymptotic limits. Once you have these
limits properly set, you have the theory to proceed with. In the case of Newton’s laws the
asymptotic limit of the second law is asserted by the statement of the first law. Once that 
is justified the first law reduces to the definition of inertial frames. In fact, according to 
Newton, the validity of all the three laws put together consistently defines the total concept 
of the inertial frames. 

(c) Bishop Berkeley (1710) and Ernst Mach’s (1883) picture: Here the assertion is that 
the second law is a law of nature and that the first law is a special case of the second law. 
They however make an additional postulate that the inertial frames are fixed or moving with 
uniform velocity with respect to the distant stars and galaxies in the universe. Without such 
a distant background reference in the sky, they argue that no one can have an idea of an 
inertial frame, let alone the idea of velocity and acceleration of a lone particle in the universe. 
If there is only one particle in the universe, the applied force is of course zero, acceleration is, 
strictly speaking, indeterminate, in which case the validity of the second law would demand 
that the mass of the lone particle in an otherwise empty universe be zero. So they insist 
that the measurement of true or inertial acceleration is possible only if there is a significant 
number of distant and heavy objects in the background. 

(d) Albert Einstein’s picture (1907): Newton’s laws of motion are valid in co-ordinate 
frames fixed or moving with constant velocity relative to a freely falling observer. The 
phenomenon of weightlessness allows one, in principle, to identify such a reference frame. 
So he argues that all the freely falling observers can serve as ‘inertial’ frames. In fact, the 
sun is falling freely in space under the action of the gravity of the Milky Way for ever — 
even the galaxies have been falling freely in the field of every other body in the universe 
since the big bang. In his general theory of relativity, the classical idea of gravitation as a 
force has been dispensed with. Thus, in Einstein’s view, the gravitational force joins the 
centrifugal and Coriolis force in the category of fictitious forces. 

All these four viewpoints are distinctly different but are all equally feasible. One has to 
stick to any one of the viewpoints and we choose Newton’s viewpoint for writing the rest of 
the book. 

In general, the force on a particle may vary as it moves in space, that is, if the particle’s
position is described by the vector function $\mathbf{r}(t)$ of time and its velocity by another vector
function $\mathbf{v}(t)$ of time, then the force on the particle in its successive positions is a (given)
vector function $\mathbf{F}(\mathbf{r}(t), \mathbf{v}(t), t)$. Assuming that the first and second order time derivatives of
$\mathbf{r}(t)$ exist, we have, for acceleration,

$$\mathbf{a}(t) = \ddot{\mathbf{r}}(t) \tag{1.3}$$

where the number of overhead dots denotes the order of total differentiation with respect
to time $t$. Substituting in Eq. (1.1), we get an ordinary second order differential equation
in $\mathbf{r}(t)$:

$$m\ddot{\mathbf{r}}(t) = \mathbf{F}(\mathbf{r}(t), \mathbf{v}(t), t) \tag{1.4}$$

If r is a vector in 3-D space, Eq. (1.4) stands for three equations, one in each of its 
rectangular Cartesian components x, y and z. Each of these requires the specification of 
two constants (initial conditions) for a complete solution. Thus, if the position and velocity
of the particle at any instant $t$ (say $t = 0$) are known, its subsequent motion can be
completely described, provided we can solve Eq. (1.4). Furthermore, if $\mathbf{F}$ does not depend
explicitly on $t$, that is, $\mathbf{F} = \mathbf{F}(\mathbf{r}(t), \mathbf{v}(t))$, then the vector Eq. (1.4) is necessarily invariant
under the change of the sign of $t$ so that Eq. (1.4) completely determines the motion of the
particle in the past ($t < 0$). Furthermore, if the sign of time can be actually reversed, the
particle will retrace the same trajectory (as it has traced for the positive direction of time)
in the opposite direction. Eq. (1.4) or any of its equivalents are called Newton’s equations
of motion.
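
When Eq. (1.4) cannot be solved in closed form, it can be integrated numerically from the six initial values. The sketch below is one possible approach, not a method prescribed by the text: a fixed-step fourth-order Runge-Kutta scheme with positions and velocities held as 3-tuples. The force `spring` is a hypothetical example that depends on position only; for such a force, reversing the velocity and integrating for the same duration should bring the particle back along its path.

```python
def rk4_step(force, m, r, v, t, dt):
    """Advance (r, v) by dt for m * r'' = force(r, v, t); r and v are 3-tuples."""
    def deriv(r, v, t):
        return v, tuple(f / m for f in force(r, v, t))

    def shift(x, dx, h):
        return tuple(xi + h * dxi for xi, dxi in zip(x, dx))

    k1r, k1v = deriv(r, v, t)
    k2r, k2v = deriv(shift(r, k1r, dt / 2), shift(v, k1v, dt / 2), t + dt / 2)
    k3r, k3v = deriv(shift(r, k2r, dt / 2), shift(v, k2v, dt / 2), t + dt / 2)
    k4r, k4v = deriv(shift(r, k3r, dt), shift(v, k3v, dt), t + dt)
    r_new = tuple(x + dt / 6 * (a + 2 * b + 2 * c + d)
                  for x, a, b, c, d in zip(r, k1r, k2r, k3r, k4r))
    v_new = tuple(x + dt / 6 * (a + 2 * b + 2 * c + d)
                  for x, a, b, c, d in zip(v, k1v, k2v, k3v, k4v))
    return r_new, v_new

def integrate(force, m, r0, v0, t_end, n_steps):
    """Integrate from t = 0 to t_end and return the final (r, v)."""
    r, v, dt = r0, v0, t_end / n_steps
    for i in range(n_steps):
        r, v = rk4_step(force, m, r, v, i * dt, dt)
    return r, v

def spring(r, v, t):
    """Illustrative force: linear restoring force, no explicit t or v dependence."""
    return tuple(-2.0 * x for x in r)

r0, v0 = (1.0, 0.0, 0.5), (0.0, 1.0, -0.3)
r1, v1 = integrate(spring, 1.0, r0, v0, 3.0, 3000)
# Reverse the velocity and run for the same duration: the path is retraced.
r2, v2 = integrate(spring, 1.0, r1, tuple(-x for x in v1), 3.0, 3000)
print("returned to:", r2, "with velocity", v2)
```

The final position agrees with `r0` and the final velocity with `-v0`, up to the small truncation error of the integrator.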


1.5.1 Some Examples of Force Laws and the Corresponding Motions 

From the mechanical standpoint, forces can be divided into two broad classes : (a) body forces 
which act on each point on the body (inside as well as on the surface) such as gravitation, 
and (b) surface or contact forces acting at the surface only, for example, pressure, tension, 
elastic forces at the contact, all reactions due to mutual contact between two bodies. We 
shall give here a few specific examples of force laws: 

(i) Hooke's Law (1675) 

Within a certain specified domain of space, the force F acting on a particle is attractive in 
nature and is linearly proportional to the displacement (r) from the position of equilibrium. 
That is, 

F = -kr 

where k is the constant of proportionality, called Hooke’s constant, and r is measured from 
the position of equilibrium. Following Newton’s programme, the equation of motion of any 
particle under the above force law is, 


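$$m\ddot{\mathbf{r}} = -k\mathbf{r} \tag{1.5}$$

the general solution being

$$\mathbf{r} = \mathbf{a}\cos\left(\sqrt{k/m}\,t\right) + \mathbf{b}\sin\left(\sqrt{k/m}\,t\right) \tag{1.6}$$

where $\mathbf{a}$ and $\mathbf{b}$ are the constants of integration, which can be related to the initial position
$\mathbf{r}_0$ and initial velocity $\mathbf{v}_0$ by

$$\mathbf{r}_0 = \mathbf{a} \quad \text{and} \quad \mathbf{v}_0 = \sqrt{k/m}\,\mathbf{b} \tag{1.7}$$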

The motion is, in general, elliptical with the centre of the ellipse at the position of equilibrium.
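
The solution (1.6) with the constants fixed by (1.7) can be checked directly. The sketch below evaluates the closed form and estimates the residual of Eq. (1.5) with a central finite difference; the function names, the step `h` and the sample values of `k`, `m`, `r0` and `v0` are illustrative assumptions.

```python
import math

def hooke_position(r0, v0, k, m, t):
    """Eq. (1.6) with a = r0 and b = v0 / sqrt(k/m), as given by Eq. (1.7)."""
    w = math.sqrt(k / m)
    return tuple(x * math.cos(w * t) + (u / w) * math.sin(w * t)
                 for x, u in zip(r0, v0))

def residual(r0, v0, k, m, t, h=1e-4):
    """Largest component of m*r'' + k*r, with r'' from a central difference."""
    rm = hooke_position(r0, v0, k, m, t - h)
    rc = hooke_position(r0, v0, k, m, t)
    rp = hooke_position(r0, v0, k, m, t + h)
    return max(abs(m * (p - 2 * c + q) / h**2 + k * c)
               for p, c, q in zip(rp, rc, rm))

r0, v0, k, m = (1.0, 0.0, 0.0), (0.0, 0.5, 0.0), 4.0, 1.0
period = 2 * math.pi / math.sqrt(k / m)
print("residual of Eq. (1.5):", residual(r0, v0, k, m, 0.7))
print("position after one period:", hooke_position(r0, v0, k, m, period))
```

The residual is zero to within the finite-difference error, and the particle returns to its starting point after one period, as expected for a closed elliptical orbit centred on the equilibrium position.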


(ii) Newton's Law of Gravitation (1687) 

All particles in the universe attract all other particles along the line joining the two mass 
centres with a force directly proportional to the product of their masses ($m_1$, $m_2$, say) and
inversely proportional to the square of the distance between them, that is,

$$\mathbf{F}_{12} = -\frac{G m_1 m_2}{r_{12}^3}\,\mathbf{r}_{12} = -\mathbf{F}_{21} \tag{1.8}$$

where $\mathbf{r}_{12} = \mathbf{r}_1 - \mathbf{r}_2$, $r_{12} = |\mathbf{r}_{12}|$, $\mathbf{F}_{12}$ is the force on particle number 1 (having mass $m_1$) exerted by
the particle number 2 (having mass $m_2$) and $G$ is the universal constant of gravitation. The
general motion under this law has been discussed in chapter 4. A similar inverse square law 
but with a provision for both attraction and repulsion holds good for any two static electric 
charges or two static magnetic poles, given by the French physicist Charles Coulomb (1784). 
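
The gravitational force law of Eq. (1.8) translates directly into a small function. This is a sketch under stated assumptions: positions are 3-tuples in metres, masses are in kilograms, the value of G is a rounded approximation, and the function name and sample values are illustrative.

```python
import math

G = 6.674e-11  # N m^2 kg^-2, approximate value of the gravitational constant

def gravity_force(m1, m2, r1, r2):
    """Force on particle 1 exerted by particle 2, following Eq. (1.8)."""
    r12 = tuple(a - b for a, b in zip(r1, r2))
    dist = math.sqrt(sum(c * c for c in r12))
    return tuple(-G * m1 * m2 * c / dist**3 for c in r12)

# Illustrative values: particle 2 sits 5 m away from particle 1.
f12 = gravity_force(5.0, 7.0, (0.0, 0.0, 0.0), (3.0, 4.0, 0.0))
f21 = gravity_force(7.0, 5.0, (3.0, 4.0, 0.0), (0.0, 0.0, 0.0))
print("force on 1 due to 2:", f12)
print("force on 2 due to 1:", f21)
```

The two forces are equal and opposite, in agreement with the law of reciprocity, and the force on particle 1 points towards particle 2.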

(iii) Lorentz’s Law of Electromagnetic Force (1891)

Any charged particle having mass m and electric charge e moving under the action of an 
electric field E(r, t) and a magnetic field of induction B(r, t) experiences a force F given
by 

F = e(E + v x B) (1.9) 

This is essentially a two-component force, the first term representing the electric force and 
the second term the magnetic force. The latter acts perpendicular to the direction of the 
instantaneous motion of the particle. For constant E and B, the equation of motion is 

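$$\ddot{\mathbf{r}} = (e/m)(\mathbf{E} + \mathbf{v} \times \mathbf{B})$$

whose exact solution has been given, for example, by F. R. Gantmakher (1960) as follows:

$$\begin{aligned}
\mathbf{r}(t) = {} & \mathbf{r}_0 + \mathbf{v}_0 t + \frac{1}{2}\mathbf{g}t^2 + \left(\frac{\cos\omega t - 1}{\omega^2}\right)(\boldsymbol{\omega} \times \mathbf{v}_0) \\
& + \left(\frac{\cos\omega t - 1 + \omega^2 t^2/2}{\omega^4}\right)\left(\boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{g})\right) \\
& - \left(\frac{\omega t - \sin\omega t}{\omega^3}\right)\left[\boldsymbol{\omega} \times \mathbf{g} - \boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{v}_0)\right]
\end{aligned} \tag{1.10}$$

where $\mathbf{r}_0$ and $\mathbf{v}_0$ are the initial position and velocity of the particle, $\mathbf{g} = e\mathbf{E}/m$ is a
constant, and $\boldsymbol{\omega} = (e/m)\mathbf{B}$ is the cyclotron frequency vector, which is again a constant. This
path represents a spiralling orbit, the guiding centre of which describes a parabola due to
the action of the constant $\mathbf{E}$ field.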
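
The closed-form solution (1.10) can be checked against a direct numerical integration of the
equation of motion for constant fields. The sketch below is only an illustration: the helper
names, the Runge-Kutta integrator and the sample values of g and of the cyclotron vector (called
w in the code) are assumptions, not part of the original treatment.

```python
import math

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(*vs):
    return tuple(sum(c) for c in zip(*vs))

def scale(s, a):
    return tuple(s * x for x in a)

def closed_form(t, r0, v0, g, w):
    """Eq. (1.10): position at time t for constant g = eE/m and w = (e/m)B."""
    wm = math.sqrt(sum(x * x for x in w))
    c1 = (math.cos(wm * t) - 1.0) / wm**2
    c2 = (math.cos(wm * t) - 1.0 + 0.5 * (wm * t)**2) / wm**4
    c3 = (wm * t - math.sin(wm * t)) / wm**3
    wxv, wxg = cross(w, v0), cross(w, g)
    bracket = add(wxg, scale(-1.0, cross(w, wxv)))
    return add(r0, scale(t, v0), scale(0.5 * t * t, g),
               scale(c1, wxv), scale(c2, cross(w, wxg)), scale(-c3, bracket))

def rk4(t_end, r0, v0, g, w, steps=5000):
    """Integrate r'' = g + v x w with the classical fourth-order Runge-Kutta rule."""
    def deriv(state):
        _, v = state
        return (v, add(g, cross(v, w)))

    def shift(state, k, h):
        return (add(state[0], scale(h, k[0])), add(state[1], scale(h, k[1])))

    h, state = t_end / steps, (r0, v0)
    for _ in range(steps):
        k1 = deriv(state)
        k2 = deriv(shift(state, k1, h / 2))
        k3 = deriv(shift(state, k2, h / 2))
        k4 = deriv(shift(state, k3, h))
        # state += h/6 * (k1 + 2 k2 + 2 k3 + k4)
        for k, weight in ((k1, 1), (k2, 2), (k3, 2), (k4, 1)):
            state = shift(state, k, weight * h / 6)
    return state[0]

# Assumed sample values, chosen only for illustration.
r0, v0 = (0.0, 0.0, 0.0), (1.0, 0.5, 0.2)
g, w = (0.5, 0.0, 0.3), (0.0, 0.0, 2.0)
print(closed_form(3.0, r0, v0, g, w))
print(rk4(3.0, r0, v0, g, w))
```

The two printed positions should agree to many decimal places; a disagreement would point to a
sign or coefficient error in the transcription of the formula.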

(iv) Law of Constant Force (Torricelli’s ballistics, 1640) 

Near the surface of the earth, the gravitational force of the earth on a given particle is taken
to be approximately constant, say $\mathbf{F} = m\mathbf{g}$, where $\mathbf{g}$ is a constant. The equation of motion
is given by $\ddot{\mathbf{r}} = \mathbf{g} = \text{constant}$, with a solution

$$\mathbf{r} = \mathbf{r}_0 + \mathbf{v}_0 t + \frac{1}{2}\mathbf{g}t^2$$

representing a parabolic trajectory, in general.

(v) Stokes’ Law of Viscous Drag (1850) 

The drag force acting on a homogeneous sphere moving inside a viscous fluid with very low
velocities is given by $\mathbf{F} = -6\pi\eta R\mathbf{v}$, where $\eta$ is the coefficient of viscosity of the fluid, $R$
is the radius of the sphere, and $\mathbf{v}$ its velocity. This is a linear drag force, also applicable to
the slow motion of dust particles and aerosols in air, or to the slow motion of electrons in a
conductor. In the presence of uniform gravity $\mathbf{g}$, the equation of motion becomes

$$\ddot{\mathbf{r}} = -\lambda\dot{\mathbf{r}} + \mathbf{g} \tag{1.11}$$

where the drag part of the force (per unit mass) is $-\lambda\mathbf{v}$, $\lambda$ being a constant (for the sphere
above, $\lambda = 6\pi\eta R/m$). The general solution is given by

$$\mathbf{r} = \mathbf{r}_0 + \left(\frac{1 - e^{-\lambda t}}{\lambda}\right)\mathbf{v}_0 + \left(\frac{e^{-\lambda t} + \lambda t - 1}{\lambda^2}\right)\mathbf{g} \tag{1.12}$$
However, for moderate and high speed motion in air, a quadratic drag law is applicable, for
which

$$\mathbf{F} = -C_D\,\rho A v\,\mathbf{v}$$

$C_D$ ($\sim 0.5$) being the drag coefficient, $\rho$ the mass density of the fluid medium and $A$ the
cross-sectional area of the body. Finding an exact solution for a ballistic missile moving
under constant gravity and a quadratic drag force is a formidable task.

In fact many kinds of drag forces are possible. Drag forces proportional to $v$, $v^2$,
and $av + bv^2$ were considered by Newton himself. Later on, many others tried various
other forms and quickly exhausted the list of all those for which the motion was analytically
integrable.
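
Since the quadratic drag problem has no simple closed-form solution, it is usually handled
numerically. The following sketch integrates the planar equation of motion under uniform gravity
and a drag of the quadratic form quoted above, using a fixed-step Runge-Kutta rule. The function
name, the lumped constant c (standing for the drag coefficient times fluid density times area,
divided by the mass) and the sample numbers are assumptions made for illustration only.

```python
import math

def projectile_range(v0, angle_deg, c, g=9.81, dt=1e-3):
    """Horizontal range for r'' = g - c*|v|*v, launched from y = 0.

    c stands for C_D*rho*A/m (units 1/m); c = 0 recovers the drag-free parabola.
    """
    def deriv(s):
        _, _, vx, vy = s
        speed = math.hypot(vx, vy)
        return (vx, vy, -c * speed * vx, -g - c * speed * vy)

    def nudge(s, k, h):
        return tuple(a + h * b for a, b in zip(s, k))

    th = math.radians(angle_deg)
    s = (0.0, 0.0, v0 * math.cos(th), v0 * math.sin(th))
    while True:
        # One classical Runge-Kutta (RK4) step of size dt.
        k1 = deriv(s)
        k2 = deriv(nudge(s, k1, dt / 2))
        k3 = deriv(nudge(s, k2, dt / 2))
        k4 = deriv(nudge(s, k3, dt))
        new = tuple(a + dt / 6 * (p + 2 * q + 2 * r + w)
                    for a, p, q, r, w in zip(s, k1, k2, k3, k4))
        if new[1] < 0.0:
            # Linear interpolation between the last two points to locate y = 0.
            frac = s[1] / (s[1] - new[1])
            return s[0] + frac * (new[0] - s[0])
        s = new

print(projectile_range(30.0, 45.0, 0.0))    # no drag: close to v0^2 sin(2 theta) / g
print(projectile_range(30.0, 45.0, 0.01))   # quadratic drag shortens the range
```

With c = 0 the routine reproduces the parabolic range of the constant force law, which gives a
simple consistency check before any drag is switched on.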

(vi) Coulomb’s Law of Friction (1779) 

The maximum static force of friction $\mathbf{F}$, developed at the surface of contact between two
bodies, due to a normal force of reaction $\mathbf{N}$, that can prevent any motion along the boundary
of the two surfaces in contact, is given by

$$|\mathbf{F}| = \mu_s|\mathbf{N}|$$

where $\mu_s$ is a constant, called the coefficient of static friction between the two given surfaces.
The force of friction acts in a direction opposite to that of the applied force. This law was
originally proposed by G. Amontons in 1699.

If the two surfaces begin to slide, the same law applies, but the coefficient of sliding
friction $\mu_k$ is slightly lower than $\mu_s$. However, more recent experiments suggest a
small departure from Coulomb’s law, and a better empirical law, obeyed by a large variety
of rough surfaces, seems to follow the equation

$$F = K' N^{0.91}$$

where the constant $K'$ depends on the nature of the two surfaces. When the forces are
expressed in SI units, the value of $K'$ is found to be 0.24 for glass sliding on wood, 0.35
for metal on wood or metal on metal, 0.49 for wood on mica, 0.58 for glass on glass, and
so on. Rolling friction is also assumed to follow a law quite similar to Coulomb’s law, the
coefficient, generally, being smaller than that for sliding.


1.5.2 An Extension of Newton’s Second Law to a System of Particles 

Consider a system of $N$ particles having masses $m_1, m_2, \ldots, m_N$, positions $\mathbf{r}_1(t), \ldots, \mathbf{r}_N(t)$
at any instant $t$ and moving under forces $\mathbf{f}_1, \mathbf{f}_2, \ldots, \mathbf{f}_N$ externally applied on them, along
with their mutual interaction forces $\mathbf{f}_{ij}$ (= the force on the $i$th particle produced by the
$j$th particle). Newton’s programme can now be slightly modified so that the ensemble of
these $N$ particles may dynamically behave as an aggregate of total mass

$$M = \sum_{i=1}^{N} m_i$$

One defines the centre of mass of such a system to be located at $\mathbf{R}$, through the relation

$$M\mathbf{R} = \sum_{i=1}^{N} m_i\mathbf{r}_i$$

The velocity $\mathbf{V}$ of the centre of mass is similarly defined through

$$M\mathbf{V} = \sum_{i=1}^{N} m_i\mathbf{v}_i$$

Now it is very simple to see that

$$M\ddot{\mathbf{R}} = \sum_{i=1}^{N} m_i\ddot{\mathbf{r}}_i = \mathbf{F}$$

where $\mathbf{F}$ is the total force, which is the vector sum of all the forces that are experienced by
the individual particles. However, by the third law, the internal forces satisfy $\mathbf{f}_{ij} = -\mathbf{f}_{ji}$,
so that all $\mathbf{f}_{ij}$ cancel pairwise. Thus, $\mathbf{F}$ reduces to the vector sum of the external forces
alone, and no knowledge of the internal forces is needed. Therefore, with
the quantities $M$, $\mathbf{R}$, $\mathbf{V}$ and $\mathbf{F}$ defined as above, the dynamical behaviour of the aggregate
can be represented by a single vector equation

$$M\ddot{\mathbf{R}} = \mathbf{F} = \sum_{i=1}^{N}\mathbf{f}_i + \sum_{i=1}^{N}\sum_{j \neq i}\mathbf{f}_{ij} \tag{1.13}$$

(in which the double sum vanishes), replacing the $N$ vector equations given by

$$m_i\ddot{\mathbf{r}}_i = \mathbf{f}_i + \sum_{j \neq i}\mathbf{f}_{ij} \quad (i \text{ not summed, } j \text{ summed})$$

Obviously it is easier to solve one (vector) differential equation than $N$ such equations,
but of course, at the cost of the details of the motion of individual particles. This is an
important result. We can successfully talk about the motion of a body as a whole without
requiring any knowledge of the internal forces. So the earth or a ball or a piece of stone or
a molecule can be treated as a particle in its own right, as guaranteed by Eq. (1.13), whenever
structural details are not required.

Now in the absence of any external force $\mathbf{F}$, $\mathbf{V}$ becomes a constant of motion and

$$\mathbf{R} = \mathbf{R}_0 + \mathbf{V}t$$

that is, the centre of mass moves with a constant velocity $\mathbf{V}$.
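
The centre-of-mass result can be illustrated numerically. The sketch below integrates the N
coupled equations of motion for three particles with pairwise internal forces obeying the third
law and constant external forces, and compares the centre of mass with the motion of a single
particle of mass M under the total external force. The spring-like form chosen for the internal
forces, the velocity-Verlet integrator, the function names and all numbers are assumptions made
only for this example.

```python
def accelerations(m, r, f_ext, k):
    """a_i = (f_i + sum over j != i of f_ij) / m_i, with f_ij = -k (r_i - r_j)."""
    acc = []
    for i, (mi, ri) in enumerate(zip(m, r)):
        fx, fy = f_ext[i]
        for j, rj in enumerate(r):
            if j != i:
                fx -= k * (ri[0] - rj[0])
                fy -= k * (ri[1] - rj[1])
        acc.append((fx / mi, fy / mi))
    return acc

def centre_of_mass(m, r):
    """Mass-weighted average, as in M R = sum of m_i r_i."""
    M = sum(m)
    return tuple(sum(mi * ri[c] for mi, ri in zip(m, r)) / M for c in range(2))

def simulate(m, r, v, f_ext, k, t_end, steps=2000):
    """Velocity-Verlet integration of the N coupled equations of motion."""
    dt = t_end / steps
    a = accelerations(m, r, f_ext, k)
    for _ in range(steps):
        r = [tuple(x + u * dt + 0.5 * w * dt * dt for x, u, w in zip(ri, vi, ai))
             for ri, vi, ai in zip(r, v, a)]
        a_new = accelerations(m, r, f_ext, k)
        v = [tuple(u + 0.5 * (w + z) * dt for u, w, z in zip(vi, ai, bi))
             for vi, ai, bi in zip(v, a, a_new)]
        a = a_new
    return r, v

def predicted_com(m, r, v, f_ext, t):
    """R(t) = R0 + V0 t + (F / 2M) t^2 for a constant total external force F."""
    M = sum(m)
    R0, V0 = centre_of_mass(m, r), centre_of_mass(m, v)
    F = tuple(sum(f[c] for f in f_ext) for c in range(2))
    return tuple(R0[c] + V0[c] * t + 0.5 * F[c] / M * t * t for c in range(2))

# Assumed sample system: three particles in a plane.
m = [1.0, 2.0, 3.0]
r = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
v = [(0.0, 1.0), (-1.0, 0.0), (0.5, 0.5)]
f_ext = [(0.3, -1.0), (0.0, -2.0), (-0.1, -3.0)]
r_end, _ = simulate(m, r, v, f_ext, k=5.0, t_end=2.0)
print(centre_of_mass(m, r_end))
print(predicted_com(m, r, v, f_ext, 2.0))
```

The individual particles oscillate about one another, yet the two printed centre-of-mass
positions coincide, whatever value is chosen for the internal coupling k.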

One further defines the moment of momentum $\mathbf{L}$ of the system about the origin (of any
given inertial frame) by

$$\mathbf{L} = \sum_{i=1}^{N} m_i\mathbf{r}_i \times \mathbf{v}_i \tag{1.14}$$

and the torque $\boldsymbol{\Gamma}$ by the total moment of the force about the origin, given by

$$\boldsymbol{\Gamma} = \sum_{i=1}^{N}\mathbf{r}_i \times \Big(\mathbf{f}_i + \sum_{j \neq i}\mathbf{f}_{ij}\Big) = \sum_{i=1}^{N}\mathbf{r}_i \times \mathbf{f}_i + \sum_{i=1}^{N}\sum_{j > i}(\mathbf{r}_i - \mathbf{r}_j) \times \mathbf{f}_{ij} \tag{1.15}$$

It is now easy to see that

$$\boldsymbol{\Gamma} = \sum_{i=1}^{N} m_i\mathbf{r}_i \times \ddot{\mathbf{r}}_i = \frac{d}{dt}\Big(\sum_{i=1}^{N} m_i\mathbf{r}_i \times \dot{\mathbf{r}}_i\Big) = \frac{d\mathbf{L}}{dt} \tag{1.16}$$

This is called the torque-angular momentum relationship for a system of particles.

Equation (1.16) is not an independent law of motion as it is derived from Newton’s laws
of motion. Like Eq. (1.13), the validity of Eq. (1.16) is also independent of the internal
forces, even though there is no guarantee that internal forces that are non-central in character
would not have a non-zero contribution to the measured torque given by Eq. (1.15). Thus,
the basis of profound claims such as the connections of homogeneity and isotropy of space
to the conservations of linear and angular momenta of closed systems (namely, the value of
$\mathbf{V}$ being independent of the choice of the origin, and the value of $\mathbf{L}$ being independent of
rotation of the combined set of vectors $\mathbf{R}$ and $\mathbf{P}$ in their plane, an inherent property of the
vector product, by its definition) should be taken with caution (see p. 80). In the latter
case, because of the explicit dependence on $\mathbf{R}$, the translational symmetry is lost.


1.5.3 Work, Power, and Kinetic and Potential Energies 


If the point of application of a force $\mathbf{F}$, which is in general a function of $\mathbf{r}$, $\mathbf{v}$ and $t$, that is,
$\mathbf{F} = \mathbf{F}(\mathbf{r}, \mathbf{v}, t)$, is displaced by an infinitesimal amount $d\mathbf{r}$, an infinitesimal amount of work
$dW$ done by the force is defined by the scalar product

$$dW = \mathbf{F} \cdot d\mathbf{r} \tag{1.17}$$

For any finite displacement from $\mathbf{r} = \mathbf{r}_1$ to, say, $\mathbf{r} = \mathbf{r}_2$, a finite amount of work is
done and is given by integrating Eq. (1.17) over the given path connecting $\mathbf{r}_1$ and $\mathbf{r}_2$ as
its end points. Unless $\mathbf{F}$ is uniform over the path, $W = \mathbf{F} \cdot \mathbf{r}$ is as incorrect as $r = vt$,
or $v = at$. (But $\mathbf{F}$ can be assumed to be uniform only over an infinitesimal displacement
$d\mathbf{r}$. Had we strictly defined $W = \mathbf{F} \cdot \mathbf{r}$, it would have meant $dW = \mathbf{F} \cdot d\mathbf{r} + d\mathbf{F} \cdot \mathbf{r}$, which
is nonsense. Similarly, the mass element must be defined as $dm = \rho\, d^3r$.)
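
The definition of finite work as a path integral of Eq. (1.17) can be made concrete with a short
numerical sketch. It sums F . dr over small segments of two different paths joining the same end
points, once for a Hooke-type force and once for a force that is not derivable from a potential.
The function names, the two sample force fields and the two paths are assumptions chosen only for
illustration.

```python
def work(force, path, n=2000):
    """Approximate W = integral of F . dr along path(s), 0 <= s <= 1.

    The path is cut into n straight segments and F is sampled at each midpoint.
    """
    total = 0.0
    x0, y0 = path(0.0)
    for i in range(1, n + 1):
        x1, y1 = path(i / n)
        fx, fy = force(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += fx * (x1 - x0) + fy * (y1 - y0)
        x0, y0 = x1, y1
    return total

def straight(s):
    return (s, s)            # from (0, 0) to (1, 1) along a straight line

def curved(s):
    return (s, s * s)        # same end points along a parabola

def hooke(x, y):
    return (-x, -y)          # F = -k r with k = 1

def swirl(x, y):
    return (-y, x)           # a force with no potential energy function

print(work(hooke, straight), work(hooke, curved))   # equal: path independent
print(work(swirl, straight), work(swirl, curved))   # different: path dependent
```

For the Hooke-type force both paths give W = -1, whereas the second force gives 0 along the
straight line and 1/3 along the parabola, so the work is not fixed by the end points alone.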


The power $P$ attained at any instant by the agency that produces the above force $\mathbf{F}$ is
defined by

$$P = \frac{dW}{dt} = \mathbf{F} \cdot \mathbf{v} \tag{1.18}$$

Applying Newton’s second law and assuming a constant mass,

$$P = m\frac{d\mathbf{v}}{dt} \cdot \mathbf{v} = \frac{d}{dt}\left(\frac{1}{2}m\mathbf{v} \cdot \mathbf{v}\right) = \frac{dT}{dt} \tag{1.19}$$

through which the kinetic energy is defined as

$$T = \frac{1}{2}mv^2 \tag{1.20}$$

where $m$ is the mass of the particle on which the force $\mathbf{F}$ is applied. The quantity $mv^2$
used to be called vis viva by Leibniz (1695). Later on someone coined the term ‘kinetic
energy’. The definition given by Eq. (1.20) is, however, taken to be true irrespective of
whether $m$ is a constant or not.

For any arbitrary force function $\mathbf{F} = \mathbf{F}(\mathbf{r}, \mathbf{v}, t)$, $dW$ in Eq. (1.17) cannot be a perfect
differential, and $W_{12}$ is a function not only of the end points $\mathbf{r}_1$, $\mathbf{r}_2$ but also of the path
chosen to connect $\mathbf{r}_1$ and $\mathbf{r}_2$. Now suppose $\mathbf{F}$ is independent of $\mathbf{v}$; then there can exist a
potential energy function $V(\mathbf{r}, t)$ such that

$$\mathbf{F}(\mathbf{r}, t) = -\nabla V(\mathbf{r}, t) \tag{1.21}$$

in which case Eq. (1.17) becomes

$$dW = -\left(\frac{dV}{dt} - \frac{\partial V}{\partial t}\right)dt = -dV + \frac{\partial V}{\partial t}\,dt \tag{1.22}$$

So, the condition for $\mathbf{F} \cdot d\mathbf{r}$ to become a perfect differential is that $\mathbf{F}$ should be independent
of both $\mathbf{v}$ and $t$, and the corresponding potential energy function given by $V(\mathbf{r})$ is called
a conservative potential energy function. From Eqs (1.19) and (1.22) with $\partial V/\partial t = 0$, one
finds

$$\frac{d}{dt}(T + V) = 0 \tag{1.23}$$

or, in other words, the sum of the kinetic energy and the conservative potential energy,
called by definition the total energy of the system,

$$E = T + V \tag{1.24}$$

is a constant of motion. Galileo had noticed that the speed of a particle on an inclined
plane starting from rest depends only on the vertical height through which it has descended.
The principle of the conservation of total energy has, since then, been gradually
developed through the works of Huygens, Newton, Bernoulli(s) and Lagrange. Its link with the
symmetry with respect to time translation is apparent from the requirement of $\partial V/\partial t = 0$.
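
A brief numerical sketch can make the conservation statement tangible. It integrates a
one-dimensional Hooke oscillator and records E = T + V at the start and at the end, first with the
conservative force alone and then with an added velocity-dependent drag, for which no conservative
potential exists. The function name, the Runge-Kutta integrator and the parameter values are
assumptions made only for this illustration.

```python
def oscillator_energy(k, m, lam, x0, v0, t_end, steps=20000):
    """Integrate m x'' = -k x - m*lam*x' by RK4; return E = T + V at start and end.

    lam = 0 gives the conservative Hooke force; lam > 0 adds a velocity-dependent drag.
    """
    def deriv(x, v):
        return v, -(k / m) * x - lam * v

    def energy(x, v):
        return 0.5 * m * v * v + 0.5 * k * x * x   # T + V with V = k x^2 / 2

    h, x, v = t_end / steps, x0, v0
    e_start = energy(x, v)
    for _ in range(steps):
        k1x, k1v = deriv(x, v)
        k2x, k2v = deriv(x + 0.5 * h * k1x, v + 0.5 * h * k1v)
        k3x, k3v = deriv(x + 0.5 * h * k2x, v + 0.5 * h * k2v)
        k4x, k4v = deriv(x + h * k3x, v + h * k3v)
        x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
    return e_start, energy(x, v)

print(oscillator_energy(4.0, 1.0, 0.0, 1.0, 0.0, 10.0))   # E unchanged
print(oscillator_energy(4.0, 1.0, 0.1, 1.0, 0.0, 10.0))   # E decreases
```

In the first case the two energies agree to within the integration error; in the second the
velocity-dependent force drains energy, so T + V is no longer a constant of motion.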

The zero of the potential energy function can, however, be chosen arbitrarily. If the zero
of any conservative potential energy is chosen to be at $\mathbf{r} = \mathbf{r}_0$, then

$$V(\mathbf{r}) = -\int_{\mathbf{r}_0}^{\mathbf{r}} \mathbf{F}(\mathbf{r}) \cdot d\mathbf{r} \tag{1.25}$$

For the inverse square law of gravitational fields, the tip of $\mathbf{r}_0$ is usually chosen to be at $\infty$,
but for Hooke’s type of force field, at the origin. The idea of the potential function was
first introduced by Lagrange in 1773 and the term ‘potential’ is due to Green (1828).


1.5.4 Equations of Motions for Variable Mass 


Since Newton’s second law is given by

$$\frac{d\mathbf{p}}{dt} = \mathbf{F}_{\text{ext}}$$

where $\mathbf{p} = m\mathbf{v}$, these two can be combined to give

$$m\frac{d\mathbf{v}}{dt} + \mathbf{v}\frac{dm}{dt} = \mathbf{F}_{\text{ext}} \tag{1.26}$$

If the mass of the system under consideration is changing with time, $dm/dt \neq 0$, Eq.
(1.26) is the equation of motion of such a system.

However, for the motion of a rocket of mass $m(t)$ that burns fuel and ejects the burnt
gas with relative velocity $\mathbf{u}$ and at a rate $-dm/dt$, the equation of motion of the rocket
is simply given by

$$m\frac{d\mathbf{v}}{dt} = \mathbf{F}_{\text{ext}} + \mathbf{u}\frac{dm}{dt} \tag{1.27}$$
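
As a hedged illustration of the rocket equation, the sketch below takes it in the standard form
m dv/dt = F + u dm/dt (with u the velocity of the ejected gas relative to the rocket) and
integrates it in one dimension for a constant burn rate. The result is compared with the
well-known logarithmic velocity gain, a standard result that is not derived in the text. The
function name, the midpoint integration rule and every numerical value are assumptions made for
this example.

```python
import math

def burn(m0, m1, u_e, rate, g=0.0, steps=6000):
    """Speed gained by a rocket moving along +x while its mass drops from m0 to m1.

    The exhaust leaves with speed u_e relative to the rocket (u = -u_e along x),
    mass is lost at the constant rate -dm/dt = rate, and F = -m*g is an optional
    uniform external force. Uses the midpoint rule in time.
    """
    dt = (m0 - m1) / rate / steps
    m, v = m0, 0.0
    for _ in range(steps):
        m_mid = m - 0.5 * rate * dt          # mass at the middle of the step
        v += (u_e * rate / m_mid - g) * dt   # dv/dt = (F + u dm/dt) / m
        m -= rate * dt
    return v

dv = burn(1000.0, 400.0, 2500.0, 10.0)
print(dv, 2500.0 * math.log(1000.0 / 400.0))   # numerical vs u_e * ln(m0 / m1)
```

With no external force the velocity gain depends only on the exhaust speed and the mass ratio,
not on how fast the fuel is burnt; a uniform retarding force g simply subtracts g times the burn
time.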


1.6 LIMITATIONS OF NEWTON’S PROGRAMME 

The direct application of Newton’s laws in solving problems of mechanics has many
limitations.


1. Newton’s laws are valid only in inertial frames, which are by definition
rectangular Cartesian-like. Therefore, the equations of motion have to be set up and solved
in rectangular Cartesian-like coordinates. The equations of motion would then look like
$m\ddot{x}_i = F_i$, $i = 1, 2, 3$, where $F_i$ is the $i$th component of the external force applied
on a particle of mass $m$. Now suppose we want to write down the equations of motion in
spherical polar coordinates, which have got the polar axis as the third axis of the Cartesian
system. We may intuitively imagine that the radial coordinate $r$ measuring distance in the
radial direction might be as good as any one of the Cartesian axes, and be tempted to write
the equation of motion in the radial coordinate as $m\ddot{r} = F_r$, where $F_r$ is the component
of the external force in the radial direction. But obviously this equation cannot be right;
for if it were, a planet could never, in principle, revolve around the sun in a circular orbit,
in which case $F_r \neq 0$ will imply $\ddot{r} \neq 0$, $\dot{r} \neq 0$ and hence the value of $r$ must change
with time. So Newton’s second law is not valid even for the $r$ coordinate of a spherical
polar coordinate system. If we now force it to be valid, we have no other choice than to
invent and add an imaginary force term to the real external force term in order to totally
nullify the latter, thus justifying a constant value of $r$. Such imaginary forces, which have to
be incorporated because of the noninertial nature of the coordinate frames used, are called
pseudoforces or inertial forces.

However, noninertial frames cannot be avoided in many of the most important applications
of mechanics. For example, a reference frame attached to the earth rotates with it around
its axis of rotation and is hence noninertial. The motion of any object with respect to the
earth is, strictly speaking, a motion in a noninertial frame. This situation is handled by
establishing a connection between the noninertial and any one inertial frame. As a result,
some inertial force terms do appear in the transformed version of Newton’s equations of
motion in the noninertial frames. Pseudoforces cannot be associated with the interactions
of the object and its environment as the real forces can be. Real forces never change as
a result of transformation of reference frames. Examples of pseudoforces are centrifugal
force, Coriolis force, etc. We shall deal with this problem in the chapter on rotating frames.
Extended bodies, being rigid or elastic or fluid in nature, can move in any manner allowing
very complicated rotations in them. Newton’s second law of motion has to be modified for
such systems. The last three chapters are devoted to the dynamics of rigid, elastic and fluid
bodies.

2. The most inconvenient aspect of the use of Newton’s laws of motion is that they are
restricted to Cartesian frames and that they deal with the dynamical problems in terms of
the forces only. In the subsequent chapters we shall present treatments that do not refer
to forces at all, and in which coordinates that are more natural (to the given situation) than
the Cartesian ones can be chosen. In these formulations, the force function is replaced by a
suitable potential function. Chapter 2 deals with such a formulation.

3. As we have seen, each of the three Eqs (1.4), being a second order differential equation, 
always requires two initial values to be specified. At each point of the orbit the differential 
equations tell us how the state changes differentially over an infinitesimally small interval 
of time. Since the classical world is totally deterministic, it allows us to predict the dynamical
evolution of a system both by differential and integral techniques. Such integral and
variational techniques seem to be far more general than the differential ones. Chapter 6 has 
been devoted to the development of Hamilton’s principle and the variational approach to 
classical mechanics. 

4. The Newtonian definition of linear momentum, that is, p = mv and his third law of 
motion together suggest that the total momentum of a pair of mutually interacting particles 
is conserved. It was later found that for two charged particles moving under the mutual 
forces of action given by Lorentz force (see Eq. (1.9)), the quantity $\mathbf{p}_1 + \mathbf{p}_2 = m_1\mathbf{v}_1 + m_2\mathbf{v}_2$
is no longer a constant of motion. What happens is the following: When a charged particle 
moves, it behaves like an electric current and therefore, produces a magnetic field around 
it. This magnetic field exerts a magnetic force on the other charged particle if the latter is 
moving. So the momentum of the second particle changes, but not the energy. Now what 
agent transfers this momentum? Surely it is the magnetic field produced by the first particle, 
which would not have existed had the first particle been at rest with respect to the second 
particle. The net result is that the magnetic field itself must carry some momentum so that 
it can impart some momentum to the charged particle. So if we want to save the third law of 
motion or the principle of conservation of momentum for the motion of charged particles, the 
Newtonian concept of momentum has to be revised. We shall see in the following chapters 
how this problem is resolved in the Lagrangian and the Hamiltonian formulations, through 
the definition of canonical momentum. 




5. Application of Newton’s laws of motion requires the specification of all forces acting 
on the object at all instants of time. In real situations, particularly when the constraint 
forces are involved, this can be a formidable task. The problem is dealt with in the next 
chapter, that is chapter 1. 

6. Newton’s laws are based on the concepts of absolute time, absolute space, absolute 
simultaneity of events and infinite speed of propagation of information. These are not 
supported by current theories, which require, in particular, that the measure of inertia or 
mass is not an absolute quantity but that only the rest mass is an invariant characteristic 
of material objects. In this book we have not given any systematic development of special 
relativistic mechanics, although occasionally examples and results are given just to stress 
the important differences that they lead to. 


1.7 SUMMARY 

Certain key points have been emphasized in this chapter. A classically defined particle is 
always assumed to have no structure, and hence can have no moment of inertia, and no spin. 
But a quantum particle can have spin without a structure. Another point we have made 
is that in physics, very often we plot graphs for representing the motion of a particle using 
any arbitrary scale for graduating the axes. This is possible because any finite length of a 
line contains as many points as there are real numbers between any two real numbers. Since 
physical quantities are all expressed in terms of some numbers, they can be geometrically 
plotted on a graph. Moreover, the choice of units can also be absolutely arbitrary because 
there can be an exact one to one correspondence of the total set of real numbers between 
any two arbitrary intervals of real numbers, as also of the total number of points between 
any two segments of arc, surface or volume elements. 

In the history part, we want to emphasize one point: Galileo was the father of modern
science, and he broke the age-long tradition of accepting pedagogical reasoning in preference
to directly testable results of experiments. His successor Newton had the best combination 
of both a theoretical mind and experimental hands. 

Newton’s laws of motion are accepted as a set of axioms. The first law like the other two 
is an axiom, that is, an unproved and unprovable assumption, which Newton proposed as a 
useful way of thinking about the world in order to make sense of the vast variety of motions 
that are observed. An axiom is judged by its proposer to be so basic and so fundamental 
that there is nothing more basic or fundamental with which it can be proved. It is a concept 
invented for the purpose of providing a starting point of thinking about the phenomena at
hand. In fact any attempt to justify an axiom is ultimately bound to be circular, for how 
can one begin other than at the beginning? The ultimate test of whether or not the axioms 
achieve what is hoped to be achieved is, how successful they are at predicting observable 
results that can be checked with actual phenomena. They represent a complete strategy for 
solving a variety of dynamical problems, not all however. The concepts of inertial frame, 
conservation of linear momentum, closed systems, and symmetries of space and time were 
explained. 

Nevertheless, it is apparent that Newton’s programme is tedious and far from simple,
particularly in noninertial situations. The motion of rigid bodies, or of continuous elastic
or fluid media does not lend itself to simple ways of tackling it. The greatest disadvantage, 
however, was that Newton had always tried to resolve most problems geometrically, rather 
than analytically. For the constrained motions, the determination of all the unimportant 
reaction forces was a great nuisance, which Newton was fully aware of, but apparently could 
not suggest practical solutions to. 

The rest of the book aims at presenting a number of alternative and superior techniques 
that have been invented and mastered by Lagrange, Euler, Hamilton, Poisson, Jacobi, and 
others. The analytical formulation of dynamics has literally begun with the works of Euler 
and Lagrange. 

In many Indian universities, the undergraduate syllabi for classical mechanics include 
topics like dynamics with constraints, Lagrangian and Hamiltonian formulations, central 
force and rigid body dynamics. The first five chapters and the first 9 sections of chapter 
12 are devoted to these topics. However, we discovered that many colleges did not teach 
these advanced topics in spite of their being an essential part of the undergraduate honours 
course. The time allocated for teaching classical mechanics at the Master’s level is also not 
sufficient to cover all the required back logs. This book can be of use in this context also. 

At the end of each chapter, a number of problems and exercises are suggested, with some 
hints and answers at the end of the book. Since the intended level of the book is a
non-elementary one, a pre-requisite is a knowledge of classical dynamics to the level of, say,
Halliday and Resnick’s treatise on the same subject. We offer 50 introductory problems 
right below to be solved without (take full credit, say 2 points each) or with the help of 
the hints provided (take half credit, say 1 point each) before starting with chapter 1 of the 
book. One may, of course, read Appendices Al, A2 and A3 and parts of chapters 3, 4, 12 
and 14 even before beginning with the chapter 1, which may help solve some of the following 
problems. The sequence of the problems is not chosen in order of increasing or decreasing 
difficulty, so don’t stop even if you cannot solve any particular ones in the beginning. The 
problems are of two kinds; some are precise, while others are approximate, so that only 
order of magnitude calculations would suffice to serve the purpose of posing the problem. 
This is also true of the problems given at the end of all other chapters. 


PROBLEMS 

1.1 A system of natural units formed out of h, c, G and k (Boltzmann’s constant) is 
called Planck units. Find the Planck units of length, time, mass and temperature. 
Compare the Planck length with the Compton wavelength of a Planck mass particle 
and the peak wavelength of blackbody radiation having a temperature equal to the 
Planck temperature. 

1.2 A car is naturally sliding down on an incline that makes an angle $\alpha$ with the
horizontal. A ball is thrown out from the moving car in a direction perpendicular to the
plane of the incline. Will it return to the car? What if the wheels share a mass
$\tan\alpha$ times the mass of the car?




1.3 Sit on a chair which is connected to a rope of negligible mass passing through a 
pulley fixed on the ceiling and hold the other end of the rope in your hand. Can you 
pull yourself up ? What is the force N that you exert on the chair ? Under what 
circumstances can N vanish ? 

1.4 A massive object is suspended by a cord and an identical cord is attached to the 
bottom of the object and dangles below it. A downward force may be applied to the 
lower end of the lower cord. Show that the upper cord breaks with a slow steady pull 
(on the lower cord) and that the lower cord breaks with a quick jerk. 

1.5 Describe the 3-D motion of a pendulum bob hung by a rubber band. What are the 
frequencies of its vertical and horizontal oscillations ? 

1.6 Two particles in a uniform gravitational field have initial positions and velocities
$\mathbf{r}_1$, $\mathbf{v}_1$, $\mathbf{r}_2$ and $\mathbf{v}_2$ respectively. Using these initial conditions alone, state a test for
determining whether the particles will collide.

1.7 A basketball is thrown vertically downward from the top of a tall building and it lands
on the street below. How high will it bounce? Take the diameter of the ball to be 0.3
m, its mass 0.7 kg, density of air 1.29 kg/m$^3$, the drag coefficient for the quadratic
drag law $C_D = 0.5$ (drag force $F = C_D A\rho v^2/2$, $A$ = area of the body that
faces the drag, $\rho$ = density of the fluid, $v$ = speed of the body).

1.8 A stone of mass $m$ is projected vertically upward from the ground level with an initial
speed $v_1$. Assuming a quadratic law of drag force, show that

(1) the speed of return to the ground $v_2$ is always less than $v_1$, and

(2) the time of descent $t_2$ is always greater than the time of ascent $t_1$.

1.9 A normal human heart pumps about 5 litres of blood per minute at a systolic pressure 
of 120 torr. What is the minimum power of the heart required for pumping blood 
alone ? 

1.10 Suppose we sweat away 2.5 litres of water every day at a normal body temperature of 
37°C (take the outside temperature to be 25°C). What would then be the minimum 
calorie requirement of the body ? Assume the heat of combustion for food or fuel 
carbon to be 10 Kcal/gm. How much oxygen is to be breathed to digest our daily 
food ? If we breathe 16 times a minute with 30% efficiency in the consumption of 
oxygen, what is the required lung capacity of an average human being ? 

1.11 If you remove all the electrons from a rain drop (diameter = 1 mm), what would be 
the gain in the electrostatic potential of the entire earth ? Take the radius of the 
earth to be R = 6378.14 km. 

1.12 The standard speed of the tape in play mode is about 4.76 cm/s. The thickness of 
the tape is $1.15 \times 10^{-2}$ mm. There is an index meter which reads in proportion with
the length of the tape run. The maximum index reading is 730 for a tape run of 45 
minutes. The inner hub radius of the tape mount is 1.35 cm. If a particular song 
starts at the index number 220, find the outer radii of the two discs of the tape at 
that moment. 




1.13 Two vehicles are bumper to bumper at a stop signal. The lead vehicle moves with a 
constant acceleration a for time T. The second vehicle follows the first but maintains 
a separation distance proportional to its own speed. Describe the motion of the second 
vehicle during time T. 

1.14 The famous astrophysicist from Cambridge, Stephen Hawking, suggested in 1975 that
a black hole of mass $M$ and radius $R$ ($= 2GM/c^2$) emits like a black body with
a temperature $T = \hbar c^3/8GMk$ ($\hbar$ = Planck’s constant/$2\pi$, $k$ = Boltzmann’s
constant, $c$ = speed of light in vacuum). How long will it survive?

1.15 The most distant object that we have seen so far is a quasar sitting at a distance of
about $15 \times 10^9$ light years away from us. Assuming that the universe is a big black
hole of the above radius, find the average mass density of the universe. How many
galaxies are there in the universe if the mass of an individual object is about $5 \times 10^{41}$
kg?

1.16 Show that the time taken for unhindered gravitational collapse of any homogeneous
spherical body does not depend on its size or mass, but only on the density. Find the
collapse times for the earth, sun, moon, an interstellar cloud and the universe, the
densities being 5.51, 1.41, 3.34, $10^{-23}$, and $10^{-29}$ (in units of gm/cm$^3$), respectively.

1.17 Suppose the sun contracts to a pulsar. Estimate the minimum radius of the pulsar 
and its period of rotation. Assume the period of rotation of the sun to be 25.38 days. 
Compare the kinetic energy of rotation of the star with that of the pulsar. What is 
the source of this increased kinetic energy? Take the radius and the mass of the sun
to be $7 \times 10^8$ m and $2 \times 10^{30}$ kg, respectively.

1.18 From a 100 m high tower, a boy stretches the rubber cord of his catapult so that it 
becomes 10 cm longer, and projects a stone of mass 20 gm at an angle 30 degrees 
with the horizon. Find the amount of heat generated when the stone hits the ground. 
Stretching of the cord by 1 cm requires a force of 1 kgf. Disregard the resistance of 
air. 

1.19 Find the optimum speed ($v_0$) and angle ($\theta_0$) with the horizontal for netting a basket
ball at height $h$ and distance $L$. Show that $\theta_0$ is greater than $\pi/4$ by an amount
$\tan^{-1}(h/L)$.

1.20 A ballistic missile is fired with an initial speed $v_0$ up the slope of a hill that has
an angular elevation with the horizontal. Find the maximum range of the missile 
along the hill slope and show that it ensures that the direction of hitting the target 
is normal to the direction of firing. 

1.21 Use the same quadratic law of drag force as in problem number 1.7 above (but with
$C_D = 0.7$) to calculate the power required for swimming under water at speed 1.7
m/s. What would be the speed of a cyclist ($C_D = 0.9$) if he consumes only a tenth as
much power in combating the air drag? (Assume the total surface area of an average
athlete’s body = 1.1 m$^2$.)




1.22 For non-spinning high speed golf balls the force of air drag is roughly linear with
velocity ($F_D = Cv$). Assume that $C/m = 0.25$ s$^{-1}$, $m = 46$ g and that the
maximum horizontal range of 152 m is obtained with an initial speed of 61 m/s. Show 
that the angle of striking has to be 32 degrees with the horizontal, whereas in absence 
of any air drag it would have been 45 degrees. 

1.23 Find the kinetic energy of a cyclist riding at a speed of 9 km/h. The cyclist with his 
bicycle weighs 78 kgf, and the wheels 3 kgf. Consider the bicycle wheels as hoops. 

1.24 Find the maximum deflection of a leaf spring caused by a load placed on its middle,
if the static deflection of the spring due to the same load is $x_0 = 2$ cm. What will
the maximum initial deflection be if the same load is dropped onto the middle of the
spring from a height of $h = 1$ m with zero initial velocity?

1.25 The kinetic energy of a neutron diminishes 1.4 times when it collides elastically and
centrally with the stationary nucleus of a moderating material. What is the
moderating element?

1.26 In the reaction $^{14}\mathrm{N}(\alpha, p)^{17}\mathrm{O}$ ($Q$ value = $-1.18$ MeV), the kinetic energy of an
alpha particle, $E = 7.7$ MeV. Show that the angle with the direction of motion of the
alpha particle at which the proton escapes, if its kinetic energy $E_p = 5.5$ MeV, is about
54 degrees. (Take the mass data from any book on nuclear physics.)

1.27 A centrifuge is used for the separation of isotopes of Uranium in one of its natural gas
compounds, called Uranium hexafluoride (UF$_6$). The gas in natural isotopic mixtures
($^{238}$U : $^{235}$U = 139 : 1) is placed inside a cylindrical vessel rotating at a high speed.
The centrifugal potential determines the Boltzmann-like barometric distribution of
the gas isotopes. Compare the concentrations of light and heavy Uranium isotopes
near the centrifuge walls, if the diameter of the cylinder is 10 cm, the rotation speed
is 2000 rps, and the temperature of the UF$_6$ compound is 27°C.

1.28 A centrifugal governor is of the form shown in Fig. 1.1. The mass of each weight is 
m and the spring constant k. Will this device work in a condition of weightlessness? 
What is the dependence of angle $\alpha$ on the speed of rotation of the system?



Fig. 1.1 Diagram for problem no. 1.28




1.29 The surfaces of two cylinders made of aluminium (solid) and of lead (hollow) having
the identical radius ($r = 6$ cm) and weight ($W = 0.5$ kgf) are painted with the same
colour (the densities of Al and Pb being 2.7 and 11.2 times that of water).

(1) How can the cylinders be distinguished by observing their translational velocities 
at the base of an inclined plane? 

(2) Find their moments of inertia. 

(3) How much time does it take for each cylinder to roll down the inclined plane
without slipping? The height of the inclined plane is $h = 0.5$ m, the angle of inclination
is $\alpha = 30$ degrees, and the initial velocity is zero.

1.30 A spherical bowl of radius $R$ rotates about the vertical diameter. The bowl contains
a small object whose radius vector in the course of rotation makes an angle $\alpha$ with
the vertical. What would be the minimum angular velocity $\omega$ of the bowl in order to
prevent the object from sliding down, if the coefficient of static friction is $\mu_s$?

1.31 Twenty drops of lead were formed when the lower end of a vertically suspended lead 
wire of 1 mm in diameter was melted. By how much did the wire become shorter ? 
The coefficient of surface tension of liquid lead is 0.47 N/m. Assume that the diameter 
of the neck of a drop at the moment it breaks away is equal to the diameter of the 
wire, and the density of lead = 11,200 kg/m$^3$ in both the phases.

1.32 A ‘hula hoop’ of mass $M$ and radius $R$ is started across a level lawn with its centre of
mass moving at linear speed $v_0$ and a backspin $\omega_0$. The coefficient of sliding friction
is a constant. Show that if the hoop is to come back toward the starting point, it is
necessary that $\omega_0 > v_0/R$.

1.33 A uniform cylinder of radius $R$ is spun about its axis to the angular velocity $\omega_0$
and then placed in a rectangular corner. The coefficient of friction between the corner
walls and the cylinder is $K$. Show that the cylinder accomplishes $n$ turns before it
stops, where

    $n = \dfrac{(1 + K^2)\,\omega_0^2 R}{8\pi g K (1 + K)}$

1.34 A uniform solid cylinder of radius $R$ rolls over a horizontal plane passing into an
inclined plane forming an angle of depression $\alpha$ with the horizontal. Find the maximum
value of the speed $v_0$ which still permits the cylinder to roll onto the inclined
plane without a jump. The sliding is assumed to be absent. Show that there will
always be a jump if $\alpha > \cos^{-1}(4/7)$, no matter how slowly the cylinder rolls down
across the slant.

1.35 A uniform ball of radius r rolls without slipping down the top of a sphere of radius 
R. Show that the angular velocity of the ball at the moment it breaks off the sphere 
is given by 

    $\omega = [10\,g(R + r)/17 r^2]^{1/2}$

The initial velocity of the ball is assumed to be negligible. 




1.36 A chain AB of length $l$ is located in a smooth horizontal tube AC so that its fraction
of length $h$ hangs freely and touches the surface of the table with its lower end B. At
a certain moment, end A of the chain is set free. With what velocity will this end of 
the chain slip out of the tube ? 

1.37 A spaceship of mass $m_0$ moves in the absence of external forces with a constant
velocity $v_0$. To change the direction of motion, a jet engine is switched on. It starts
ejecting a gas jet with velocity $u$, which is constant relative to the spaceship and at
right angles to the spaceship’s motion. The engine is shut down when the mass of the
spaceship decreases to $m$. Through what angle $\alpha$ did the direction of the motion of
the spaceship deviate due to the jet engine operation?

1.38 A rocket of initial mass $M_i$ (which is equal to the sum of the payload mass $M_f$
and the fuel) and velocity $\mathbf{v}_i$ ignites all its fuel with a constant velocity of ejection
$-u_0(\mathbf{v}_i/v_i)$ with respect to the rocket. Find its final velocity $\mathbf{v}_f$ if it is moving

(1) in free space, 

(2) against a constant gravity field g , and 

(3) against the earth’s gravity field ( obeying the inverse square law of distance with 
respect to the centre of earth). 

1.39 A horizontal plane supports a stationary vertical cylinder of radius R and a vertical 
disc A attached to the cylinder by a horizontal thread AB of length $l_0$. An initial
velocity $v_0$ is imparted to the disc perpendicular to the straightened string. How
long will it move along the plane until it strikes against the cylinder ? The friction is 
assumed to be negligible. 

1.40 A wheel of radius $b$ is rolling over level ground at a constant forward speed $v_0$. A bit
of mud breaks loose from the rim of the wheel. What is the greatest height above the 
ground that a piece of mud can reach ? Is there any critical speed of the wheel below 
which the mud will not leave the wheel at all ? 

1.41 A vertically oriented uniform rod of mass M and length l can rotate about its upper 
end. A horizontally flying bullet of mass m strikes the lower end of the rod and gets 
stuck in it; as a result, the rod swings through an angle. Assuming that m is very 
much less than M, find 

(a) the velocity v of the flying bullet; 

(b) the momentum increment in the ‘bullet-rod’ system during the impact, and what
causes the change of that momentum;

(c) at what distance x from the upper end of the rod the bullet must strike for the 
momentum of the ‘bullet-rod’ system to remain constant during the impact ? 

1.42 A rigid symmetrical dumbbell having two spheres of mass m/2 each and connected by 
a rigid rod of length $l$, is floating freely inside a spaceship under ‘no-gravity’ condition.
Now a ball of mass $m$ and speed $v_0$ collides elastically with one of the spheres of the
dumbbell at an angle 90 degrees to the axis of the dumbbell. Show that the resultant 
angular momentum of the dumbbell with respect to its centre of mass is exactly two 
thirds of the initial angular momentum of the ball with respect to the centre of mass 
of the dumbbell. Neglect the mass of the rod. 




1.43 A point moves in the plane so that its tangential acceleration is $f_t = a$, and its normal
acceleration is $f_n = bt^4$, where $a$, $b$ are positive constants and $t$ is the time. At
$t = 0$, the point was at rest. Find the curvature and the total acceleration $f$ as
functions of the distance covered $s$. (If necessary, read Appendix A2 before trying the
problem.)

1.44 When a small ball of mass m is placed on top of a large ball of mass M and they 
are dropped together, the small ball rebounds much higher than its original height. 
If the coefficients of restitution be $e_1$ and $e_2$ for the collisions of the large ball and
the small ball respectively, show that the height amplification is given by $A^2$, where

    $A = \dfrac{(1 + e_1)(1 + e_2)}{1 + m/M} - 1$

For ideal elastic rebounds of a stack of 3 balls of masses $m_1 < m_2 < m_3$ with $m_1$
on the top and dropped from the height $h$, show that the maximum height attained
by the top ball is 49 times h. (The coefficient of restitution of any elastic collision is 
defined to be the ratio of the relative velocities of the two bodies immediately after 
the collision to that immediately before the collision. This is an experimental law first 
given by John Wallis in 1668.) 

1.45 For a vertical free fall from rest under a constant g , the air drag force, be it linear 
or quadratic, leads to a terminal speed, say $v_t$. For a given $v_t$, find the speed and
distance traveled as functions of t and show that the quadratic drag law leads to a 
larger velocity and larger displacement at any time t than the linear drag law. 

1.46 Show that the value of g should increase as we go deeper into the earth’s crust because 
the density of the crust is less than two thirds of the average density of the earth. 

1.47 Suppose there is a fifth force in the universe, due to which Newton’s law of gravitation
is modified by a Yukawa type potential, so that the potential due to a point mass $m$
at a distance $r$ is

    $V(r) = -\dfrac{G_\infty m}{r}\left(1 + \alpha e^{-r/\lambda}\right)$

where $G_\infty$ is the usual Newtonian constant of gravitation, valid here for $r \to \infty$, $\lambda$
is the range of the short range gravitation and $\alpha$ is its strength. Because of this modified
form of the potential, there will be a local gravity anomaly $\Delta g(z)$, $z$ being the
depth below the surface of the earth. Obtain an expression for $g(z)$ and show that
the effective constant of gravitation deep inside any mine, that is, for $z \gg \lambda$, would
be given by $G_{\mathrm{mine}} = G_\infty(1 + \alpha)$.

1.48 Two particles of masses $m_1$ and $m_2$ having velocities (in one dimension) $u_1$ and $u_2$
respectively, collide and their velocities after collision become $v_1$ and $v_2$. If you write
the results in matrix notation with $u = (u_1, u_2)$ and $v = (v_1, v_2)$, then show that
there exists a 2 x 2 matrix $M$ such that $v = Mu$. Show that $M^2 = I$ (identity
matrix) with real eigenvalues $\lambda = \pm 1$. What is the significance of the characteristic
vectors?



1.49 A cylinder rolls down a rough incline PC, moves on to a horizontal plane (made up 
of same material as that of the incline) up to distance say CB, and stops at B. If the 
foot of perpendicular for P is A on the line CB, show that the coefficient of friction 
of the incline and the plane is given by the ratio of the lengths AP and AB. 

1.50 A damped harmonic oscillator has the equation of motion of the form $\ddot{x} + 2\lambda\dot{x} +
\omega^2 x = 0$. Construct matrices $X = (x, \dot{x})$ and

    $A = \begin{pmatrix} 0 & 1 \\ -\omega^2 & -2\lambda \end{pmatrix}$

so that the equation of motion reduces to a matrix equation $\dot{X} = AX$ with matrix
solution

    $X = e^{At} X_0$

Find the matrix $e^{At}$. What happens when $\omega \to \lambda$?






1 

Constrained Motions in 
Cartesian Coordinates 


1.0 INTRODUCTION 

Sir Isaac Newton conceived the greater part of his monumental work, The Principia, before
he was 23, even though it was finally published when he was 44. After the publication of The
Principia in 1687, he lived for another 40 years, during which time he paid little attention
to improving his scheme of dynamics so as to make it usable under the circumstances
of constrained motions. There was one great weakness in Newton’s personality, because of
which he cultivated a jealous proprietary interest in every object he studied, and almost
every achievement of his creative life was accompanied by some quarrel. He became
bitterly engaged in fighting out issues such as who, between him and Robert Hooke, was the
real discoverer of the inverse square law of gravitation, or who, between him and Leibniz,
was the true inventor of the differential calculus. He had a heated exchange of abuse with
Jean Bernoulli on the deficiencies of the Principia, with Giovanni Rizzetti over the latter’s
challenge to Newton’s optical experiments, with Johann von Hatzfeld on perpetual motion,
and so on. He was also deeply religious, and spent a reasonable portion of the last 20 years
of his life in preparing a book, The Chronology of Ancient Kingdoms Amended. We shall
see in this chapter how incomplete was the scheme he gave for tackling dynamical problems,
particularly the ones involving hindered motions, that is, motions hindered by the presence
of hard surfaces. A genius such as he could have suggested practical remedies, and perhaps
one need not have waited for the first remedy to come from Jean le Rond D’Alembert as late
as 1743.

Newton’s approach was highly geometrical. At that time, mathematics was of course in
such a primitive state that an analytical treatment of dynamics was virtually impossible.
Differential calculus was made available to the scientific community through the publication
of Leibniz in 1686. The integral calculus was formulated by Jean Bernoulli around 1690.
In 1718, Jean Bernoulli formally defined ‘functions’ as variables; the notation $f(x)$ was
introduced by Euler a few years later; Taylor’s series was published by Brook Taylor in
1715; partial differentiation and partial differential equations were introduced by Euler in
1734; the hyperbolic functions were introduced by Riccati in 1757 and the trigonometric
functions by Wallis and Lambert in 1768.

Born in Paris as an illegitimate son of aristocrats and found near the Church of St. Jean
le Rond, D’Alembert (1717 - 1783) spent his childhood and youth in the house of his
foster-father, a glazier. His natural father, Chevalier Destouches, was forced by the law to 
support the boy with an annuity. When it became apparent that the boy was a genius, 
his mother wanted to take him back, but the boy refused to accept her as his mother and 
continued to live with his foster-parents. At the age of 24, he was admitted to the French 
Academy of Sciences, and within two years he published his book on mechanics, Traité de
Dynamique. He believed in actions only by gravity or impact and devised a method of
reducing practically all problems of dynamics to ones of statics by superposing additional 
forces corresponding to those which represent the actual accelerations. The present chapter 
is meant to elucidate the basic problems of constrained motions in the Newtonian scheme 
and some practical solutions to them, as suggested over the past three hundred years by 
people like D’Alembert, Lagrange, Gauss, Gibbs, Hertz and Appell. Paul Appell’s magnum
opus was Traité de Mécanique Rationnelle, published in 1909.


1.1 CONSTRAINTS AND THEIR CLASSIFICATION 

In many real-life situations the object in motion is restricted or constrained to move in such 
a way that its coordinates and/or velocity components must satisfy some prescribed relations
at every instant of time. These relations can be expressed in the form of either equations 
or inequalities. For example, the motion of the centre of mass of a billiard ball of radius R 
moving on a billiard table of length and breadth $2a$ and $2b$ respectively, must satisfy

    $-a + R \le x \le a - R, \qquad -b + R \le y \le b - R, \qquad z = R$

assuming that the origin of the coordinate axes is at the centre of the rectangular table and 
x and y axes are parallel to length and breadth respectively. This is a set of one equation 
and two inequalities, which the motion of a billiard ball is to satisfy at all instants of time. 
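
The three relations above can be checked mechanically for any trial position of the centre of the ball. The following Python sketch is only illustrative; the function name `billiard_constraints_ok`, the tolerance `tol` and the table dimensions are assumptions made here and do not come from the text.

```python
def billiard_constraints_ok(x, y, z, a, b, R, tol=1e-12):
    """Return True if the ball centre (x, y, z) satisfies the table constraints.

    The table has half-length a and half-breadth b, the ball has radius R,
    and the origin is at the centre of the table surface.
    """
    on_table = abs(z - R) <= tol          # the single equation: z = R
    inside_x = -a + R <= x <= a - R       # first inequality
    inside_y = -b + R <= y <= b - R       # second inequality
    return on_table and inside_x and inside_y


# Hypothetical dimensions in metres, chosen only for illustration.
a, b, R = 1.4, 0.7, 0.03
print(billiard_constraints_ok(0.0, 0.0, R, a, b, R))      # ball at the centre of the table
print(billiard_constraints_ok(a, 0.0, R, a, b, R))        # centre placed on the table edge
print(billiard_constraints_ok(0.0, 0.0, 2 * R, a, b, R))  # ball lifted off the surface
```

Only the relation for $z$ is an equation; the other two merely bound the motion, which is why the same set of relations mixes one bilateral with two unilateral conditions.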

Most physical realizations of constrained motion involve surfaces of other bodies, for 
example, that of the billiard table in the above example. Similarly, a train running along 
the rails or the motion of a simple pendulum in a vertical plane define a constrained motion 
in one dimension. Physically, constrained motion is realised by the forces which arise when 
the object in motion is in contact with the constraining surfaces or curves. These forces, 
called constraint forces, are usually stiff elastic forces at the contact. The basic properties 
of constraint forces can be summarised with the following points: 

(i) They are elastic in nature and appear at the surface of contact. They arise because 
the motion defined by the external applied forces is hindered by the contact. 

(ii) They are so strong that they barely allow the body under consideration to deviate 
even slightly from a prescribed path or surface. This prescribed path or surface is called 
a constraint. The scalar equations that describe or prescribe the surface of constraint are 
called constraint equations. 

(iii) The sole effect of constraint force is to keep the constraint relations satisfied. 

Constraints are classified into different types and classes, based on four criteria, namely
(I) whether they are time dependent or time independent, (II) whether they are integrable
algebraic relations among the coordinates or nonintegrable ones, (III) whether they are
conservative or dissipative, and (IV) whether they are algebraic equations or algebraic
inequalities. Every constraint relation must be characterised by these four labels, each of which
has a binary option. Table 1.1 summarises the classification of the constraint relations.

Table 1.1 Classification of Constraints

A constraint is

I.   Either Scleronomic  : constraint relations do not explicitly depend on time,
     or     Rheonomic    : constraint relations depend explicitly on time,

and

II.  Either Holonomic    : constraint relations are or can be made independent of velocities,
     or     Nonholonomic : constraint relations are not holonomic,

and

III. Either Conservative : total mechanical energy of the system is conserved while performing
                           the constrained motion. Constraint forces do not do any work,
     or     Dissipative  : constraint forces do work and total mechanical energy is not
                           conserved,

and

IV.  Either Bilateral    : at any point on the constraint surface both the forward and
                           backward motions are possible. Constraint relations are not in
                           the form of inequalities but are in the form of equations,
     or     Unilateral   : at some points no forward motion is possible. Constraint relations
                           are expressed in the form of inequalities.


Properties of constraints: 

(i) Just by looking at the constraint relation it may be possible to determine the type 
qualification for the classes I, II and IV, but the determination of the type qualification 
for class III depends on whether the constraint forces do any work while maintaining the 
constraint relation throughout all stages of motion. 

(ii) It may so happen that the constraint relation contains velocities but can be integrated 
with respect to time so that the resulting relation is made free of velocities. In such cases 
the constraint is holonomic. For example, the constraint equation 

    $(yz - 2x + y)\,\dot{x} + (xz - 2y + x)\,\dot{y} + xy\,\dot{z} = 0$

can be integrated to

    $(1 + z)\,xy = x^2 + y^2 + c$




so that this constraint is holonomic. 
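
As a check, one may differentiate the integrated relation with respect to time and recover the velocity-dependent form. The following is a worked sketch, assuming only that $x$, $y$ and $z$ are differentiable functions of $t$ and that $c$ is a constant of integration:

```latex
\begin{align*}
\frac{d}{dt}\Big[(1+z)\,xy - x^2 - y^2 - c\Big]
  &= \dot{z}\,xy + (1+z)\,(\dot{x}\,y + x\,\dot{y}) - 2x\,\dot{x} - 2y\,\dot{y} \\
  &= (yz - 2x + y)\,\dot{x} + (xz - 2y + x)\,\dot{y} + xy\,\dot{z} .
\end{align*}
```

Setting the left-hand side to zero reproduces the constraint equation term by term, so the velocity-dependent relation is a total time derivative of a function of the coordinates alone.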

(iii) The general form of the unilateral constraint can be written as

    $f(\mathbf{r}_i, \dot{\mathbf{r}}_i, t) \ge 0$    (1.1)

where $\mathbf{r}_i$ and $\dot{\mathbf{r}}_i$ are the position and velocity of the $i$th particle of the system in motion.
Whenever the state of motion of the system is such that, for the scalar function $f$ above,
the condition $f = 0$ is satisfied, we say that the constraint is taut. The motion of a
system with unilateral constraints can be divided into portions so that in certain portions 
the constraint is taut and the motion occurs as if the constraint were bilateral, and in other 
portions the constraint is not taut and the motion occurs as if there were no constraints. 

However, in this book, we shall be considering constraints that are almost exclusively 
bilateral, unless stated otherwise. 

(iv) Forces of constraints: We have already mentioned in the Introduction chapter that 
Newton’s second law of motion is a complete law of nature. The relation between the 
observed acceleration of an object, and the total force it is subjected to, is fixed in any 
inertial frame. But somehow one has to specify this total force which Newton has not been 
able to for arbitrary dynamical systems. By the law of superposition of forces, one can 
divide the total force into as many components as one wishes. So for a given problem, one 
first tries to make a list of all the ‘obvious’ forces. Usually they are the ones that are defined 
by the universally recognised laws of forces, such as Newton’s law of gravitation, Coulomb’s 
law of electrostatics, and so on. They really do not depend on the nature of constraint 
relations, e.g., the sort of surfaces on which the particles are constrained to move. It is 
customary to identify all such externally applied universal forces and include them under 
the category of externally applied forces. The rest of the forces are assumed to originate from 
contact with the constraining surfaces. Such forces are categorically classified as the forces 
of constraints. Unfortunately, Newton did not give any prescription for calculating these
forces of constraints. Hence, in the absence of the knowledge of the total force, Newton’s second
law in its differential form cannot even be formulated, let alone solved, for dynamical
problems involving constraints.

(v) Work done by the constraint forces: Usually the constraint forces act in a direction 
perpendicular to the surface of constraints at every point on it, while the motion of the 
object is parallel to the surface at every point. In such cases the work done by constraint 
forces is zero. One obvious exception is, of course, the frictional force due to sliding which 
does work for real displacements. Another exception is the rheonomic constraint for which 
the constraint force need not act perpendicular to the real displacement. This can be easily 
seen from Fig. 1.1, where the real path of the bob of a pendulum with variable string length
$l = l(t)$ is shown. We see from the figure that $\mathbf{T} \cdot \Delta\mathbf{r} \ne 0$, where $\mathbf{T}$ is the tension in the
string (constraint force) and $\Delta\mathbf{r}$ is the real displacement. However, the work done by the
tension in the simple pendulum is zero as the length of the string remains constant, allowing
the bob to move perpendicular to the constraint force of tension that acts along the string. 
Generally speaking, rheonomic constraints are dissipative, although there are exceptions. 





Fig. 1.1 An example of work done by constraint force being nonzero when
the constraint is rheonomic


1.2 EXAMPLES OF CONSTRAINTS 

We now give examples illustrating various types of constraints. 

1. Rigid body 

A rigid body is, by definition, a system of particles such that the distance between any pair 
of particles remains constant in time. Thus the motion of a rigid body is constrained by the 
equations

    $|\mathbf{r}_i - \mathbf{r}_k| = \text{const.}$    (1.2)

where the pair of subscripts $(i, k)$ runs over all distinct pairs of particles forming the body.
Obviously this constraint is scleronomic. The constraint is also holonomic and bilateral. We 
prove that this constraint is also conservative, that is, the work done by the internal forces 
(which are the forces of constraint) in the rigid body is zero. This is shown as follows. The 
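constraint relations (1.2) can be written as

    $|\mathbf{r}_i - \mathbf{r}_k|^2 = \text{const.}$

Taking differentials,

    $(\mathbf{r}_i - \mathbf{r}_k) \cdot \Delta(\mathbf{r}_i - \mathbf{r}_k) = 0$    (1.3)

Now let the internal force of constraint on the $i$th particle due to the $k$th particle be
represented by $\mathbf{F}_{ik}$. By Newton’s third law we have,

    $\mathbf{F}_{ik} = -\mathbf{F}_{ki}$

Thus we have for the work done by $\mathbf{F}_{ik}$ due to a displacement $\Delta\mathbf{r}_i$ of the $i$th particle,

    $\mathbf{F}_{ik} \cdot \Delta\mathbf{r}_i = -\mathbf{F}_{ki} \cdot \Delta\mathbf{r}_i$    (1.4)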

where $\Delta\mathbf{r}_i$ is a possible displacement (consistent with all the constraint relations) given by
Eq. (1.3). By virtue of Eq. (1.4) we can write for the total work done by the system

    $\Delta W = \sum_{i,k;\ k \ne i} \mathbf{F}_{ik} \cdot \Delta\mathbf{r}_i = \sum_{i,k;\ k > i} \{\mathbf{F}_{ik} \cdot \Delta\mathbf{r}_i + \mathbf{F}_{ki} \cdot \Delta\mathbf{r}_k\} = \sum_{i,k;\ k > i} \mathbf{F}_{ik} \cdot \Delta(\mathbf{r}_i - \mathbf{r}_k)$

Again, since all $\mathbf{F}_{ik}$ are the internal forces which arise purely due to interaction between
all possible pairs of particles, it is only natural that $\mathbf{F}_{ik}$ will act parallel to the line joining
the $i$th and $k$th particles. Thus we can write,

    $\mathbf{F}_{ik} = C_{ik}(\mathbf{r}_i - \mathbf{r}_k)$

where the $C_{ik}$’s are real constants and symmetric in $i$ and $k$. Substituting in the above expression
for the total work, we have

    $\Delta W = \sum_{k > i} C_{ik}(\mathbf{r}_i - \mathbf{r}_k) \cdot \Delta(\mathbf{r}_i - \mathbf{r}_k)$

Now by Eq. (1.3) each individual term of the sum is zero. Thus the constraint of
rigidity is conservative in nature, apart from its being scleronomic, holonomic and bilateral.
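
The vanishing of the internal work can also be checked numerically. The sketch below is illustrative only: it assumes five particles at random positions, arbitrary symmetric coefficients `C[i][k]` (taken positive here purely for convenience), and a small rigid displacement made up of a translation plus a rotation. All names and numbers are hypothetical. For comparison, a small uniform stretch, which violates Eq. (1.2), is also tried.

```python
import random


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def internal_work(positions, displacements, C):
    """Sum over i and k != i of F_ik . dr_i, with F_ik = C[i][k] * (r_i - r_k)."""
    n = len(positions)
    work = 0.0
    for i in range(n):
        for k in range(n):
            if k == i:
                continue
            sep = tuple(p - q for p, q in zip(positions[i], positions[k]))
            force = tuple(C[i][k] * s for s in sep)
            work += dot(force, displacements[i])
    return work


random.seed(0)
n = 5
positions = [tuple(random.uniform(-1.0, 1.0) for _ in range(3)) for _ in range(n)]

# Symmetric coefficients C_ik = C_ki (hypothetical values).
C = [[0.0] * n for _ in range(n)]
for i in range(n):
    for k in range(i + 1, n):
        C[i][k] = C[k][i] = random.uniform(1.0, 10.0)

eps = 1e-6                 # smallness of the displacement
shift = (0.3, -0.2, 0.5)   # common translation
omega = (0.1, 0.7, -0.4)   # rotation vector

# Rigid displacement: dr_i = eps * (shift + omega x r_i); distances are preserved to first order.
rigid = [tuple(eps * (s + w) for s, w in zip(shift, cross(omega, r))) for r in positions]
# Non-rigid displacement: every particle moves radially, so all distances change.
stretch = [tuple(eps * c for c in r) for r in positions]

w_rigid = internal_work(positions, rigid, C)
w_stretch = internal_work(positions, stretch, C)
print(w_rigid, w_stretch)
```

The rigid displacement should give a total that is zero up to rounding error, whereas the stretch gives a clearly nonzero value, in line with the remark on deformable bodies in the next example.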

2. Deformable Bodies

As opposed to rigid bodies we have deformable bodies, that is, bodies whose shape can 
change. Suppose that the deformation of the body is changing in time according to a 
certain prescribed function of time. Then the motion of such a body is constrained by the 
equation

    $|\mathbf{r}_i - \mathbf{r}_k| = f(t)$    (1.5)

where, again, $\mathbf{r}_i$ and $\mathbf{r}_k$ are position vectors and the pair of subscripts $(i, k)$ runs over all
distinct pairs of particles in the body. It is easy to show that these constraint relations
cannot give the total work $\Delta W = 0$, as they did in the previous case. This is a case of a
rheonomic, holonomic, bilateral and dissipative constraint.

3. Simple Pendulum with Rigid Support 

The position of the bob at any time must satisfy 

    $|\mathbf{r}|^2 = x^2 + y^2 + z^2 = l^2$    (1.6)

where $l$ is the constant length of the string connecting the bob to the fulcrum (which is
taken to be the origin). The tension in the string $\mathbf{T}$ is parallel to $-\mathbf{r}$, giving the work
done by the constraint force $\Delta W = \mathbf{T} \cdot \Delta\mathbf{r} = 0$, as $\mathbf{r} \cdot \Delta\mathbf{r} = 0$ from Eq. (1.6). This is
a scleronomic, holonomic, bilateral and conservative constraint.

4. Pendulum with Variable Length

Suppose the length of the string is changing according to a given function $l(t)$. Then the
constraint equation is

    $|\mathbf{r}(t)|^2 = l^2(t)$    (1.7)

where $\mathbf{r}(t)$ is the position vector of the bob at time $t$ (the origin being at the fixed fulcrum).
Hence we have $\mathbf{r} \cdot \Delta\mathbf{r} \ne 0$, and since $\mathbf{T}$ is parallel to $-\mathbf{r}$, the work done by the constraint
force $\mathbf{T}$ is $\Delta W = \mathbf{T} \cdot \Delta\mathbf{r} \ne 0$. This is a case of a rheonomic, holonomic, bilateral and
dissipative constraint. Such a pendulum can increase or decrease its amplitude of oscillation 
depending on the time of pulling and releasing of the string of the pendulum through its 
fulcrum. 

5. A Spherical Container of Fixed Radius R Filled with a Gas

The constraint relations for the gas particles inside the container are

    $|\mathbf{r}| \le R$    (1.8)

where $\mathbf{r}$ is measured from the centre of the container. The equality in Eq. (1.8) corresponds
to the equation of the surface of the container, that is, to the situation when the particle is
about to bounce off the surface. This is a case of a scleronomic, holonomic, conservative
and unilateral constraint.

6. An Expanding or Contracting Spherical Container of Gas 

Suppose the radius R of the container is changing with time so that the position of any 
particle at any instant will satisfy

    $|\mathbf{r}| \le R(t)$    (1.9)

where the equality holds when the particle is just about to bounce off from the wall. If the 
chamber is expanding, the kinetic energy of the bouncing particle decreases at each bounce. 
This is a case of a rheonomic, holonomic, unilateral and dissipative constraint. 
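
The loss of kinetic energy at a receding wall can be illustrated in one dimension. The sketch below assumes an infinitely massive wall and a perfectly elastic bounce, so that the velocity of the particle simply reverses in the rest frame of the wall; the function names and the numbers used are hypothetical.

```python
def bounce_off_moving_wall(v, u):
    """Velocity of a particle after an elastic bounce off a very massive wall.

    v : velocity of the particle before the bounce (towards the wall, v > u)
    u : velocity of the wall (u > 0 for an expanding container,
        u < 0 for a contracting one)
    In the rest frame of the wall the velocity v - u is reversed;
    transforming back to the laboratory frame gives 2u - v.
    """
    return 2.0 * u - v


def kinetic_energy(m, v):
    return 0.5 * m * v * v


m, v = 1.0, 10.0   # hypothetical mass and incoming velocity
for u in (1.0, 0.0, -1.0):
    v_after = bounce_off_moving_wall(v, u)
    change = kinetic_energy(m, v_after) - kinetic_energy(m, v)
    print(f"wall velocity {u:+.1f}: velocity after bounce {v_after:+.1f}, "
          f"energy change {change:+.1f}")
```

With the wall at rest the energy is unchanged, which is the scleronomic case of Eq. (1.8); a receding wall takes energy from the particle and an advancing wall supplies it.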

7. A Simple Pendulum with its Bob Sliding on a Circular Track 

Here the constraint forces involve frictional forces due to sliding of the bob on the track. 
Since frictional forces due to sliding are nonconservative (they do nonzero work on the bob 
as it moves from one end to the other) the pendulum loses energy as its motion is hindered 
due to friction. This is therefore, an example of a dissipative constraint. 

8. Rolling without Sliding 

Suppose a spherical ball or a cylinder is rolling on a plane without sliding. We assume that 
the surfaces in contact are perfectly rough. Thus the frictional forces are not negligible. 
Since the point of contact is not sliding, the frictional forces do not do any work, and 
hence the total mechanical energy of the rolling body is conserved. Thus the constraint is 
conservative. To obtain the constraint equation we note that rolling without sliding means 
that the relative velocity of the point of contact with respect to the plane is zero. Then the 
velocity V of any point P in the rolling body, as seen from a fixed frame of reference, is 
given by 

    $\mathbf{V} = \mathbf{V}_{cm} + \boldsymbol{\omega} \times \mathbf{r}$    (1.10)

where $\mathbf{V}_{cm}$ is the velocity of the centre of mass and $\mathbf{r}$ is measured from the CM to the
point P under consideration. Thus the velocity of the point of contact (nearest to the axis
in the case of cylinder) is obtained by putting

    $\mathbf{r} = -r\,\hat{\mathbf{n}}$

in Eq. (1.10), where $\hat{\mathbf{n}}$ is the unit vector along the outward normal to the plane and $r$ is
the radius of the sphere (or cylinder). Since there is no sliding of this point (or the line) we
must have the instantaneous velocity $\mathbf{V}$ at the contact

    $\mathbf{V} = \mathbf{V}_{cm} - r(\boldsymbol{\omega} \times \hat{\mathbf{n}}) = 0$    (1.11)

This is the required vector equation of constraint, representing actually three scalar equations.
For a sphere this constraint is nonintegrable because $\boldsymbol{\omega}$ is generally not expressible
in the form of a total time derivative of any single coordinate. Thus the constraint is
nonholonomic. However, for a cylinder, $\omega = d\theta/dt$, where $\theta$ is the angle of rotation of the
cylinder about its axis. Therefore this equation of constraint can be integrated and reduced
to a holonomic form, giving a relation between $\theta$ and the coordinates of the centre of mass.
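
Equation (1.11) is easy to evaluate for a concrete case. The sketch below assumes a cylinder of radius `r` rolling along the x axis on the plane z = 0, with its axis along y; the function names, the axis convention and the numbers are choices made here for illustration only.

```python
def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def contact_velocity(v_cm, omega, n_hat, r):
    """Velocity of the point of contact, V_cm - r (omega x n_hat), as in Eq. (1.11)."""
    w_cross_n = cross(omega, n_hat)
    return tuple(vc - r * wn for vc, wn in zip(v_cm, w_cross_n))


n_hat = (0.0, 0.0, 1.0)         # outward normal to the plane z = 0
r = 0.5                         # radius of the cylinder (hypothetical)
theta_dot = 3.0                 # rate of rotation about the cylinder axis
omega = (0.0, theta_dot, 0.0)   # axis of the cylinder taken along y

rolling = contact_velocity((r * theta_dot, 0.0, 0.0), omega, n_hat, r)
sliding = contact_velocity((2.0, 0.0, 0.0), omega, n_hat, r)
print(rolling)   # contact point at rest: rolling without sliding
print(sliding)   # nonzero contact velocity: the cylinder slides
```

With this choice of axes the constraint reduces to a single scalar relation between the speed of the centre of mass and the rate of rotation, $\dot{x}_{cm} = r\dot{\theta}$, which integrates to $x_{cm} - r\theta = \text{const.}$ and is therefore holonomic.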


1.3 PRINCIPLE OF VIRTUAL WORK 


1.3.1 Virtual Displacement 


Any imaginary displacement which is consistent with the constraint relation at a given 
instant (that is, without allowing the real time to change) is called a virtual displacement. 

Thus, given a system in a positional configuration $\mathbf{r}(t)$ at time $t$, the set of all virtual
displacements is a particular subset of the set of all possible displacements. Physically such
displacements are the displacements that would occur if the system were frozen in its motion
at time $t$, and the system were then moved without violating any of the constraints operating
on the system at that instant. A virtual displacement may be finite in magnitude, but in this
book a virtual displacement actually means an infinitesimal virtual displacement that does
not violate any of the constraints operative at the given instant $t$. So by definition, a virtual
infinitesimal displacement is given by

    $\delta x_i = dx_i \,\big|_{dt = 0}$


Example 

Consider a simple pendulum with variable string length $l(t)$. At any instant the string
length is $l(t)$ and the real displacement of the bob $\Delta\mathbf{r}$ in time $\Delta t$ is not perpendicular to
the string direction (see Fig. 1.1). A virtual displacement $\delta\mathbf{r}$ is thought to be an imaginary
displacement consistent with the constraint $l(t)$ for time $t$. For the whole of $\delta\mathbf{r}$ the value
of $l(t)$ is kept the same as that for the instant $t$. Hence the virtual displacement $\delta\mathbf{r}(t)$
constructed in the above way is perpendicular to the string direction prevailing at time $t$,
whereas the real displacement $\Delta\mathbf{r}(t)$ is defined as usual by $\mathbf{r}(t + \Delta t) - \mathbf{r}(t)$, with the
fact that $\mathbf{r}(t + \Delta t)$ is consistent with the constraint relation at $t + \Delta t$ and that $\mathbf{r}(t)$ is
consistent with the constraint relation at $t$.
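
The distinction can be made concrete with a small numerical sketch. It assumes a planar pendulum whose string length and angle are prescribed functions of time; the particular functions `length` and `angle` below are hypothetical and are chosen only so that there is something to evaluate. The component of each displacement along the string is then compared.

```python
import math


def length(t):
    return 1.0 + 0.2 * t             # hypothetical l(t)


def angle(t):
    return 0.3 * math.cos(2.0 * t)   # hypothetical angle of the string from the vertical


def position(t):
    l, phi = length(t), angle(t)
    return (l * math.sin(phi), -l * math.cos(phi))   # fulcrum at the origin


def along_string(d, phi):
    """Component of the displacement d along the string direction (sin phi, -cos phi)."""
    return d[0] * math.sin(phi) - d[1] * math.cos(phi)


t, dt = 0.5, 1e-4
phi = angle(t)

# Real displacement: both the angle and the string length change during dt.
r_now, r_later = position(t), position(t + dt)
real = (r_later[0] - r_now[0], r_later[1] - r_now[1])

# Virtual displacement: time is frozen, so l stays at l(t); only the angle is varied.
dphi = 1e-4
virtual = (length(t) * math.cos(phi) * dphi, length(t) * math.sin(phi) * dphi)

real_along = along_string(real, phi)
virtual_along = along_string(virtual, phi)
print(real_along, virtual_along)
```

The real displacement has a component along the string of about $\dot{l}\,\Delta t$, while the virtual displacement has none, so the tension does no virtual work even though it does real work.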


1.3.2 Virtual Work 

Work done by any force on a particle due to its virtual displacement is called virtual work. 
We can express this definition through the equation 

    $\delta W = \mathbf{F} \cdot \delta\mathbf{r}$    (1.12)




Here $\mathbf{F}$ is the vector sum of the constraint force $\mathbf{f}$ and the applied forces. Infinitesimally
small elements of all virtual quantities are usually denoted by the prefix $\delta$,
reserving $\Delta$ or $d$ for the real ones.


1.3.3 Principle of Virtual Work 

We know that for a system in static equilibrium the total force on the system vanishes, 
by definition. Hence the virtual work done on such systems due to any arbitrary virtual 
displacement must identically vanish. Thus the total force $\mathbf{F}$ on the system given by

    $\mathbf{F} = \mathbf{f} + \mathbf{F}^{(a)} = 0$

where $\mathbf{f}$ is the sum total of the constraint forces and $\mathbf{F}^{(a)}$ that of the applied forces,
must yield, for the virtual work

    $\delta W = \mathbf{F} \cdot \delta\mathbf{r} = 0$    (1.13)

Furthermore, if the virtual work done by the constraint forces also vanishes, that is,

    $\mathbf{f} \cdot \delta\mathbf{r} = 0$    (1.14)

then the virtual work done by the applied force on a system in static equilibrium also
vanishes, or in other words, the condition for static equilibrium reduces to

    $\delta W_a = \mathbf{F}^{(a)} \cdot \delta\mathbf{r} = 0$    (1.15)

For a system of $n$ particles we should have

    $\delta W_a = \sum_{i=1}^{n} \mathbf{F}_i^{(a)} \cdot \delta\mathbf{r}_i = 0$    (1.16)


The above equation states that the necessary condition for static equilibrium is that the 
virtual work done by all the applied forces should vanish, provided the virtual work done 
by all the constraint forces vanishes. This is called the principle of virtual work. 

The strength of the above principle lies in the fact that for a dynamical system one can 
determine the amount of force one needs to apply to the system in order to make it static 
(which is essential for making the principle of virtual work applicable to the system under 
consideration). We shall discuss this point in a later section devoted to what is known as 
D’Alembert’s principle. 


An Example 

This is shown in the accompanying Fig. 1.2. The motion of two blocks having masses $M_1$
and $M_2$ is constrained by the fact that the string connecting them has a constant length $l$.
Obviously, this is a case of a scleronomic, holonomic, bilateral and conservative constraint,
provided friction due to the pulley can be neglected. Elementary considerations without
involving virtual work show that the accelerations of the blocks are given by

    $\ddot{x}_{M1} = \dfrac{(M_1 - M_2)\,g}{M_1 + M_2} = -\ddot{x}_{M2}$

where $x_{M1}$ and $x_{M2}$ are measured downward from the horizontal plane passing through
the centre of the pulley. Thus, for a static situation one must have $\ddot{x}_{M1} = -\ddot{x}_{M2} = 0$,
requiring $M_1 = M_2$.



Fig. 1.2 Simple Atwood’s machine (blocks $M_1$ and $M_2$ at depths $x_{M_1}$ and $x_{M_2}$ below the pulley)


To apply the principle of virtual work to the above case we first note that, since the
constraint $|x_{M_1}| + |x_{M_2}| = l$ is scleronomic, the virtual and real displacements are one
and the same. Next, a displacement of one block causes an equal and opposite displacement
of the other block. Thus the vanishing of virtual work done by the applied forces requires
that

$$M_1 g\,|\Delta x_{M_1}| - M_2 g\,|\Delta x_{M_2}| = 0$$

or, $M_1 = M_2$, which is quite obvious. One can verify that in this case the work done by
the constraint forces also vanishes.
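The Atwood result can be checked numerically. The following is a minimal sketch, not part of the original treatment: the function names and the value of g are assumptions made for illustration. It evaluates the elementary acceleration formula and the virtual work of gravity for a displacement in which the two blocks move by equal and opposite amounts.

```python
def atwood_acceleration(m1, m2, g=9.81):
    """Downward acceleration of block 1; block 2 has the opposite value."""
    return (m1 - m2) * g / (m1 + m2)


def applied_virtual_work(m1, m2, dx1, g=9.81):
    """Virtual work done by gravity when block 1 moves down by dx1.

    The fixed string length forces block 2 to move by -dx1.
    """
    dx2 = -dx1
    return m1 * g * dx1 + m2 * g * dx2


# Equal masses: no acceleration and no virtual work (static equilibrium).
print(atwood_acceleration(2.0, 2.0), applied_virtual_work(2.0, 2.0, 1e-3))
# Unequal masses: both quantities are nonzero, so there is no equilibrium.
print(atwood_acceleration(3.0, 1.0), applied_virtual_work(3.0, 1.0, 1e-3))
```

The two criteria agree: the virtual work of the applied forces vanishes for every displacement exactly when the acceleration vanishes.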


1.4 THE BASIC PROBLEM WITH THE CONSTRAINT FORCES 

Take the simplest case of one-particle motion under a general velocity-dependent constraint
given by the relation

$$g(\mathbf{r}, \dot{\mathbf{r}}, t) = 0 \tag{1.17}$$

To fulfil the requirement of the constraint, an unknown constraint force $\mathbf{f}$ must be
introduced in addition to the known applied force $\mathbf{F}^{(a)}$ on the particle. Thus Newton’s second
law takes the form

$$m\ddot{\mathbf{r}} = \mathbf{F}^{(a)} + \mathbf{f} \tag{1.18}$$

The vector Eq. (1.18) is a set of three scalar equations and Eq. (1.17) is a single scalar
equation, so that we have in total four equations. The unknowns are the
three functions $x(t)$, $y(t)$, $z(t)$ (or any other set of three independent Cartesian coordinate
functions represented by the vector $\mathbf{r}$) and the three components of $\mathbf{f}$, that is, six in all.
Thus we have four equations in six unknowns, a problem which does not possess a unique
solution. This is the basic problem in dealing with constraint forces.

Earlier we noticed that unless we can analytically express the forces of constraint on the 
same footing as the externally applied forces, the content of the right hand side of Newton’s 
second law remains incomplete and therefore one could not proceed any further. Now we 
discover an additional problem, that is, even if the constraint relation is completely specified, 
we have too few equations to solve for all the unknowns. At this point, one may begin to 
wonder as to how one solved so many problems of constrained motions using Newton’s 
method. Well, you can now scrutinise all the details of the methods you had applied, and 
find for yourself that some extra equations were indeed formulated in each case. There exists 
no particular rule as to how to formulate such new equations. Usually the torque equations 
and sometimes the energy equations act as supplements to the usual second law. But we are 
now interested in obtaining a precise method of tackling any given dynamical problem of
constrained motions, so that the problem can be solved by following a fixed prescription,
without any appeal to intuition. It is like inventing algebra
(mechanical prescriptions) to replace intuition-based arithmetic.


1.5 LAGRANGE’S EQUATIONS OF MOTION OF THE FIRST KIND 


In order to circumvent the situation noted in the previous section, let us claim (and hope) 
that the constraint relation must contain complete information regarding the restriction 
put on the motion of the system. This cannot be otherwise, because, given a kinematical 
description of the motion we should not need anything more than the constraint relation 
to check whether the motion is consistent with that constraint. This consideration suggests 
that the constraint forces should be derivable solely from the constraint relations. To achieve 
this, one proceeds as follows. 


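First consider a nonholonomic constraint, say, the one given by Eq. (1.17). Let us define
$h = dg/dt$. Since $g(\mathbf{r}, \dot{\mathbf{r}}, t) = 0$, $h(\mathbf{r}, \dot{\mathbf{r}}, \ddot{\mathbf{r}}, t)$ must also be equal to zero, that is,

$$h = \frac{\partial g}{\partial t} + \frac{\partial g}{\partial \mathbf{r}}\cdot\dot{\mathbf{r}} + \frac{\partial g}{\partial \dot{\mathbf{r}}}\cdot\ddot{\mathbf{r}} = 0$$

Again, since none of $\partial g/\partial t$, $\partial g/\partial \mathbf{r}$, $\partial g/\partial \dot{\mathbf{r}}$ contains $\ddot{\mathbf{r}}$ explicitly, we have

$$\frac{\partial h}{\partial \ddot{\mathbf{r}}} = \frac{\partial g}{\partial \dot{\mathbf{r}}} \tag{1.19}$$

and $h$ depends linearly on $\ddot{\mathbf{r}}$, where $\ddot{\mathbf{r}}$ is the total acceleration of the particle.

Now if the constraint were holonomic, say of the form $g(\mathbf{r}, t) = 0$, we would like to define
the function $h$ by $h = d^2g/dt^2$. Arguing as before, as $g(\mathbf{r}, t) = 0$, $h = 0$, that is,

$$h = \frac{\partial^2 g}{\partial t^2} + \frac{\partial^2 g}{\partial \mathbf{r}\,\partial t}\cdot\dot{\mathbf{r}} + \frac{d}{dt}\!\left(\frac{\partial g}{\partial \mathbf{r}}\right)\cdot\dot{\mathbf{r}} + \frac{\partial g}{\partial \mathbf{r}}\cdot\ddot{\mathbf{r}} = 0 \tag{1.20}$$

The first three terms on the RHS of the previous equation do not contain any $\ddot{\mathbf{r}}$ explicitly
and so also the factor $\partial g/\partial \mathbf{r}$ of the fourth term. Therefore,

$$\frac{\partial h}{\partial \ddot{\mathbf{r}}} = \frac{\partial g}{\partial \mathbf{r}}$$

and $h$ is once again linearly related to the total acceleration $\ddot{\mathbf{r}}$.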
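As a concrete check of the holonomic case, consider a particle held on a sphere of fixed radius $l$. This particular constraint is our own illustrative choice at this point, although it is the same one that governs the pendulum worked out in Section 1.5.2. Differentiating twice with respect to time gives a function $h$ that is linear in the acceleration, with coefficient equal to $\partial g/\partial\mathbf{r}$:

```latex
\begin{align*}
g(\mathbf{r},t) &= \mathbf{r}\cdot\mathbf{r} - l^2 = 0, \\
\frac{dg}{dt} &= 2\,\mathbf{r}\cdot\dot{\mathbf{r}} = 0, \\
h = \frac{d^2 g}{dt^2} &= 2\,\dot{\mathbf{r}}\cdot\dot{\mathbf{r}} + 2\,\mathbf{r}\cdot\ddot{\mathbf{r}} = 0, \\
\frac{\partial h}{\partial \ddot{\mathbf{r}}} &= 2\,\mathbf{r} = \frac{\partial g}{\partial \mathbf{r}} .
\end{align*}
```

The vector $2\mathbf{r}$ points along the radius, which is the direction one expects for the force exerted by a string or a smooth spherical surface.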

Note that in both the cases, $\partial h/\partial\ddot{\mathbf{r}}$ is a vector function of $\mathbf{r}$ and $t$ (and may as well be
a function of $\dot{\mathbf{r}}$, if the constraint is nonholonomic) and $h$ is linearly dependent on $\ddot{\mathbf{r}}$. The
equations

$$h = \frac{dg}{dt} = 0 \quad \text{for the nonholonomic case}$$

and

$$h = \frac{d^2 g}{dt^2} = 0 \quad \text{for the holonomic case}$$

are the additional constraint equations on the total acceleration (or equivalently, the total
force). These have the general form

$$0 = (\text{some function of } \mathbf{r}, \dot{\mathbf{r}} \text{ and } t) + \frac{\partial h}{\partial\ddot{\mathbf{r}}}\cdot\ddot{\mathbf{r}}$$

where $\ddot{\mathbf{r}}$ is the total acceleration of the particle and $\partial h/\partial\ddot{\mathbf{r}}$ is a vector quantity determined
purely from the constraint relation and is itself independent of the total acceleration.

The above constraint relation on total acceleration is therefore directly affected by the
vector $\partial h/\partial\ddot{\mathbf{r}}$. Only the component of total acceleration parallel to the vector $\partial h/\partial\ddot{\mathbf{r}}$ enters
the above constraint relation, because of the scalar nature of the product $(\partial h/\partial\ddot{\mathbf{r}})\cdot\ddot{\mathbf{r}}$.
Therefore, the constraint force cannot have any component perpendicular to $\partial h/\partial\ddot{\mathbf{r}}$,
because if it had, it would contradict our legitimate expectation that forces of
constraints be solely derivable from the constraint relations. Or in other words, $\mathbf{f}$ must be
parallel to $\partial h/\partial\ddot{\mathbf{r}}$, that is,

$$\mathbf{f} = \lambda\,\frac{\partial h}{\partial\ddot{\mathbf{r}}} \tag{1.21}$$

where $\lambda$ is an unknown scalar that takes care of the required dimension and magnitude of
$\mathbf{f}$.

Since $g(\mathbf{r}, \dot{\mathbf{r}}, t)$ is given, $h$ and hence $\mathbf{f}$ are known except for $\lambda$. Now there are four
unknowns and four independent equations, giving simultaneous solutions for $\mathbf{r}$ and $\lambda$, and
hence $\mathbf{f}$ is uniquely determined. Newton’s equations of motion now acquire the
form

$$m\ddot{\mathbf{r}} - \mathbf{F}^{(a)} - \lambda\,\frac{\partial h}{\partial\ddot{\mathbf{r}}} = 0 \tag{1.22}$$

More explicitly, that is, in terms of the given constraint relation, we have for holonomic
one-particle systems

$$m\ddot{\mathbf{r}} - \mathbf{F}^{(a)} - \lambda\,\frac{\partial g(\mathbf{r}, t)}{\partial\mathbf{r}} = 0$$

and for nonholonomic one-particle systems,

$$m\ddot{\mathbf{r}} - \mathbf{F}^{(a)} - \lambda\,\frac{\partial g(\mathbf{r}, \dot{\mathbf{r}}, t)}{\partial\dot{\mathbf{r}}} = 0$$

These are sometimes called Lagrange’s equations of the first kind, and $\lambda$ is called Lagrange’s
undetermined multiplier.

It may be mentioned that the concept of undetermined multiplier was introduced in
physics by Joseph Louis Lagrange in 1764. Today even school children use the idea, quite
unknowingly though, when they try to solve simultaneous linear algebraic equations
involving two or more unknowns. Given the equations $ax + by = p$ and $cx + dy = q$
with $x$ and $y$ as unknowns, we multiply the first equation by $c$ and the second by $a$ and
then subtract the first from the second in order to eliminate $x$ and thereby evaluate $y$. Here
the Lagrange multipliers are $c$ and $a$ respectively.
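The elimination just described can be written out in a few lines. This is only an illustrative sketch: the function name and the sample coefficients are our own, and exact rational arithmetic is used so that the result is not blurred by rounding.

```python
from fractions import Fraction


def solve_by_multipliers(a, b, p, c, d, q):
    """Solve a*x + b*y = p and c*x + d*y = q by the elimination in the text.

    Multiplying the first equation by c and the second by a, then
    subtracting the first from the second, removes x:
        (a*d - b*c) * y = a*q - c*p
    Assumes a != 0 and a*d - b*c != 0.
    """
    a, b, p, c, d, q = (Fraction(v) for v in (a, b, p, c, d, q))
    y = (a * q - c * p) / (a * d - b * c)
    x = (p - b * y) / a  # back-substitute into the first equation
    return x, y


# Sample system (hypothetical numbers): 2x + 3y = 8 and x - y = -1.
x, y = solve_by_multipliers(2, 3, 8, 1, -1, -1)
print(x, y)
```

The multipliers play the same role here as in mechanics: they are chosen so that an unwanted quantity drops out of a linear combination of the equations.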


1.5.1 Generalization to a System of N Particles with k Constraint Relations 

The above considerations can be easily generalized to the motion of a system of $N$
particles with $k$ constraints, namely

$$g_i(\mathbf{r}_j, \dot{\mathbf{r}}_j, t) = 0 \quad \text{for } i = 1, \ldots, k$$

where $j$ in $\mathbf{r}_j$ and $\dot{\mathbf{r}}_j$ runs over all or a fraction of the particles. Let us define

$$h_i = \frac{dg_i}{dt} \quad \text{for the nonholonomic constraints, and}$$

$$h_i = \frac{d^2 g_i}{dt^2} \quad \text{for the holonomic ones.}$$

The force of constraint on the $j$th particle due to the imposition of the $i$th constraint is

$$\mathbf{f}_{ji} = \lambda_i\,\frac{\partial h_i}{\partial\ddot{\mathbf{r}}_j}$$

so that the total force of constraint on the $j$th particle is given by

$$\mathbf{f}_j = \sum_{i=1}^{k} \lambda_i\,\frac{\partial h_i}{\partial\ddot{\mathbf{r}}_j}$$

where $\lambda_1, \ldots, \lambda_k$ are the $k$ Lagrange multipliers that are introduced. Thus Newton’s
equation of motion for the $j$th particle having mass $m_j$ becomes

$$m_j\ddot{\mathbf{r}}_j - \mathbf{F}_j^{(a)} - \sum_{i=1}^{k} \lambda_i\,\frac{\partial h_i}{\partial\ddot{\mathbf{r}}_j} = 0, \qquad j = 1, \ldots, N \tag{1.23}$$

where $\mathbf{F}_j^{(a)}$ is the total external force applied on the $j$th particle.

These $N$ vector equations (or $3N$ scalar equations) apply to the cases of holonomic or
nonholonomic, and scleronomic or rheonomic constraints that are expressible in the bilateral
forms only. Note that the total number of equations one has to deal with is $3N + k$,
as the number of dynamical equations is $3N$ and the number of constraint equations is $k$.
Moreover, these are coupled equations (see the last summand in Eq. (1.23)) and hence their
integration is quite involved. For this reason Lagrange’s equations of the first kind find little
use in actual practice. Nevertheless, once solved, they provide a complete solution to the
dynamical problems of most diverse nature. The method is precise and complete.


1.5.2 An Example 

Let us study the motion of a simple pendulum oscillating in the $x$-$z$ plane. For this
system, the holonomic constraints are

$$g_1 = y = 0$$

and

$$g_2 = x^2 + y^2 + z^2 - l^2 = 0$$

The applied forces are $F_x = 0 = F_y$ and $F_z = mg$. Since there are two constraints $g_1$
and $g_2$, two multipliers, say $\lambda_1$ and $\lambda_2$, are needed. The equations of motion are

$$m\ddot{x} - \lambda_1\frac{\partial g_1}{\partial x} - \lambda_2\frac{\partial g_2}{\partial x} = 0$$

$$m\ddot{y} - \lambda_1\frac{\partial g_1}{\partial y} - \lambda_2\frac{\partial g_2}{\partial y} = 0$$

$$m\ddot{z} - mg - \lambda_1\frac{\partial g_1}{\partial z} - \lambda_2\frac{\partial g_2}{\partial z} = 0$$

Substituting for the partial derivatives,

$$\begin{aligned}
m\ddot{x} - 2\lambda_2 x &= 0 \\
m\ddot{y} - \lambda_1 - 2\lambda_2 y &= 0 \\
m\ddot{z} - mg - 2\lambda_2 z &= 0
\end{aligned} \tag{1.24}$$

Let us assume that the oscillations are of small amplitude, giving for
$z^2 = l^2 - x^2 - y^2$

$$z = l\sqrt{1 - \frac{x^2 + y^2}{l^2}} \simeq l$$

as $x^2 + y^2 \ll l^2$ (second order of smallness). Thus to the first order of smallness of the
amplitude, $z$ = constant, and therefore $\dot{z}$ and $\ddot{z}$ are negligibly small compared to $\dot{x}$ and $\ddot{x}$.
Putting $z = l$, $\ddot{z} = 0$, we get from the last of Eqs (1.24),

$$\lambda_2 = -\frac{mg}{2l}$$

Since $y = 0$ at every instant of time, $\dot{y} = \ddot{y} = 0$, giving $\lambda_1 = 0$ from the middle one
of Eqs (1.24). Finally the first equation of (1.24) gives

$$\ddot{x} + \frac{g}{l}\,x = 0$$

which describes an SHM in the $x$-coordinate with angular frequency $\omega = \sqrt{g/l}$. Moreover,
$\lambda_1 = 0$ and $\lambda_2 = -mg/2l$ imply that:

the $x$-component of the constraint force $= 2\lambda_2 x = -mgx/l = -mg\sin\theta \simeq -mg\theta$,
where $\theta$ is the angular position of the bob with respect to the local vertical,

the $y$-component of the constraint force $= \lambda_1 + 2\lambda_2 y = 0$, and

the $z$-component of the constraint force $= 2\lambda_2 z = -mgz/l = -mg\cos\theta \simeq -mg$.

These are nothing but the components of the tension in the string.
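The small-amplitude result can be cross-checked by integrating Eqs (1.24) numerically without the approximation $z \simeq l$. The sketch below is our own illustration, not the authors’ procedure. It sets $y = 0$ and $\lambda_1 = 0$, measures $z$ downward as in the text, and eliminates $\lambda_2$ with the acceleration-level condition $h = d^2 g_2/dt^2 = 0$, which gives $\lambda_2 = -m(\dot{x}^2 + \dot{z}^2 + gz)/[2(x^2 + z^2)]$. This reduces to $-mg/2l$ for a bob hanging at rest. The fourth-order Runge-Kutta integrator, the function names and all numerical values are assumptions.

```python
import math


def multiplier(x, z, vx, vz, m, g):
    """lambda_2 obtained from h = d^2 g_2/dt^2 = 0 with g_2 = x^2 + z^2 - l^2."""
    return -m * (vx * vx + vz * vz + g * z) / (2.0 * (x * x + z * z))


def derivative(state, m, g):
    """Time derivative of (x, z, vx, vz) from Eqs (1.24); z points downward."""
    x, z, vx, vz = state
    lam = multiplier(x, z, vx, vz, m, g)
    return (vx, vz, 2.0 * lam * x / m, g + 2.0 * lam * z / m)


def rk4_step(state, dt, m, g):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(s, k, f):
        return tuple(si + f * ki for si, ki in zip(s, k))
    k1 = derivative(state, m, g)
    k2 = derivative(shifted(state, k1, dt / 2), m, g)
    k3 = derivative(shifted(state, k2, dt / 2), m, g)
    k4 = derivative(shifted(state, k3, dt), m, g)
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def measure_period(theta0=0.05, l=1.0, m=1.0, g=9.81, dt=1e-3, max_steps=100000):
    """Release the bob from rest at angle theta0; return (period, max |g_2|)."""
    state = (l * math.sin(theta0), l * math.cos(theta0), 0.0, 0.0)
    t, crossings, drift = 0.0, [], 0.0
    for _ in range(max_steps):
        new = rk4_step(state, dt, m, g)
        if state[0] > 0.0 >= new[0] or state[0] < 0.0 <= new[0]:
            # x changed sign: locate the crossing by linear interpolation
            crossings.append(t + dt * state[0] / (state[0] - new[0]))
        state, t = new, t + dt
        drift = max(drift, abs(state[0] ** 2 + state[1] ** 2 - l * l))
        if len(crossings) == 2:
            # successive zero crossings of x are half a period apart
            return 2.0 * (crossings[1] - crossings[0]), drift
    raise RuntimeError("fewer than two zero crossings found")


period, drift = measure_period()
print(period, 2.0 * math.pi * math.sqrt(1.0 / 9.81), drift)
```

For a small release angle the measured period should lie close to $2\pi\sqrt{l/g}$, and the printed drift shows how well the integration keeps the bob on the circle $x^2 + z^2 = l^2$, even though only the twice-differentiated constraint is imposed.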


1.6 GIBBS-APPELL’S PRINCIPLE OF LEAST CONSTRAINT 


Willard Gibbs (1879) and later Paul Appell (1899) gave a new meaning to Lagrange’s
equations of motion of the first kind. Following their suggestions, let us define a quantity
called the kinetic energy of acceleration of a system of $N$ particles, given by

$$S = \frac{1}{2}\sum_{j=1}^{N} m_j\,\ddot{\mathbf{r}}_j\cdot\ddot{\mathbf{r}}_j \tag{1.25}$$

where velocity is replaced by acceleration in the usual expression for kinetic energy. Now
Lagrange’s equations of the first kind as given in Eq. (1.23) reduce to the form

$$\frac{\partial G}{\partial\ddot{\mathbf{r}}_j} = 0, \qquad j = 1, \ldots, N \tag{1.26}$$

where

$$G = S - \sum_{j=1}^{N} \mathbf{F}_j^{(a)}\cdot\ddot{\mathbf{r}}_j - \sum_{i=1}^{k} \lambda_i h_i \tag{1.27}$$

is a scalar point function of acceleration formed out of known quantities such as externally
applied forces and constraint equations. This is called Gibbs-Appell’s form of the equations
of motion. More recently, T. R. Kane (1969) has developed a new scheme of setting up
equations of motion, which is in spirit quite similar to Gibbs-Appell’s.

Furthermore, it is easy to check that

$$\frac{\partial^2 G}{\partial\ddot{\mathbf{r}}_j^2} = \frac{\partial^2 S}{\partial\ddot{\mathbf{r}}_j^2} = m_j > 0$$

So the function $G$ is such that its first derivative with respect to the acceleration $\ddot{\mathbf{r}}_j$ is zero
by requirement of the equation of motion and its second derivative with respect to $\ddot{\mathbf{r}}_j$ is
positive as the mass of any particle is greater than zero. This means that Gibbs-Appell’s
form of the equations of motion expresses a minimum of $G$ with respect to all possible variations
of $\ddot{\mathbf{r}}_j$. The function $G$ is called Gibbs-Appell’s least constraint function. Gibbs-Appell’s
principle of least constraint states that for a given set of position vectors $\mathbf{r}_j$ and velocities
$\dot{\mathbf{r}}_j$, $j = 1, \ldots, N$, the Gibbs-Appell function $G(\mathbf{r}, \dot{\mathbf{r}}, \ddot{\mathbf{r}})$ is a minimum if and only if the
accelerations $\ddot{\mathbf{r}}_j$ $(j = 1, \ldots, N)$ are chosen to be the measured or the actual ones. This
effort of Gibbs and Appell can be viewed as an early attempt to geometrise dynamics.
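A small numerical experiment makes the minimum property tangible. The sketch below is an assumption-laden illustration, not taken from the text: it considers a single particle on a fixed frictionless incline of angle alpha, restricts the trial accelerations to those along the slope (so that $h = 0$ and the multiplier term of Eq. (1.27) drops out), and minimises $G = S - \mathbf{F}^{(a)}\cdot\ddot{\mathbf{r}}$ by a ternary search. The names, the search interval and the parameter values are hypothetical.

```python
import math


def gibbs_appell(acc, force, mass):
    """G = S - F.a for one particle; the constraint term vanishes when h = 0."""
    s = 0.5 * mass * sum(a * a for a in acc)
    return s - sum(f * a for f, a in zip(force, acc))


def constrained_minimum(force, mass, tangent, lo=-50.0, hi=50.0, iters=200):
    """Minimise G over accelerations a = s * tangent by ternary search in s."""
    for _ in range(iters):
        s1 = lo + (hi - lo) / 3.0
        s2 = hi - (hi - lo) / 3.0
        g1 = gibbs_appell([s1 * t for t in tangent], force, mass)
        g2 = gibbs_appell([s2 * t for t in tangent], force, mass)
        if g1 < g2:
            hi = s2  # the minimum cannot lie to the right of s2
        else:
            lo = s1  # the minimum cannot lie to the left of s1
    return 0.5 * (lo + hi)


alpha = math.radians(30.0)
m, g = 2.0, 9.81
tangent = (math.cos(alpha), -math.sin(alpha))  # unit vector pointing down the slope
weight = (0.0, -m * g)                         # y is measured upward here
s_min = constrained_minimum(weight, m, tangent)
print(s_min, g * math.sin(alpha))
```

The acceleration that minimises $G$ among the admissible ones should agree with the familiar value $g\sin\alpha$ for a block sliding down a smooth incline.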


1.7 D’ALEMBERT’S PRINCIPLE 

So far our approach has been to evaluate the forces of constraints and solve Newton’s
equations of motion. This is obviously a tedious procedure as is apparent in the example 
worked out above. Now, the question is, can we totally eliminate or bypass the determination 
of the forces of constraints? The answer as we know today is an affirmative one, provided we 
stop referring to forces, and deal directly with work, or kinetic or potential energy, and/or 
introduce non-Cartesian coordinates, if such coordinates appear to be more natural than 
the Cartesian ones. Following the historical sequence of developments, first we move on to 
a picture based on the concept of work, or more precisely, virtual work. 

We have seen that constraint forces do work in dissipative and rheonomic systems. For 
a dissipative system such as one having friction, the constraint forces can be taken care of 
by promoting them to the list of externally applied forces. The real problem would then lie 
only with rheonomic systems. These systems can be handled by using the idea of virtual 
work. 

1.7.1 Conditions for Vanishing Virtual Work due to Constraint Forces Alone 

We have seen earlier that the principle of virtual work involving only applied forces holds
good if the virtual work done by the constraint forces vanishes identically. We now proceed
to obtain general conditions under which this is valid. The work done by the constraint
forces arising due to the specified constraints, under arbitrary but infinitesimally small
virtual displacements of the particles compatible with the constraints is

$$\delta W = \sum_{j=1}^{N} \mathbf{f}_j\cdot\delta\mathbf{r}_j = \sum_{j=1}^{N}\sum_{i=1}^{k} \lambda_i\,\frac{\partial h_i}{\partial\ddot{\mathbf{r}}_j}\cdot\delta\mathbf{r}_j \tag{1.28}$$

Now $\delta W$ vanishes identically for the following two types of constraints, namely, for

1. all holonomic constraints: We have, by virtue of Eq. (1.20),

$$\delta W = \sum_{i=1}^{k} \lambda_i \sum_{j=1}^{N} \frac{\partial g_i}{\partial\mathbf{r}_j}\cdot\delta\mathbf{r}_j = \sum_{i=1}^{k} \lambda_i\,\delta g_i = 0$$

because all $\delta g_i = 0$ trivially by virtue of the constraint equations $g_i = 0$.

2. those nonholonomic constraints which are homogeneous functions of velocities except
for an additive function of position and time: By virtue of the definition of homogeneous
function of velocities of order $n$, we must have

$$g_i(\mathbf{r}, \alpha\dot{\mathbf{r}}, t) = \alpha^n g_i(\mathbf{r}, \dot{\mathbf{r}}, t) \tag{1.29}$$

where $\alpha$ is any scalar constant and $n$ is a non-negative integer. For example,

$$g = a(x,y,t)\,\dot{x}^3 + b(x,y,t)\,\dot{y}^3 + c(x,y,t)\,\dot{x}^2\dot{y} + d(x,y,t)\,\dot{x}\dot{y}^2 + 3e(x,y,t)$$

satisfies this required condition for $n = 3$, except for the additive function $e$ of coordinates
and time. For such nonholonomic cases, consider

$$\delta W = \sum_{i} \lambda_i \sum_{j} \frac{\partial g_i}{\partial\dot{\mathbf{r}}_j}\cdot\delta\mathbf{r}_j \tag{1.30}$$

To ensure

$$\sum_{j} \frac{\partial g_i}{\partial\dot{\mathbf{r}}_j}\cdot\delta\mathbf{r}_j = 0 \tag{1.31}$$

the functions $\partial g_i/\partial\dot{\mathbf{r}}_j$ must satisfy

$$\sum_{j} \frac{\partial g_i}{\partial\dot{\mathbf{r}}_j}\cdot d\mathbf{r}_j + n J_i(\mathbf{r}_j, t)\,dt = 0$$

for any real displacement $d\mathbf{r}_j$ in real time $dt$, because the above equation reduces to Eq.
(1.31) when the displacement $d\mathbf{r}_j$ is replaced by a virtual displacement $\delta\mathbf{r}_j$, as this presumes
the condition $dt = 0$ for all virtual displacements. Here $J_i(\mathbf{r}_j, t)$ can be any arbitrary
function of coordinates and time. The above condition can now be rewritten as

$$\sum_{j} \frac{\partial g_i}{\partial\dot{\mathbf{r}}_j}\cdot\dot{\mathbf{r}}_j + n J_i(\mathbf{r}_j, t) = 0$$

The first summand will cancel with $n J_i(\mathbf{r}_j, t)$ if the $g_i$’s are purely homogeneous functions of
velocities of any order $n$ plus a term $J_i(\mathbf{r}_j, t)$: by Euler’s theorem on homogeneous functions
the first summand equals $n$ times the homogeneous part of $g_i$, which is $-J_i$ when $g_i = 0$.
Thus those nonholonomic constraints which
are homogeneous functions of velocities plus a suitable additive function of coordinates and
time can have a set of nonzero $J_i(\mathbf{r}_j, t)$ such that a total of zero virtual work is done by all
the constraint forces.

Under these two types of situations, the virtual work done by the constraint forces
vanishes. Therefore, the necessary condition for static equilibrium becomes

$$\sum_{j} \mathbf{F}_j^{(a)}\cdot\delta\mathbf{r}_j = 0 \tag{1.16}$$

where $\mathbf{F}_j^{(a)}$ is the total applied force on the $j$th particle.

1.7.2 D’Alembert’s Principle 

We can use the above condition for vanishing virtual work done by the constraint forces for
any dynamical system, not necessarily in static equilibrium. For the $j$th particle in such a
system, Newton’s equation of motion reads as

$$\mathbf{F}_j^{(a)} + \mathbf{f}_j = \dot{\mathbf{p}}_j \tag{1.32}$$

where $\mathbf{f}_j$ is the constraint force, $\mathbf{F}_j^{(a)}$ is the applied force and $\mathbf{p}_j$ is the linear momentum,
all pertaining to the $j$th particle. Now the system also satisfies, by requirement of vanishing
virtual work done by all constraint forces,

$$\delta W = \sum_{j} \mathbf{f}_j\cdot\delta\mathbf{r}_j = 0 \tag{1.33}$$

Taking the scalar product of each term in Eq. (1.32) with the infinitesimal virtual
displacement $\delta\mathbf{r}_j$ of the $j$th particle and summing over all particles of the system, we get after
accounting for Eq. (1.33)

$$\sum_{j=1}^{N} \left(\mathbf{F}_j^{(a)} - \dot{\mathbf{p}}_j\right)\cdot\delta\mathbf{r}_j = 0 \tag{1.34}$$

where $-\dot{\mathbf{p}}_j$ appears as an effective force called the reverse force of inertia on the $j$th
particle, supplementing the already existing externally applied force $\mathbf{F}_j^{(a)}$. The key point is
that in Eq. (1.34) we have got rid of the constraint forces $\mathbf{f}_j$. Equation (1.34) generalises
the principle of virtual work to a far wider class that includes the dynamical systems
encompassing all holonomic and a large class of nonholonomic systems for which the virtual work
of the constraint forces vanishes. Equation (1.34) is commonly called D’Alembert’s principle
after its propounder, who published it in 1743.

Remarks: (i) Unlike Newton’s $3N$ equations of motion, D’Alembert’s principle is just one
equation of motion.

(ii) D’Alembert’s principle does not involve the forces of constraints in any way. So it is
sufficient to specify all the applied forces only.

(iii) It is so general that even a knowledge of the constraint relations is not explicitly
required except for determining $\delta\mathbf{r}_j$.

(iv) Its validity extends to all rheonomic and scleronomic systems that are either
holonomic, or homogeneous nonholonomic (see the previous subsection). All such systems for
which D’Alembert’s principle is valid may be called D’Alembertian systems.

(v) The inertial force $-\dot{\mathbf{p}}_j$ is introduced to reduce the problem of dynamics to one of
statics (compare Eq. (1.34) with Eq. (1.16)).

(vi) The force of inertia can be regarded as an inertial force arising in an accelerated
frame of reference. In this sense, this is a forerunner of the equivalence principle of the
general theory of relativity. In a freely falling accelerated frame $\mathbf{F}_j^{(a)} - \dot{\mathbf{p}}_j = 0$: gravity is
nullified by the inertial force even though the whole system is in motion and not in either
static or dynamic equilibrium.

(vii) D’Alembert’s principle, it is often said, gives a complete solution to the problems
of mechanics. All the different principles of mechanics are merely mathematically different
formulations of D’Alembert’s principle. Hamilton’s principle (see chapter 6) can also be
obtained from D’Alembert’s principle. However, one should remember that Lagrange’s
equations of the first kind in the form given above are even more general, as they are valid
for all holonomic and nonholonomic systems.

(viii) D’Alembert’s principle is more elementary than the variational principles on another
account: it requires no integration with respect to time. However, the disadvantage is that
the virtual work of the inertial force is polygenic and thus is not reducible to a single scalar
function. This makes the principle most unsuitable for the use of curvilinear coordinates.

1.7.3 Some Applications of D’Alembert’s Principle 

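(i) A spherical pendulum of varying length: A particle of mass $m$ is suspended by a
massless wire of length

$$r = a + b\cos(\omega t) \qquad (a > b > 0) \tag{1.35}$$

to form a spherical pendulum. Below we find the virtual displacements and the equations
of motion using D’Alembert’s principle.

Let $r$, $\theta$ and $\phi$ be the spherical polar coordinates of the particle. The acceleration of the
particle is given by (see Appendix A2)

$$\begin{aligned}
\ddot{\mathbf{r}} = {} & (\ddot{r} - r\dot{\theta}^2 - r\dot{\phi}^2\sin^2\theta)\,\hat{\mathbf{r}} + (r\ddot{\theta} + 2\dot{r}\dot{\theta} - r\dot{\phi}^2\sin\theta\cos\theta)\,\hat{\boldsymbol{\theta}} \\
& + (r\ddot{\phi}\sin\theta + 2\dot{r}\dot{\phi}\sin\theta + 2r\dot{\theta}\dot{\phi}\cos\theta)\,\hat{\boldsymbol{\phi}}
\end{aligned} \tag{1.36}$$

Virtual displacement must be consistent with the instantaneous constraint, namely $r$ =
constant; therefore,

$$\delta\mathbf{r} = r\,\delta\theta\,\hat{\boldsymbol{\theta}} + r\sin\theta\,\delta\phi\,\hat{\boldsymbol{\phi}} \tag{1.37}$$

The external force on the system is

$$\mathbf{F} = -mg\,\hat{\mathbf{k}} = -mg(\cos\theta\,\hat{\mathbf{r}} - \sin\theta\,\hat{\boldsymbol{\theta}}) \tag{1.38}$$

By D’Alembert’s principle, we must have

$$(\mathbf{F} - m\ddot{\mathbf{r}})\cdot\delta\mathbf{r} = 0 \tag{1.39}$$

which when coupled with Eqs (1.35)-(1.38) gives the equations of motion in $\theta$ and $\phi$.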
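Carrying this last step through explicitly gives the following. This is a sketch of our own, obtained by substituting Eqs (1.36)-(1.38) into Eq. (1.39) and setting the coefficients of the independent variations $\delta\theta$ and $\delta\phi$ separately to zero. The common factor $mr$ has been divided out, and the conserved form of the $\phi$ equation assumes $r\sin\theta \neq 0$.

```latex
\begin{align*}
\delta\theta:&\quad r\ddot{\theta} + 2\dot{r}\dot{\theta} - r\dot{\phi}^2\sin\theta\cos\theta - g\sin\theta = 0, \\
\delta\phi:&\quad r\ddot{\phi}\sin\theta + 2\dot{r}\dot{\phi}\sin\theta + 2r\dot{\theta}\dot{\phi}\cos\theta = 0
  \;\Longleftrightarrow\; \frac{d}{dt}\!\left(r^2\dot{\phi}\sin^2\theta\right) = 0, \\
&\quad r = a + b\cos\omega t, \qquad \dot{r} = -b\,\omega\sin\omega t .
\end{align*}
```

The radial component of the acceleration never enters, because the virtual displacement has no radial part. The time dependence of the wire length shows up only through $r$ and $\dot{r}$ in the two remaining equations.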

(ii) An incline that makes an angle $\alpha$ with the horizontal is given a horizontal acceleration
of magnitude $a$ in the vertical plane of the incline, in order to prevent the sliding of any
frictionless block placed on the incline. We want to find the value of $a$.

Virtual displacement must be consistent with the instantaneous constraint, that is,
$\delta x = \delta l\cos\alpha$, $\delta y = \delta l\sin\alpha$, where $\delta l$ is a possible virtual displacement along the incline.
The forces applied on the system are $F_x = 0$ and $F_y = mg$. We do not bother about
constraint forces. By D’Alembert’s principle, we have

$$F_x\,\delta x + F_y\,\delta y - m a_x\,\delta x - m a_y\,\delta y = 0$$

or

$$0 + mg\,\delta l\sin\alpha - ma\,\delta l\cos\alpha - 0 = 0$$

since $a_y = 0$ (given) and $a_x = a$ (also given). This gives

$$a = g\tan\alpha$$

which is the required horizontal acceleration of the incline in order to prevent the sliding of
the block placed on the incline.
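The result is easy to verify numerically. The following sketch is illustrative only, with function names and parameter values of our own choosing. It evaluates the left-hand side of D’Alembert’s principle for a block that shares the horizontal acceleration of the incline, with $y$ measured downward as in the text.

```python
import math


def holding_acceleration(alpha, g=9.81):
    """Horizontal acceleration of the incline that keeps the block from sliding."""
    return g * math.tan(alpha)


def dalembert_residual(a, alpha, m=1.0, g=9.81, dl=1.0):
    """(F - m*acc) . delta_r for a block at rest relative to the incline."""
    dx, dy = dl * math.cos(alpha), dl * math.sin(alpha)  # virtual displacement
    fx, fy = 0.0, m * g   # applied force; y is measured downward
    ax, ay = a, 0.0       # the block shares the incline's acceleration
    return fx * dx + fy * dy - m * ax * dx - m * ay * dy


alpha = math.radians(30.0)
a = holding_acceleration(alpha)
print(a, dalembert_residual(a, alpha), dalembert_residual(0.0, alpha))
```

The residual vanishes (to rounding) only for $a = g\tan\alpha$. For an incline at rest it equals $mg\,\delta l\sin\alpha$, the virtual work that sets the block sliding.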




1.8 SOME ADDITIONAL REMARKS 


(i) We have already seen that systems having rheonomic constraints are nonconservative.
Real work done by forces due to, say, holonomic, rheonomic constraints can be calculated
as follows. We have

$$\frac{dW}{dt} = \sum_{j}\sum_{i} \lambda_i\,\frac{\partial g_i}{\partial\mathbf{r}_j}\cdot\frac{d\mathbf{r}_j}{dt} = \sum_{i} \lambda_i\left(\frac{dg_i}{dt} - \frac{\partial g_i}{\partial t}\right)$$

Since $g_i = 0$, $dg_i/dt = 0$, and the constraints being rheonomic, that is, $\partial g_i/\partial t \neq 0$, we
have

$$\frac{dW}{dt} = -\sum_{i} \lambda_i\,\frac{\partial g_i}{\partial t} \neq 0$$

Therefore any holonomic but rheonomic system is in general nonconservative. Note,
however, that the sum $\sum_i \lambda_i\,(\partial g_i/\partial t)$ may still vanish even if none of the individual $\partial g_i/\partial t$’s
is zero. This makes some room for exceptional cases.

The real work done by the constraint forces can be obtained by integrating the above
equation. The negative sign in the equation does not mean that work is always done on the
system by the constraint forces, as the actual sign will depend on the signs of individual
factors and terms in the summand.

(ii) Consider a system with unilateral constraints in equilibrium. Physically, a unilateral 
constraint is realised by a surface which divides the total space into two regions. The motion 
of the system is restricted to only one of the two regions and the system is not allowed to 
move across the surface. Therefore, when the system is on this dividing surface, the virtual 
displacements cannot have components in the direction of the unit normal pointing inside 
the prohibited region. However, since the system is driven to this surface under the action of 
the applied forces, these must have components in the direction of the unit normal defined 
above. Therefore, allowed virtual displacement can have only negative components in the 
direction of the applied forces. This means that the condition for equilibrium, Eq. (1.16) 
gets replaced by 

$$\delta W_a = \sum_{i} \mathbf{F}_i^{(a)}\cdot\delta\mathbf{r}_i \le 0$$

Equality is applied when the system is not on the constraint surface. Thus for example, for 
a ball hung from a ceiling by a string, a downward virtual displacement from its equilibrium 
position is not allowed while an upward virtual displacement is allowed in which case work 
done by the applied force of gravity is negative. Note that for a 1-D motion, the constraining 
‘surface’ is a point, for a 2-D motion it is a curve and for 3-D motion, it is a closed surface. 


1.9 WORK-ENERGY RELATION FOR CONSTRAINT FORCES OF SLIDING FRICTION

Friction plays a great role in our day-to-day life. Without friction, we could not walk,
cars could not move on the roads unless fired by a rocket, we could not hang a thing
on or stick to the wall, and so on. The force of friction between two surfaces depends
simultaneously on how hard they are pressed against each other by a force normal to the
surface of contact and on the forces of pull or push that act parallel to the surface of contact.
The forces of pull or push remain exactly balanced by the forces of friction that develop at
the interface, up to limits set by the coefficient of static friction and the normal component
of the pressure force. Since the point of contact does not move till the sliding occurs, the
force of friction does not find a chance to do work on the bodies and the energy of the
system remains unaffected. However, in the presence of sliding friction, the mechanical
energy of the system gets gradually converted into heat, and therefore it is the first law
of thermodynamics, rather than the simple conservation of mechanical energy, that would
be the most appropriate conservation law to be applied. A usual paradox goes like this:
suppose a block is dragged at constant speed across a table with friction. The applied force
of magnitude $f$ acting through a distance $d$ does an amount of work $fd$. The frictional force
$\mu_k N$ ($= f$, since no acceleration is observed) does an amount of work $-\mu_k N d = -fd$,
thus suggesting the total work $= fd - fd = 0$. For a point particle, the work done is equal
to the change in kinetic energy, leading to the result that $fd - fd = 0 = \Delta(mv^2/2)$,
that is, $v$ does not change. This is fine, but where is the energy term representing the
increased internal energy of the block? Thus the above treatment gives the right answer for
the speed of the block, but not the correct work-energy relation. It ought to incorporate
the first law of thermodynamics.

Moreover, for calculating the frictional work, the $d$ used for the work done by the applied
forces cannot be used. It is true that the block has moved through a distance $d$, but the
frictional force at the interface has not worked through all that distance. Sherwood and
Bernard (1984) suggest that $d$ should be replaced by $d_{\text{eff}} < d$ for calculating the frictional
work; the exact relation between $d_{\text{eff}}$ and $d$ should depend on the nature of the two surfaces
in sliding contact. When two surfaces are of identical nature $d_{\text{eff}} = d/2$, when the sliding
upper block is soft and the resting lower block is very hard $d_{\text{eff}} = 0$, and when the sliding
upper block is hard and the resting lower block is soft $d_{\text{eff}} = d$. One should also consider
the heat exchange across the surface, depending on the thermal conductivity of the sliding
blocks.

When a block slides through a distance $d$ down an incline (angle of inclination $\alpha$) with
friction, the block’s hot teeth are continually transferring heat to the new cold regions of
the incline, so that the possible heat transfer from block to incline $|Q|$ is not negligible.
Newton’s method would suggest the work-energy relation be given by

$$(mg\sin\alpha - \mu_k N)\,d = \Delta\!\left(\tfrac{1}{2}mv^2\right) \tag{1.39}$$

but from the first law of thermodynamics, the Sherwood equation would give

$$(mg\sin\alpha)\,d - \mu_k N d_{\text{eff}} - |Q| = \Delta\!\left(\tfrac{1}{2}mv^2\right) + \Delta E_{\text{thermal of block}} \tag{1.40}$$

However, Eq. (1.40) alone is also incomplete as the effective displacement $d_{\text{eff}}$ of the
frictional force is unknown. So one has to combine the Eqs (1.39) and (1.40), giving

$$\mu_k N\,(d - d_{\text{eff}}) = \Delta E_{\text{thermal of block}} + |Q|$$

Since the right hand side is positive, $d_{\text{eff}} < d$. Further, if we consider the universe as the
closed system (that is, block + incline + earth) the total change in the thermal energy of
the universe would be simply $\mu_k N d$.
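The bookkeeping implied by these relations can be laid out in a short sketch. Everything numerical here is hypothetical: the function name, the sample values of $\mu_k$, $N$, $d$, $d_{\text{eff}}$ and $|Q|$ are ours, and assigning the remainder of the dissipated energy to the incline and surroundings is our inference from the statement that the total thermal energy change is $\mu_k N d$.

```python
def friction_energy_budget(mu_k, normal, d, d_eff, heat_out):
    """Split the frictional dissipation mu_k*N*d following Eqs (1.39)-(1.40).

    heat_out is |Q|, the heat passed from the block to the incline.
    Returns (total thermal energy, share of the block, remaining share).
    """
    total_thermal = mu_k * normal * d
    # Combined relation: mu_k*N*(d - d_eff) = dE_thermal(block) + |Q|
    block_thermal = mu_k * normal * (d - d_eff) - heat_out
    rest_thermal = total_thermal - block_thermal  # incline and surroundings
    return total_thermal, block_thermal, rest_thermal


# Hypothetical numbers: mu_k = 0.3, N = 10 N, d = 2 m, d_eff = 1 m, |Q| = 0.5 J.
total, block, rest = friction_energy_budget(0.3, 10.0, 2.0, 1.0, 0.5)
print(total, block, rest)
```

With identical surfaces ($d_{\text{eff}} = d/2$) and no heat transfer, the two bodies would share the dissipated energy equally. The heat $|Q|$ shifts part of the block’s share to the incline.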


1.10 SUMMARY 

Constraints are defined and classified. In holonomic cases, these are algebraic equations 
of the surfaces on which motion of the system is constrained. In nonholonomic cases, the 
constraint equations are irreducible functions of velocities. If the surfaces of constraints in 
a given problem change with time, the constraints are rheonomic, otherwise scleronomic. 
Other classifications are not so essential. 

Forces are broadly classified into two main types. Forces that appear to be capable of 
producing actions due to some well known external force-bearing agents are grouped into 
external or externally applied forces. The rest, arising from the details of constraining are 
grouped into forces of constraints. 

Since by definition, the external or applied forces are all known forces, the key problem 
of dynamics is either to determine all the forces of constraints or eliminate them from the 
final equations of motion. In any case, if we want to retain Newton’s form of the equations 
of motion, we have to determine the forces of constraints. Demanding that the constraint 
relations must contain all the necessary information on the forces of constraints, it is possible 
to have a satisfactory formulation commonly known as Lagrange’s equations of motion of 
the first kind. 

The other approach, namely, eliminating all the terms bearing the forces of constraints 
from the final equation of motion, using notions of virtual displacements and virtual work is 
due to D’Alembert. It is shown that virtual work done by constraint forces vanishes totally 
for all holonomic constraints and also for a subset of nonholonomic constraints. D’Alembert’s 
principle is therefore considered to be far more powerful than Newton’s equations of motion. 

In Gibbs-Appell’s form, dynamical problems involving constraints find a direct geomet¬ 
rical interpretation in the sense that natural paths under the given constraints maintain 
Gibbs-Appell’s function at their minima. 

Finally, the section on constraint forces due to sliding friction tries to convey the message 
that the phenomenon of sliding friction is still not well understood. The work energy relation 
cannot just be handled by dynamics alone, the first law of thermodynamics has to step in 
and this must require the knowledge of the quality of roughness of the sliding surfaces. 

PROBLEMS 

1.1 Write down the equations of constraints in Cartesian coordinates for the following
dynamical systems and categorize them according to the classification of the constraints:
(i) A small rigid rod of length $l$ is allowed to move in any manner inside a balloon of
fixed radius $R > l$, the end parts of the rod always touching the balloon’s surface.
What changes would it have, if $R$ is now $R(t) = R + at$, say, $a$ being a small
constant?

(ii) A piece of flexible but nonextensible string of length $l_0$ is tied to the ceiling at
a horizontal separation of $l < l_0$. A heavy bead is allowed to slide in any manner
without any friction along the string. Because of the weight of the bead the string is 
all the time stretched into a ‘V’ shape. 

(iii) The motion of a ship on the surface of the earth when the earth is expanding 
slowly with time. What would be the general nature of the constraints if the earth is 
further assumed to rotate 20 times faster than its present rate? 

(iv) A pair of cartwheels of radius R , the centres of which are connected by a rigid 
shaft of length l is allowed to roll without slipping down an incline that makes an 
angle a with the horizontal. 

(v) The motion of the filament of a bulb that is socketed in a table lamp stand which 
consists of a fixed base and two rigid stems with two flexible spherical joints before 
meeting the bulb holder. Does it make any difference if the filament is assumed to be 
a part of a rigid line or a coiled piece of wire? 

(vi) The motion of a crank-shaft connecting a moving piston on one side and the spoke 
of a wheel on the other. 

1.2 Prove that D’Alembert’s equation represents the conservation of energy, if virtual 
displacements are regarded as real ones. 

1.3 A mass point moves on the outside surface of the upper hemisphere of a globe. Let 
its initial position r and initial velocity v be arbitrary, except that the latter is to 
be tangential to the surface of the sphere. The motion is to be frictionless, occurring 
solely under the influence of gravity. Investigate the problem in terms of the Cartesian 
coordinates only and find at what height from the centre the particle should jump off 
the sphere. 

1.4 A block of mass m slides down on a frictionless incline. Solve for its equations of 
motion using D’Alembert’s principle and Lagrange’s method. 

1.5 Find the equations of motion of a solid sphere rolling down on an incline using Lagrange 
multipliers for the rolling constraints. What would be the work energy relation if there 
is slipping as well as rolling? 

1.6 A particle of mass m is suspended by a massless wire of length $r = a + b\cos\omega t$ 
($a > b > 0$) to form a spherical pendulum. Find the virtual displacements and the 
equations of motion using D’Alembert’s principle. 

1.7 Inspired by Gibbs-Appell’s principle of least constraint, Heinrich Hertz in 1896 derived 
his principle of least curvature for systems under the action of no external forces. 
Take the mass of all the particles to be unity and use Gibbs-Appell’s principle of least 
constraint to derive Hertz’s principle of least curvature which states that ‘every free 
system remains in a state of rest or of uniform motion along the path of the least 
curvature’. The curvature of the trajectory is defined as $\sqrt{K}$, where 

$$K = \sum_{i=1}^{3N} \left( \frac{d^2 x_i}{ds^2} \right)^2,$$

ds being the line element in a 3N dimensional Euclidean space, given by 
$ds^2 = \sum_{i=1}^{3N} dx_i^2$, and N being the number of particles. 

1.8 A bead of mass m is constrained to slide down a frictionless right circular helical wire 
under the influence of gravity. Assume the axis of the helix to be vertical (z-axis), a 
the radius of the helix, and $2\pi b$ the constant pitch length, so that the parametric equation 
of the helix can be written as 
$\mathbf{r}(\lambda) = a\cos\lambda\,\hat{\imath} + a\sin\lambda\,\hat{\jmath} + b\lambda\,\hat{k}$. 
Find the constraint force and an explicit solution to the equations of motion. If the wire 
had been in the shape of a parabola, say $x^2 = 4az$, $a > 0$, what would have been the 
equation of motion? 






2 

Lagrangian Formulation in 
Generalised Coordinates 


2.0 INTRODUCTION 

In the previous chapter, we have seen with great relief that D’Alembert’s principle does 
not require a knowledge of the forces of constraints, although the constraint equations are 
of course needed for providing the virtual displacements. This is fine, but our problem is 
to obtain the solutions for 3N Cartesian coordinates (N being the number of particles 
involved). Lagrange’s equations of the first kind are 3 N in number, whereas D’Alembert’s 
principle is a single equation. Coupled with the given constraint equations, the former set 
of equations but not the latter is capable of providing us with the complete solution to the 
problem. This is the basic limitation of D’Alembert’s principle. 

It was once again Joseph Louis Lagrange (1736 - 1813) who at the age of 19 conceived of, 
and at the age of 23 formulated, an ingenious analytical method that allowed him to extract 
a sufficient number of independent equations from just the one, D’Alembert’s principle. He 
finally published the work in the form of a book entitled Mechanique Analytique , although he 
had finished the manuscript 6 years earlier. Breaking the tradition of all his predecessors, 
Lagrange did not put in his book a single diagram, or a construction, or geometrical or 
mechanical reasoning. The book was just full of algebraic operations. Sir William Rowan 
Hamilton later described the book as a ‘scientific poem’; E. T. Bell called it ‘the finest 
example in all science of the art of getting something out of nothing’. 

At Turin, the boyish professor Lagrange used to lecture to students who were all older than 
himself, and he founded the Turin Academy of Sciences at the age of 22. Lagrange sent some 
of his works to Euler when he was still in his teens. Euler, who had long struggled with his 
semigeometrical methods to tackle the problem of isoperimetry, found Lagrange’s new scheme 
straightforward and elegant and solved the problem immediately. He and D’Alembert schemed to 
get Lagrange to the Berlin Academy. On the invitation of King Frederick the Great, 
Lagrange joined the Berlin Academy in 1766 to succeed Euler, and Euler returned to St. 
Petersburg in Russia. 

Lagrange got the grand prize of the French Academy of Sciences in 1764 for solving the 
problem of libration of the moon, that is, why we see the same face of the moon all the 
time. He again captured the prestigious prize in 1766 by explaining the inequalities in the 
motion of the satellites of Jupiter. He captured it for the third time in 1772 by solving the 
three body problem in gravitation, for the fourth time in 1774 for his theory of the motion 
of the moon, and for the fifth time in 1778 for the calculations of the perturbations of 
planets on cometary orbits. Lagrange joined the French Academy in 1787 on the invitation 
of Louis XVI; the French revolution began and his close friend Lavoisier was guillotined, but 
Lagrange was saved because he never criticised anybody. He was famous for his standard 
reply ‘I do not know’ to anything and everything that was controversial. After the French 
revolution, ‘Ecole Normale’ was established in 1795 and Lagrange was appointed a professor 
of mathematics. This institute was closed in 1797 and instead ‘Ecole Polytechnique’ was 
founded in the same year, where Lagrange found a position, which he held till he died 
in 1813. He finished another two books, The Theory of Analytic Functions in 1797 and 
The Lectures on the Calculus of Functions in 1801, whose limitations made young Cauchy 
develop the theory of complex variables. 


2.1 CHANGE OF NOTATION 


Consider a system of N particles. We rename and arrange the Cartesian coordinates and 
masses of all the particles in the following order. 


Particle no 1 :    Coordinates                  Mass 
                   $x_1 \to x_1$                $m_1 \to m_1$ 
                   $y_1 \to x_2$                $m_1 \to m_2$ 
                   $z_1 \to x_3$                $m_1 \to m_3$ 

Particle no 2 :    Coordinates                  Mass 
                   $x_2 \to x_4$                $m_2 \to m_4$ 
                   $y_2 \to x_5$                $m_2 \to m_5$ 
                   $z_2 \to x_6$                $m_2 \to m_6$ 

and so on. Finally, for 

Particle no N :    Coordinates                  Mass 
                   $x_N \to x_{3N-2}$           $m_N \to m_{3N-2}$ 
                   $y_N \to x_{3N-1}$           $m_N \to m_{3N-1}$ 
                   $z_N \to x_{3N}$             $m_N \to m_{3N}$ 


The rth particle has Cartesian coordinates $(x_{3r-2}, x_{3r-1}, x_{3r})$ and mass 
$m_{3r-2} = m_{3r-1} = m_{3r}$ (the original mass $m_r$). With this notation D’Alembert’s 
principle (see Eq. 1.34) takes the following form: 

$$\sum_{i=1}^{3N} \left( m_i \frac{d^2 x_i}{dt^2} - F_i \right) \delta x_i = 0 \tag{2.1}$$


This is a single equation with 3N bracketed terms in series, the sum being equated to zero. 
Because of the constraint relations the $\delta x_i$ are not all arbitrary. In other words, the 
$\delta x_i$ are not all linearly independent, and the expression within each parenthesis need 
not vanish separately. 
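
The renumbering introduced at the start of this section can be sketched in a few lines of Python. This is only an illustration: the function names `flat_indices` and `flatten` and the sample numbers are our own assumptions, not part of the text. The sketch maps particle r to the indices (3r - 2, 3r - 1, 3r) and repeats each mass three times, once per Cartesian component.

```python
def flat_indices(r):
    """Indices (3r-2, 3r-1, 3r) of x, y, z for particle number r (r = 1, 2, ...)."""
    if r < 1:
        raise ValueError("particle numbers start at 1")
    return (3 * r - 2, 3 * r - 1, 3 * r)


def flatten(positions, masses):
    """Turn [(x, y, z), ...] and [m, ...] into the lists x_1..x_3N and m_1..m_3N."""
    xs = [component for position in positions for component in position]
    ms = [mass for mass in masses for _ in range(3)]  # each mass appears three times
    return xs, ms


# Hypothetical two-particle system, used only to show the bookkeeping.
xs, ms = flatten([(1.0, 2.0, 3.0), (4.0, 5.0, 6.0)], [10.0, 20.0])
print(flat_indices(2))  # indices carried by the second particle
print(xs)
print(ms)
```

Note that the lists are stored from position 0 in Python, so the coordinate called $x_i$ in the text sits at `xs[i - 1]`.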

In the above notation, Lagrange’s equations of the first kind become 


$$m_i \frac{d^2 x_i}{dt^2} - F_i - \sum_{r=1}^{k+k'} \lambda_r \frac{\partial f_r}{\partial x_i} = 0, \qquad i = 1, \ldots, 3N \tag{2.2}$$


where k and k' are the total number of holonomic and nonholonomic constraints respectively. 
These are 3N differential equations each containing a maximum of k + k' + 2 terms. There 
are, further, k + k' equations of constraints. Thus 3N + k + k' equations are to be solved 
for the 3N unknowns $x_i$ and the k + k' unknowns $\lambda_r$. In most treatments, D’Alembert’s 
principle is obtained first and then Lagrange’s equations of the first kind are derived from it. 
We have reversed the order because Lagrange’s equations of the first kind are more general 
than D’Alembert’s principle. However, neither method is economical except for a few specific 
cases, because there are too many unknowns to be solved for. Now the question is, can one 
minimise the number of unknown variables? The answer is, in general, ‘yes’. Here enters 
the concept of degrees of freedom of a dynamical system and with it a powerful technique 
for solving dynamical problems developed by Euler and Lagrange. 


2.2 DEGREES OF FREEDOM 

The minimum number of independent variables (say $u_1, u_2, \ldots, u_n$) required to fix the 
position and the configuration of a dynamical system which are compatible with the given 
constraints is called the number of degrees of freedom (DOF) of the system. These independent 
variables must be sufficient in number to describe all possible positions and configurations 
of the system consistent with the given constraints. By independent we mean that for the 
n variables given by $u_1, \ldots, u_n$, if we have n constants $c_1, \ldots, c_n$ satisfying 

$$\sum_{i=1}^{n} c_i \, du_i = 0$$

at any point, then it necessarily follows that 

$$c_1 = c_2 = \cdots = c_n = 0$$

Thus the differential change of the value of any one of a given set of independent variables 
cannot be obtained by any linear combination of the differentials of the other variables at 
any point. In other words there cannot exist any constraint relation for any given set of 
independent variables. 




We now give examples to enumerate the number of degrees of freedom. 

1. A free particle : Since we are dealing with the motion of particles in three dimensional 
Euclidean space ($E^3$), a free particle can have the maximum degrees of freedom bounded 
by the dimensionality of its position space, which is 3 for $E^3$. The three Cartesian 
coordinates of a free particle are independent variables, each varying between $-\infty$ and 
$+\infty$. So the number of DOF for a free particle is 3. 

2. N free particles : Each particle requires 3 independent coordinates to be specified. 
Hence N particles require 3N independent coordinates to completely describe them. 
Thus the number of DOF is 3N. 

3. N particles with k constraint relations : When the values of any 3N - k coordinates 
are known, the values of the remaining k coordinates are fixed by the requirement that 
at every instant of time all the 3N coordinates must satisfy the given k constraint 
relations. Thus only 3N - k independent variables are needed to completely specify 
the state of the system. This is a standard procedure of finding the number of DOF 
of any dynamical system, namely, first find the number of particles in the system and 
second, the number of constraints. Of course one has to make sure that each of these 
k constraints is independent of all the others. So in this case the number of DOF is 
3N - k. 

4. The fixed fulcrum of a simple pendulum : This is a point fixed on a ceiling. It requires 
three coordinates, say $(x_0, y_0, z_0)$, to represent its position. But all these coordinates 
are fixed, not variable. Hence its number of DOF is 0. We can express this fact in 
terms of constraints. This is done as follows: $x = x_0$, $y = y_0$ and $z = z_0$ are the three 
constraint relations. Thus according to the standard procedure, the number of DOF is 
3 × (number of particles) - (number of constraints) = 3 × 1 - 3 = 0. 

5. The bob of a conical pendulum : Assume that the bob is a particle. The constraint is the 
fixed length l between the moving bob and the fixed fulcrum, that is, $x^2 + y^2 + z^2 = l^2$, 
if the fulcrum is taken to be the origin. Hence the number of DOF is 3 × 1 - 1 = 2. 

6. A dumbbell : The idealization of this is two heavy point particles joined together 
by a massless rigid rod. Thus this system consists of two particles (N = 2) with one 
constraint $(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2 = l^2$, where l is the length of the 
connecting rod, and $(x_1, y_1, z_1)$ and $(x_2, y_2, z_2)$ are the coordinates of the two massive 
particles. Hence the number of DOF is 3 × 2 - 1 = 5. 

7. Three point masses connected by three rigid massless rods : In this system there are 
three particles and three constraints between them, due to the rigid rods. Thus the 
number of DOF is 3 × 3 - 3 = 6. 

8. A rigid body : The mathematical idealization of a rigid body is a system with a large 
number of particles (point masses) not all lying on one line, and with all its particles 
at fixed distances from each other. Take any three points which are not collinear; their 
number of DOF is 6 according to the previous example. If one fixes these three points, 
the body (consisting of N particles) is immovable. Hence the number of DOF of a rigid 
body having $N \geq 3$ is 6, which is independent of N. 

9. A rigid body fixed at one point : Since it is fixed at one point we lose 3 degrees of 
freedom. Hence the number of DOF of this system is 6 - 3 = 3. The body can rotate 
freely about this fixed point with these degrees of freedom. 


10. A rigid body rotating about a fixed axis in space : In this case, the number of DOF is 1. 
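
The counting rule used in examples 3 to 7 above, DOF = 3N - k, is simple enough to check mechanically. The following Python lines are a minimal sketch; the function name `degrees_of_freedom` and the dictionary of examples are our own illustrative choices. The rule assumes that the k constraints are independent, which the code cannot verify for us.

```python
def degrees_of_freedom(n_particles, n_constraints):
    """Return 3N - k for N particles subject to k independent constraints."""
    dof = 3 * n_particles - n_constraints
    if dof < 0:
        raise ValueError("more independent constraints than coordinates")
    return dof


# (number of particles, number of constraints) for examples 4 to 7 of the list.
examples = {
    "fixed fulcrum": (1, 3),
    "bob of a conical pendulum": (1, 1),
    "dumbbell": (2, 1),
    "three masses joined by three rods": (3, 3),
}

for name, (n, k) in examples.items():
    print(f"{name}: {degrees_of_freedom(n, k)}")
```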


2.3 GENERALISED COORDINATES 

We have seen that for a system of N particles with k independent constraints, the number 
of independent variables required to specify its configuration and position is n = 3 N - k, 
which is less than the total number of Cartesian coordinates involved. The question is how 
to specify these independent variables. We can of course, choose any 3 N - k out of the 3 N 
Cartesian coordinates to be the independent variables. However, this choice is by no means 
binding on us. We can choose any set of the required number of independent quantities say 
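$(q_1, q_2, \ldots, q_n)$ such that all Cartesian coordinates are known functions of the $q_i$ variables: 

$$\begin{aligned}
x_1 &= x_1(q_1, q_2, \ldots, q_n, t) \\
x_2 &= x_2(q_1, q_2, \ldots, q_n, t) \\
&\ \ \vdots \\
x_i &= x_i(q_1, q_2, \ldots, q_n, t) \\
&\ \ \vdots \\
x_{3N} &= x_{3N}(q_1, q_2, \ldots, q_n, t)
\end{aligned} \tag{2.3}$$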

One should note that if there is an explicit time dependence in some or all of the 
functions defined in Eq. (2.3), the system is called rheonomic. Otherwise the system is called 
scleronomic. 

Obtaining a solution of the mechanical problem now involves two steps. First, set up the 
equations of motion in terms of the $q_i$ variables and integrate them to obtain their time 
dependence. Then the functions $x_i(t)$ can be obtained using Eq. (2.3). This can simplify 
the problem, because integrating the equations of motion in the $q_i$ variables may be simpler. 
In fact, the choice of good $q_i$ variables is guided by this requirement. These variables defined 
through Eq. (2.3) are called generalised coordinates. 

Thus we can define generalised coordinates as the independent coordinates sufficient to 
completely specify the configuration of a dynamical system. They are not necessarily 
rectangular Cartesian coordinates. 

For example, consider the central force problem. For a particle revolving around a fixed 
attracting centre, it is easier to work with spherical polar coordinates, which are related to 
Cartesian coordinates through the equations (see Appendix A1) 

$$\begin{aligned}
x &= r \sin\theta \cos\phi \\
y &= r \sin\theta \sin\phi \\
z &= r \cos\theta
\end{aligned}$$

Similarly, for a spherical pendulum, $\theta$ and $\phi$ are the most suitable generalised 
coordinates, and so on. 
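
The transformation above is a concrete instance of Eq. (2.3), and it can be checked numerically. The sketch below is ours (the function name `spherical_to_cartesian` and the sample values are assumptions made for illustration). It shows that for a spherical pendulum of fixed length l the constraint $x^2 + y^2 + z^2 = l^2$ is satisfied automatically for any $\theta$ and $\phi$, which is why these two angles serve as generalised coordinates.

```python
import math


def spherical_to_cartesian(r, theta, phi):
    """Map spherical polar coordinates (r, theta, phi) to Cartesian (x, y, z)."""
    x = r * math.sin(theta) * math.cos(phi)
    y = r * math.sin(theta) * math.sin(phi)
    z = r * math.cos(theta)
    return x, y, z


# Assumed pendulum length and two arbitrary angles (radians).
length = 2.0
x, y, z = spherical_to_cartesian(length, 0.5, 1.2)

# The constraint holds whatever the angles are, up to rounding error.
print(math.isclose(x * x + y * y + z * z, length ** 2))
```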




2.3.1 Remarks 

1. The generalised coordinates do not necessarily belong to any conventional coordinate 
system. They can be any curvilinear coordinates, dimensionless parameters such as angles 
or their combinations, etc. 

2. For holonomic systems, the number of generalised coordinates is exactly equal to the 
number of DOF of the system, that is, for a system of N particles having k holonomic 
constraints, the total number of generalised coordinates is 3 N - k. Hence they are strictly 
independent of each other. 

3. For nonholonomic systems, the number of generalised coordinates required is larger 
than the number of DOF. If there are k holonomic and k' nonholonomic constraints, the 
number of DOF of the system is 3N - k - k'. However the required number of generalised 
coordinates is 3N - k. Hence these generalised coordinates are called quasi-generalised 
coordinates, in order to reserve the term generalised for the holonomic cases only. 

Example 1. A rigid spherical ball is rolling without slipping on a table. A spherical ball, 
being a rigid body, has a total of six degrees of freedom. But it has to satisfy the constraints 
of rolling without slipping which are given by 
$\mathbf{v}_{cm} = r\,\boldsymbol{\omega} \times \hat{\mathbf{n}}$ (see example 8 in section 
1.2), a vector equation equivalent to three scalar equations. Thus the number of DOF is 
6 - 3 = 3. Of these three constraints, at least one will turn out to be nonholonomic. 
Hence the minimum number of quasi-generalised coordinates is 4 (for example, two Cartesian 
coordinates for the centre of mass and two angular coordinates with respect to the CM for 
the relative orientation of the point of contact on the sphere). 

Example 2. The problem of a disc rolling without slipping on a table is exactly similar to the 
one above. There is one nonholonomic rolling constraint. Although the number of DOF is 
3, the required number of quasi-generalised coordinates is 4 (the two Cartesian coordinates 
for the location of the CM and two angular coordinates, namely one for the orientation of 
the disc with respect to the vertical and the other the angle of rolling). 

4. The total time derivative of any generalised coordinate is called the generalised 
coordinate velocity or the generalised velocity in short. It is denoted by $\dot{q}_i$, that is, 

$$\dot{q}_i = \frac{dq_i}{dt}, \qquad i = 1, \ldots, n \tag{2.4}$$

5. The set of all $q_i$, i = 1, 2, ..., n, span an n-dimensional space $R^n$ called the 
configuration space of the system. At any given time t, the system is located at some point 
in its configuration space, so the time evolution of the system is represented by a definite 
trajectory in its configuration space. The n-dimensional configuration space $R^n$ contains 
$\infty^{n-1}$ curves through each point, each curve being a possible path of the system and 
each of which can be traversed at any speed. The space is a dynamical one in the sense that 
motions do actually take place in it. The velocity of a point moving along a trajectory in 
the configuration space has n components $\dot{q}_i$, i = 1, ..., n. 

6. The extended configuration space $R^n \times R^1(t) \to R^{n+1}$ is geometrical, the extra 
dimension being that for the time parameter t. There are $\infty^n$ curves through each point, 
each curve again being a possible path. But these paths are not traversed in time; there is no 
motion in the space. There do, however, exist simply mathematical relations between curves. 



2.4 LAGRANGE’S EQUATIONS OF MOTION OF THE SECOND KIND 


These are also known as Lagrange’s equations of motion. 

Consider a rheonomic, holonomic system having N particles and k holonomic constraints. 
Thus it has n = 3N - k degrees of freedom. Choose n generalised coordinates 
$(q_1, q_2, \ldots, q_n)$. These are related to Cartesian coordinates through Eq. (2.3). 
Therefore, the Cartesian velocity components are 

$$\dot{x}_i = \frac{dx_i}{dt} = \frac{\partial x_i}{\partial t} + \sum_{j=1}^{n} \frac{\partial x_i}{\partial q_j}\,\dot{q}_j \tag{2.5}$$

From now on, we will use the Einstein summation convention, in which whenever there is a 
repeated index in any term, there is an implicit summation over that index, unless otherwise 
mentioned. For example, in the above equation, the index j is repeated in the second term 
on the RHS, and there is a summation from j = 1 to j = n. Thus it is possible to dispense 
with the summation sign, and we can write that term as 
$(\partial x_i/\partial q_j)\,\dot{q}_j$. Equation (2.5) gives 

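$$\frac{\partial \dot{x}_i}{\partial \dot{q}_j} = \frac{\partial x_i}{\partial q_j} \tag{2.6}$$

Next consider 

$$\begin{aligned}
\frac{d}{dt}\left(\frac{\partial x_i}{\partial q_j}\right)
&= \frac{\partial}{\partial t}\left(\frac{\partial x_i}{\partial q_j}\right)
 + \frac{\partial}{\partial q_l}\left(\frac{\partial x_i}{\partial q_j}\right)\dot{q}_l \\
&= \frac{\partial}{\partial q_j}\left(\frac{\partial x_i}{\partial t}
 + \frac{\partial x_i}{\partial q_l}\,\dot{q}_l\right)
 = \frac{\partial \dot{x}_i}{\partial q_j}
\end{aligned} \tag{2.7}$$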


We have started to use the summation convention, so there is a summation over the index 
l. The relations (2.6) and (2.7) are true for both rheonomic and scleronomic systems, that 
is, irrespective of whether $x_i$ depends explicitly on time or not. Now since any infinitesimal 
virtual displacement is given by 

$$\delta x_i = [dx_i]_{dt=0} = \frac{\partial x_i}{\partial q_j}\,\delta q_j \tag{2.8}$$


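we can begin to transform D’Alembert’s principle for this system, which at present is of the 
Cartesian (or Newtonian) form, that is, 

$$m_i \frac{d^2 x_i}{dt^2}\,\delta x_i - F_i\,\delta x_i = 0 \tag{2.9}$$

to a form expressed totally in terms of the generalised coordinates. Consider the first term 
on the LHS (summation over i implied) of Eq. (2.9). Using Eqs (2.8), (2.6) and (2.7) in turn, 

$$\begin{aligned}
m_i \frac{d^2 x_i}{dt^2}\,\delta x_i
&= m_i \ddot{x}_i \frac{\partial x_i}{\partial q_j}\,\delta q_j \\
&= \left[\frac{d}{dt}\left(m_i \dot{x}_i \frac{\partial x_i}{\partial q_j}\right)
 - m_i \dot{x}_i \frac{d}{dt}\left(\frac{\partial x_i}{\partial q_j}\right)\right]\delta q_j \\
&= \left[\frac{d}{dt}\left(m_i \dot{x}_i \frac{\partial \dot{x}_i}{\partial \dot{q}_j}\right)
 - m_i \dot{x}_i \frac{\partial \dot{x}_i}{\partial q_j}\right]\delta q_j \\
&= \left[\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\right)
 - \frac{\partial T}{\partial q_j}\right]\delta q_j
\end{aligned} \tag{2.10}$$

where $T = \frac{1}{2} m_i \dot{x}_i^2$ is the total kinetic energy of the system, by its 
standard definition in terms of the Cartesian coordinates. 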

The second term on the LHS of D’Alembert’s equation is 

$$-F_i\,\delta x_i = -F_i \frac{\partial x_i}{\partial q_j}\,\delta q_j = -Q_j\,\delta q_j \tag{2.11}$$

which defines 

$$Q_j = F_i \frac{\partial x_i}{\partial q_j} \tag{2.12}$$

as the jth component of the generalised force. Suppose $q_j$ is an angle, then $Q_j$ would be 
the torque corresponding to that angle. 
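
The definition of the generalised force as $F_i\,\partial x_i/\partial q_j$ can be tried out on a plane pendulum. The sketch below is our own illustration and rests on assumptions not made in the text: the pivot is the origin, y points upward, $x = l\sin\theta$, $y = -l\cos\theta$, and the only applied force is the weight (0, -mg). The helper `generalised_force` is a hypothetical name, and the partial derivatives are approximated by central differences. The result should be the gravitational torque $-mgl\sin\theta$.

```python
import math


def generalised_force(force, position, q, j, h=1e-6):
    """Approximate Q_j = F_i * dx_i/dq_j using a central difference in q_j."""
    q_plus, q_minus = list(q), list(q)
    q_plus[j] += h
    q_minus[j] -= h
    x_plus, x_minus = position(q_plus), position(q_minus)
    return sum(f * (xp - xm) / (2 * h)
               for f, xp, xm in zip(force, x_plus, x_minus))


# Assumed pendulum data (SI units), chosen only for the demonstration.
MASS, GRAVITY, LENGTH = 2.0, 9.81, 0.5


def bob_position(q):
    """Cartesian position (x, y) of the bob for q = [theta]."""
    theta = q[0]
    return (LENGTH * math.sin(theta), -LENGTH * math.cos(theta))


theta = 0.3
weight = (0.0, -MASS * GRAVITY)
Q_theta = generalised_force(weight, bob_position, [theta], 0)
torque = -MASS * GRAVITY * LENGTH * math.sin(theta)
print(Q_theta, torque)  # the two numbers agree to within the differencing error
```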

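Combining Eqs (2.9), (2.10) and (2.12), D’Alembert’s equation becomes 

$$\left[\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\right) - \frac{\partial T}{\partial q_j} - Q_j\right]\delta q_j = 0 \tag{2.13}$$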


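Since the $q_j$ coordinates are independent, the values of all the $\delta q_j$’s can be arbitrary. 
Therefore, the above equality can hold good if and only if the individual coefficient of each 
$\delta q_j$ is separately zero. This implies 

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\right) - \frac{\partial T}{\partial q_j} = Q_j \tag{2.14}$$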


for every generalised coordinate $q_j$ and its generalised velocity $\dot{q}_j$. These n = 3N - k 
differential equations are called Lagrange’s equations of motion of the second kind, published 
by him in 1788. This is how Lagrange recovered the requisite minimum possible number of 
independent equations of motion from just one, namely D’Alembert’s principle. 

It is worth noting that the constraint relations are not at all apparent in Eq. (2.14) 
but the choice of the generalised coordinates and the determination of the kinetic energy 
function T in terms of $q_j$, $\dot{q}_j$ and t depend on the knowledge of the constraint relations. 

In the next section we shall see that Lagrange’s equations are n second-order ordinary 
coupled differential equations, linear in the accelerations. A general solution of these 
equations involves 2n arbitrary constants of integration. The values of these 2n constants 
can only be determined if we know the state of the system (the values of all the generalised 
coordinates and velocities) at some instant of time. Once determined, the motion of the 
system gets completely specified for all time in the past as well as in the future. 
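
The statement that one known state fixes the motion both forwards and backwards in time can be illustrated numerically. The sketch below is ours and makes several assumptions: the system is a plane pendulum with the equation of motion $\ddot{\theta} = -(g/l)\sin\theta$, the value g/l = 9.81 is arbitrary, and the integration uses the classical fourth-order Runge-Kutta scheme, which is only an approximation to the exact motion. Starting from a state (θ, θ̇), we integrate forward and then backward with the opposite time step and recover the initial state to within the numerical error.

```python
import math


def pendulum_rhs(state, g_over_l=9.81):
    """Time derivative of the state (theta, theta_dot) of a plane pendulum."""
    theta, omega = state
    return (omega, -g_over_l * math.sin(theta))


def rk4_step(f, state, dt):
    """Advance the state by one classical Runge-Kutta step of size dt."""
    k1 = f(state)
    k2 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = f(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def evolve(state, dt, n_steps):
    """Apply n_steps Runge-Kutta steps; a negative dt runs the motion backwards."""
    for _ in range(n_steps):
        state = rk4_step(pendulum_rhs, state, dt)
    return state


initial = (0.5, 0.0)                       # the 2n = 2 constants for n = 1
future = evolve(initial, 1e-3, 2000)       # state two seconds later
recovered = evolve(future, -1e-3, 2000)    # run the same motion back in time
print(initial, future, recovered)
```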

Now, the ultimate justification for any law to exist in physics lies with experiments. Thus 
the verification of the above formalism (a restatement of Newtonian mechanics) lies with our 
ability (or failure) to match it with experience. It so happens, that the above is true to a good 
approximation when compared with reality. However, one of the principal assumptions, that 
is implicit, is that the exact measurability of the positions and velocities of all the particles 
in a system at some point of time is possible. At the turn of this century, physicists found 
it impossible to explain certain phenomena at the microscopic scale within this framework. 
This led them to question this fundamental assumption of classical mechanics, and to the 
concept of what is now known as Heisenberg’s uncertainty principle, which roughly states 
that it is impossible even in principle to simultaneously find out the positions and velocities 
of particles to an arbitrary accuracy. An investigation of the consequences of this led to the 
formulation of quantum mechanics which is a far more satisfactory theory of reality. It can 
be shown that at the macroscopic scale, classical mechanics is the limiting case of quantum 
mechanics, which explains its success. 

Even within the framework of classical mechanics, there are many systems where a slight 
difference in the initial state of the system leads to totally different behaviour in a very 
short period of time. For such systems, a quantitative prediction of their time evolution 
makes little sense. For example, consider drops of water falling out of a tap. The way they 
splash seems totally random. This is because of very tiny vortices and turbulence inside the 
tap just before the drop falls. These small differences cause each drop to splash differently. 
In other words, even if classical mechanics applies to such systems, since it is impossible 
in practice to find out the state of a system with absolute certainty, it will not take very 
long for us to lose all information about the initial conditions. A study of such systems is 
now one of the exciting areas of research in modern classical mechanics, called dynamics of 
deterministic chaos. 


2.5 PROPERTIES OF KINETIC ENERGY FUNCTION T 

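By Eq. (2.10) the kinetic energy function is $T = \frac{1}{2} m_i \dot{x}_i^2$ where for a 
generally rheonomic system the Cartesian coordinates and velocities are given by Eqs (2.3) 
and (2.5) namely, 

$$x_i = x_i(q_1, q_2, \ldots, q_n, t) \quad \text{and} \quad \dot{x}_i = \frac{\partial x_i}{\partial t} + \frac{\partial x_i}{\partial q_j}\,\dot{q}_j$$

This gives 

$$T = \frac{1}{2} m_i \left(\frac{\partial x_i}{\partial t} + \frac{\partial x_i}{\partial q_j}\,\dot{q}_j\right)^2 = T_0 + T_1 + T_2 \tag{2.15}$$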




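This equation defines $T_0$, $T_1$ and $T_2$. $T_0$ is only a function of the $q_i$’s and t. 
We can then write 

$$T_0 = \frac{1}{2} m_i \left(\frac{\partial x_i}{\partial t}\right)^2 = a_0 \ \text{(say)} \tag{2.16}$$

$T_1$ is a linear combination of $\dot{q}_j$’s 

$$T_1 = a_j \dot{q}_j \ \text{(say)} \tag{2.17}$$

where 

$$a_j = m_i \frac{\partial x_i}{\partial t}\frac{\partial x_i}{\partial q_j} \tag{2.18}$$

$a_j$’s, like $a_0$, are only functions of $q_i$’s and t. The third term can similarly be written as 

$$T_2 = \frac{1}{2} a_{jl}\,\dot{q}_j \dot{q}_l \tag{2.19}$$

where 

$$a_{jl} = m_i \frac{\partial x_i}{\partial q_j}\frac{\partial x_i}{\partial q_l} \tag{2.20}$$

Again $a_{jl}$’s are functions of $q_i$’s and t only. Combining the above together, we have 

$$T = a_0 + a_j \dot{q}_j + \frac{1}{2} a_{jl}\,\dot{q}_j \dot{q}_l \tag{2.21}$$

which makes all the dependences of T on generalised coordinate velocities explicit. 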

Thus the kinetic energy is in general made up of a term that is independent of the 
generalised velocities, a term linear, and a term that is quadratic in them. For scleronomic 
systems $T = T_2$ as $T_0 = T_1 = 0$, since $x_i$’s do not depend explicitly on time. 

It can be shown that T, $T_0$ and $T_2$ are non-negative definite quantities, that is, $T \geq 0$, 
$T_0 \geq 0$ and $T_2 \geq 0$. For T and $T_0$, this is evident from their definitions (see Eq. (2.15)). 
$T_2$ is non-negative because the matrix $(a_{jl})$ is symmetric and non-negative definite (all its 
eigenvalues are greater than or equal to 0). $T_2$ vanishes only if all the $\dot{q}_i$’s are zero. 
$T_1$ can be either positive or negative. 
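
The splitting of T into $T_0$, $T_1$ and $T_2$ can be made concrete with a short computation. The sketch below is an illustration under our own assumptions: the function `kinetic_terms` is a hypothetical helper that takes the partial derivatives $\partial x_i/\partial t$ and $\partial x_i/\partial q_j$ as plain numbers, and the test system is a single particle on a line with $x = q + ut$ (a coordinate measured from a uniformly moving origin), for which $T = \frac{1}{2}m(\dot{q} + u)^2$.

```python
def kinetic_terms(masses, dx_dt, dx_dq, qdot):
    """Return (T0, T1, T2) built from a_0, a_j and a_jl.

    masses[i] is m_i, dx_dt[i] is dx_i/dt (partial), dx_dq[i][j] is dx_i/dq_j
    and qdot[j] is the generalised velocity of coordinate j.
    """
    n = len(qdot)
    a0 = 0.5 * sum(m * xt ** 2 for m, xt in zip(masses, dx_dt))
    a = [sum(m * xt * row[j] for m, xt, row in zip(masses, dx_dt, dx_dq))
         for j in range(n)]
    a2 = [[sum(m * row[j] * row[l] for m, row in zip(masses, dx_dq))
           for l in range(n)] for j in range(n)]
    t1 = sum(a[j] * qdot[j] for j in range(n))
    t2 = 0.5 * sum(a2[j][l] * qdot[j] * qdot[l]
                   for j in range(n) for l in range(n))
    return a0, t1, t2


# Assumed data: mass 2, origin moving with speed u = 3, generalised velocity 1.5.
m, u, v = 2.0, 3.0, 1.5
T0, T1, T2 = kinetic_terms([m], [u], [[1.0]], [v])
print(T0, T1, T2, T0 + T1 + T2, 0.5 * m * (v + u) ** 2)

# With u = 0 the transformation is scleronomic and only T2 survives.
print(kinetic_terms([m], [0.0], [[1.0]], [v]))
```

Reversing the sign of the generalised velocity changes the sign of $T_1$ but leaves $T_0$ and $T_2$ unchanged, in line with the remark that $T_1$ can have either sign.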

Substituting for the full expression for T in Lagrange’s equations (Eq. (2.14)) we get 

$$\frac{d}{dt}\left[\frac{\partial}{\partial \dot{q}_j}\left(T_0 + T_1 + T_2\right)\right] - \frac{\partial}{\partial q_j}\left(T_0 + T_1 + T_2\right) = Q_j$$

Thus Lagrange’s equations are second order coupled differential equations in $q_i$, linear in 
the accelerations $\ddot{q}_i$, provided the $Q_j$’s are not explicit functions of $\ddot{q}_i$’s. 
We now discuss some of the important types of $Q_j$’s that can exist. 

Case 1: $Q_j = Q_j(q_1, \ldots, q_n, t)$ 

Here $Q_j$’s are functions of the $q_i$’s and t and not of the generalised velocities. In such cases 
there exists an ordinary potential energy function $V(q_1, \ldots, q_n, t)$ such that by definition 

$$Q_j = -\frac{\partial V}{\partial q_j}, \qquad j = 1, \ldots, n \tag{2.22}$$




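Lagrange’s equations then become 

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\right) - \frac{\partial T}{\partial q_j} + \frac{\partial V}{\partial q_j} = 0$$

or, since V does not depend on the generalised velocities, we can write 

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j} = 0, \qquad j = 1, \ldots, n \tag{2.23}$$

where 

$$L = T - V \tag{2.24}$$

$L = L(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$ is called by definition the Lagrangian of 
the system. The n equations in Eq. (2.23) are called the Euler-Lagrange equations of motion. 
These are a system of second order coupled differential equations in the $q_j$ variables, 
linear in the accelerations. 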
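
As a quick check on Eq. (2.23), one can evaluate its left-hand side along a trial motion and see whether it vanishes. The sketch below is ours and assumes a one-dimensional harmonic oscillator with $L = \frac{1}{2}m\dot{q}^2 - \frac{1}{2}kq^2$; the names `lagrangian` and `euler_lagrange_residual` are hypothetical, and all derivatives are approximated by central differences, so the residual is small rather than exactly zero.

```python
import math

M, K = 1.0, 4.0                  # assumed mass and spring constant
OMEGA = math.sqrt(K / M)         # natural angular frequency


def lagrangian(q, qdot):
    """L = T - V for the assumed harmonic oscillator."""
    return 0.5 * M * qdot ** 2 - 0.5 * K * q ** 2


def euler_lagrange_residual(L, q_of_t, qdot_of_t, t, h=1e-4):
    """Approximate d/dt(dL/dqdot) - dL/dq along the motion q_of_t at time t."""
    def dL_dqdot(s):
        q, v = q_of_t(s), qdot_of_t(s)
        return (L(q, v + h) - L(q, v - h)) / (2 * h)

    q, v = q_of_t(t), qdot_of_t(t)
    dL_dq = (L(q + h, v) - L(q - h, v)) / (2 * h)
    d_dt = (dL_dqdot(t + h) - dL_dqdot(t - h)) / (2 * h)
    return d_dt - dL_dq


# A true motion q = cos(OMEGA t) and a trial motion with the wrong frequency.
good = euler_lagrange_residual(lagrangian,
                               lambda t: math.cos(OMEGA * t),
                               lambda t: -OMEGA * math.sin(OMEGA * t), 0.3)
bad = euler_lagrange_residual(lagrangian,
                              lambda t: math.cos(t),
                              lambda t: -math.sin(t), 0.3)
print(good, bad)  # close to zero for the true motion, not for the wrong one
```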


Case 2: $Q_j = Q_j(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$, also derivable from a 
velocity dependent potential energy function in the following way. 

When the $Q_j$’s are derivable from a potential energy function U (the more general case 
will be discussed later) satisfying the equations by definition, 

$$Q_j = \frac{d}{dt}\left(\frac{\partial U}{\partial \dot{q}_j}\right) - \frac{\partial U}{\partial q_j}, \qquad j = 1, \ldots, n \tag{2.25}$$

where $U = U(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$ is called a generalised potential 
energy function, then we get from Lagrange’s equations of motion (Eq. (2.14)), 

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j} = 0 \tag{2.26}$$

where 

$$L = T - U \tag{2.27}$$

is also called the Lagrangian of the system as before. Note that Eqs (2.23) and (2.26) are 
identical in form, except for the difference in the definition of L depending on the nature of 
the $Q_j$’s. 

Case 3: Existence of nonpotential forces 

It is possible that there exist some nonpotential forces $Q'_j$ apart from the potential 
force component (derivable either from a velocity independent ordinary potential V or a 
velocity dependent generalised potential U). In this case Euler-Lagrange’s equations take 
the following most general form 

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j} = Q'_j, \qquad j = 1, \ldots, n \tag{2.28}$$

Here $Q'_j = Q'_j(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$ in general, and L = T - V or 
L = T - U as the case may be. However, it is usually preferable to use L = T - V and keep all 
velocity dependent forces included in $Q'_j$, if the inclusion of a nonpotential part is 
essentially unavoidable. 


2.6 THEOREM ON TOTAL ENERGY 


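Let us consider a system which satisfies Lagrange’s equations in the following form (see Eq. 
(2.14)) 

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\right) - \frac{\partial T}{\partial q_j} = Q_j$$

where in general, 

$$Q_j = -\frac{\partial V}{\partial q_j} + Q'_j \tag{2.29}$$

V being the ordinary potential energy function and $Q'_j$ the jth component of the generalised 
nonpotential forces. 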

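By definition, the total energy of the system (E) is the sum of its ordinary potential 
energy and kinetic energy. We want to determine its time dependence. Now, since 
$T = T(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$, 

$$\begin{aligned}
\frac{dT}{dt} &= \frac{\partial T}{\partial t} + \frac{\partial T}{\partial q_j}\,\dot{q}_j + \frac{\partial T}{\partial \dot{q}_j}\,\ddot{q}_j \\
&= \frac{\partial T}{\partial t} + \frac{\partial T}{\partial q_j}\,\dot{q}_j
 + \frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\,\dot{q}_j\right)
 - \frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\right)\dot{q}_j \\
&= \frac{\partial T}{\partial t} + \frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_j}\,\dot{q}_j\right)
 + \left(\frac{\partial V}{\partial q_j} - Q'_j\right)\dot{q}_j
\end{aligned} \tag{2.30}$$

using Euler-Lagrange’s equation in the last step. Substituting $T = T_0 + T_1 + T_2$ (Eq. (2.15)) 
in Eq. (2.30), and using Euler’s theorem on homogeneous functions, 

$$\frac{dT}{dt} = \frac{\partial T}{\partial t} + \frac{d}{dt}\left(T_1 + 2T_2\right) + \frac{\partial V}{\partial q_j}\,\dot{q}_j - Q'_j \dot{q}_j \tag{2.31}$$

Since by definition E = T + V, we have for the time rate of variation of total energy, 

$$\frac{dE}{dt} = \frac{d}{dt}(T + V) = \frac{d}{dt}\left(2T_0 + T_1\right) - \frac{\partial T}{\partial t} + \frac{\partial V}{\partial t} + Q'_j \dot{q}_j \tag{2.32}$$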

This is the most general result for rheonomic systems. Some special cases are considered 
below. 
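
Before turning to the special cases, the general result (2.32) can be tested on a simple rheonomic system. The sketch below is our own example, not one from the text: a bead of mass m slides without friction on a straight wire that rotates in a horizontal plane with constant angular speed ω, so that $x = q\cos\omega t$, $y = q\sin\omega t$, V = 0 and there are no nonpotential generalised forces. Then $T_0 = \frac{1}{2}m\omega^2 q^2$, $T_1 = 0$, $T_2 = \frac{1}{2}m\dot{q}^2$, T has no explicit time dependence, and Eq. (2.32) reduces to $dE/dt = d(2T_0)/dt$. The motion used, $q = q_0\cosh\omega t$, solves $\ddot{q} = \omega^2 q$ with the bead initially at rest relative to the wire. Time derivatives are taken by central differences.

```python
import math

M, OMEGA, Q0 = 1.5, 2.0, 0.4     # assumed mass, angular speed, initial distance


def q(t):
    """Distance along the wire: solution of q'' = OMEGA**2 * q, q'(0) = 0."""
    return Q0 * math.cosh(OMEGA * t)


def qdot(t):
    return Q0 * OMEGA * math.sinh(OMEGA * t)


def T0(t):
    """Part of the kinetic energy that is independent of qdot."""
    return 0.5 * M * OMEGA ** 2 * q(t) ** 2


def T2(t):
    """Part of the kinetic energy that is quadratic in qdot."""
    return 0.5 * M * qdot(t) ** 2


def energy(t):
    """E = T + V with V = 0 and T1 = 0 for this system."""
    return T0(t) + T2(t)


def ddt(f, t, h=1e-5):
    """Central-difference approximation to df/dt."""
    return (f(t + h) - f(t - h)) / (2 * h)


t = 0.7
lhs = ddt(energy, t)                     # dE/dt
rhs = ddt(lambda s: 2 * T0(s), t)        # d(2 T0 + T1)/dt, the only surviving term
print(lhs, rhs, 2 * M * OMEGA ** 2 * q(t) * qdot(t))
```

The energy is not conserved here even though no nonpotential force appears in the equations of motion; the growth of E comes entirely from the rheonomic term.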


Case (i) Scleronomic Systems 

For all such systems, $\partial T/\partial t = 0$, $T_0 = 0$ and $T_1 = 0$. Thus we have 

$$\frac{dE}{dt} = \frac{\partial V}{\partial t} + Q'_j \dot{q}_j \tag{2.33}$$

Suppose V does not explicitly depend on time, then 

$$\frac{dE}{dt} = Q'_j \dot{q}_j = P'$$

where P' is the power associated with the nonpotential forces. This equation tells us that 
nonconservation of total energy is directly associated with the existence of nonpotential 
forces $Q'_j$, even though the Lagrangian L = T - V does not have any explicit 
dependence on time in this case. 


Case (ii) Conservative Systems 

Here we consider systems which are scleronomic with no nonpotential forces. We also assume 
further that the ordinary potential energy function does not explicitly depend on time. It 
is only then that 

$$\frac{dE}{dt} = 0 \quad \text{or} \quad E = T + V = E_0 \ \text{(say)} \tag{2.34}$$

or in words, throughout the motion, the energy of the system is conserved. Such systems 
are by definition called conservative systems. This integral of motion for any conservative 
system is called the energy integral. 


Case (iii) Systems for which the Nonpotential Forces $Q'_j$ do not Consume Power 
For such systems, $Q'_j$’s exist, and the power associated with them is 

$$P' = Q'_j \dot{q}_j = 0 \tag{2.35}$$

At least some $Q'_j$’s must be non-zero, but the above summation has to vanish. All the 
velocity dependent forces that have this property are known as gyroscopic forces. For scleronomic 
systems under the actions of gyroscopic forces and an ordinary potential energy function 
that is not explicitly time dependent, the energy integral $E = E_0$ exists. We shall show in 
the next section that certain kinds of gyroscopic forces can be included in and promoted to 
the class of generalised velocity dependent potential energy functions U. 


Case (iv) Systems Experiencing Dissipative Forces that Consume Power 
In this case, by definition of the dissipative forces, $Q'_j \dot{q}_j < 0$. Note that dissipative 
forces like the forces of friction are included, even though sometimes they do not do any work. 
However, the energy is generally lost in the form of heat, sound, etc. All scleronomic systems 
incurring dissipative losses should satisfy the energy condition: 

$$\frac{dE}{dt} < 0 \tag{2.36}$$

Thus not all scleronomic systems are necessarily conservative. We now consider an important 
kind of situation which also serves to illustrate Eqs (2.35) and (2.36). 

Case (v) Systems Experiencing Nonpotential Forces that have Linear Dependence on 
Generalised Velocities 

In general, such forces can be represented by the linear matrix equation

$$Q'_j = B_{jk} \dot{q}_k \qquad (2.37)$$




where $B_{jk}$ are constants (the most general case is when there is also an additive part, and
that is discussed in the next section). $B = [B_{jk}]$ is an $n \times n$ matrix and can be split
into symmetric and antisymmetric parts in the following way:

$$B = \tfrac{1}{2}(B + \tilde{B}) + \tfrac{1}{2}(B - \tilde{B}) = -S + A \qquad (2.38)$$

Here $\tilde{B}$ is the transpose of the matrix $B$. The negative sign is chosen for convenience. $S$ is a
symmetric matrix and $A$ is antisymmetric, that is,

$$\tilde{S} = S \quad \text{and} \quad \tilde{A} = -A \qquad (2.39)$$

Thus the associated power is

$$P' = Q'_j \dot{q}_j = [-S + A]_{jk}\, \dot{q}_j \dot{q}_k = -S_{jk} \dot{q}_j \dot{q}_k + A_{jk} \dot{q}_j \dot{q}_k \qquad (2.40)$$

The second term identically vanishes because $A$ is antisymmetric. Thus the antisymmetric
part of $B$ does not cause any dissipation of energy, and $Q'_j(\text{anti}) = A_{jk} \dot{q}_k$, $j = 1, \ldots, n$,
are therefore gyroscopic forces. The Coriolis force $\mathbf{F}_c = 2m(\mathbf{v} \times \boldsymbol{\omega})$ and the Lorentz force
on a charged particle moving in a magnetic field, $\mathbf{F}_m = e(\mathbf{v} \times \mathbf{B})$, are examples of gyroscopic
forces. In fact, any vector cross product is equivalently an antisymmetric tensor of the
second rank. The power associated with these forces is $\mathbf{F} \cdot \mathbf{v} = 0$, as we would expect. We
will discuss more about these forces in section 2.8.
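The splitting of $B$ into $-S + A$ can be checked numerically. The following is a minimal sketch, assuming an arbitrary $3 \times 3$ matrix and an arbitrary velocity vector (both invented here for illustration, with $S$ chosen to be positive definite); the helper names `split_matrix` and `quadratic_form` are hypothetical.

```python
# Decompose a constant coefficient matrix B (Q'_j = B_jk * qdot_k) as B = -S + A
# and evaluate the power delivered by each part. B and qdot are arbitrary
# illustrative values, not data from the text.

def split_matrix(B):
    """Return (S, A) with B = -S + A, S symmetric and A antisymmetric."""
    n = len(B)
    S = [[-0.5 * (B[j][k] + B[k][j]) for k in range(n)] for j in range(n)]
    A = [[0.5 * (B[j][k] - B[k][j]) for k in range(n)] for j in range(n)]
    return S, A

def quadratic_form(M, v):
    """Return the double sum M_jk * v_j * v_k."""
    n = len(v)
    return sum(M[j][k] * v[j] * v[k] for j in range(n) for k in range(n))

B = [[-2.0, 3.5, 0.0],
     [-2.5, -1.0, 4.0],
     [0.0, -4.0, -3.0]]
qdot = [0.7, -1.2, 2.5]

S, A = split_matrix(B)
power_total = quadratic_form(B, qdot)    # P' = B_jk qdot_j qdot_k
power_gyro = quadratic_form(A, qdot)     # antisymmetric part, expected to vanish
power_diss = -quadratic_form(S, qdot)    # symmetric part, -S_jk qdot_j qdot_k

print(power_total, power_gyro, power_diss)
```

Up to rounding, the gyroscopic contribution is zero and the whole power comes from the symmetric part, which is negative for this choice of $S$.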

On the other hand, the symmetric component of Eq. (2.40) does not in general vanish, as
$-S_{jk} \dot{q}_j \dot{q}_k \neq 0$. So the forces $Q'_j(\text{sym})$ are dissipative, that is, the total energy of the system
decreases with time, and the algebraic sign of the power is negative,

$$P' = -S_{jk} \dot{q}_j \dot{q}_k < 0 \qquad (2.41)$$

This means that

$$S_{jk} \dot{q}_j \dot{q}_k > 0$$

or $S$ is positive definite. Therefore,

$$-\frac{dE}{dt} = -P' > 0 \qquad (2.42)$$

is the rate of energy dissipation of this system. Now, $-\tfrac{1}{2}P'$ is traditionally called the
Rayleigh dissipation function $\mathcal{R}$, that is,

$$\mathcal{R} = \tfrac{1}{2} S_{jk} \dot{q}_j \dot{q}_k \qquad (2.43)$$

This is always positive and satisfies the equation

$$Q'_j = -\frac{\partial \mathcal{R}}{\partial \dot{q}_j} \qquad (2.44)$$

corresponding to the symmetric (dissipative) part of $B$. The rate of increase of energy in
the system is

$$\frac{dE}{dt} = P' = -2\mathcal{R} \qquad (2.45)$$




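So for dissipative systems where the nonpotential forces are linear in the generalised
velocities (for example, air resistance to a good approximation, or Stokes' law of viscosity),
Euler-Lagrange's equations of motion have the form

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j} + \frac{\partial \mathcal{R}}{\partial \dot{q}_j} = 0 \qquad (2.46)$$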
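As a numerical illustration of Eqs (2.45) and (2.46), the sketch below integrates a linearly damped oscillator and checks that the energy lost equals the time integral of $2\mathcal{R}$. The oscillator, its parameter values, and the fourth-order Runge-Kutta integrator are assumptions made here for illustration only.

```python
# Damped oscillator: L = M*v**2/2 - K*q**2/2, Rayleigh function R = C*v**2/2.
# Eq. (2.46) then gives M*q'' + K*q + C*q' = 0. Parameter values are illustrative.

M, K, C = 1.0, 4.0, 0.3

def rhs(state):
    q, v, w = state
    # w accumulates the dissipated energy: dw/dt = 2R = C*v**2
    return [v, -(K * q + C * v) / M, C * v * v]

def rk4_step(f, y, h):
    """One classical fourth-order Runge-Kutta step for y' = f(y)."""
    k1 = f(y)
    k2 = f([a + 0.5 * h * b for a, b in zip(y, k1)])
    k3 = f([a + 0.5 * h * b for a, b in zip(y, k2)])
    k4 = f([a + h * b for a, b in zip(y, k3)])
    return [a + h * (b1 + 2 * b2 + 2 * b3 + b4) / 6.0
            for a, b1, b2, b3, b4 in zip(y, k1, k2, k3, k4)]

def energy(state):
    q, v, _ = state
    return 0.5 * M * v * v + 0.5 * K * q * q

state = [1.0, 0.0, 0.0]          # q(0) = 1, v(0) = 0, nothing dissipated yet
e_start = energy(state)
for _ in range(5000):            # integrate to t = 5 with step 0.001
    state = rk4_step(rhs, state, 0.001)
e_end = energy(state)
dissipated = state[2]

print(e_start, e_end, dissipated)
```

Within the accuracy of the integrator, `e_start - e_end` agrees with `dissipated`, which is the integrated form of $dE/dt = -2\mathcal{R}$.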


2.7 SOME REMARKS ABOUT THE LAGRANGIAN 

At this stage, we emphasize a few important properties of the Lagrangian:

1. $L$ is a scalar function of the $q_j$'s, $\dot{q}_j$'s and $t$. The $q_j$ variables constitute the basis vectors
of an abstract $n$-dimensional configuration space. Any point in this space along with a
tangent direction ($\dot{q}_j$'s at that point) completely determines the state, and the future
of the dynamical system is determined by Euler-Lagrange's equations of motion.

2. $L$ is not unique in its functional form because it is possible to preserve the form of
the Euler-Lagrange equations of motion for a variety of choices of the Lagrangian. In
fact, given a Lagrangian, it is possible to construct any number of other equivalent
Lagrangians (see section 2.10).

3. For any classical holonomic system, $L$ can be constructed either from $T - V$ or from
$T - U$, as the case may be, with the condition that both $T$ and $V$ (or $U$) must be initially
expressed with respect to an inertial frame in which the predecessor equations, namely
D'Alembert's principle and Newton's laws, are valid. However, it may be noted that in
special relativistic dynamics, the classical definition $L = T - V$ or $L = T - U$
is no longer valid.


2.8 LINEAR GENERALISED POTENTIALS 


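By definition, from Eq. (2.25), the generalised force corresponding to the $j$th generalised
coordinate is

$$Q_j = \frac{d}{dt}\left(\frac{\partial U}{\partial \dot{q}_j}\right) - \frac{\partial U}{\partial q_j} = \frac{\partial^2 U}{\partial \dot{q}_i \partial \dot{q}_j}\, \ddot{q}_i + (\text{terms that do not contain } \ddot{q}_i) \qquad (2.47)$$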


For a large number of realistic situations, the $Q_j$'s do not explicitly depend on the generalised
accelerations; hence for such systems $\partial^2 U/\partial \dot{q}_i \partial \dot{q}_j$ must vanish. Thus for such systems the
generalised potential energy can be given by

$$U = V_i \dot{q}_i + V \qquad (2.48)$$

where the $V_i$ (that is, $V_1, \ldots, V_n$) and the ordinary potential energy function $V$ are all functions
of the $q_i$'s and $t$ only. We then have

$$Q_j = \frac{d}{dt}\left(\frac{\partial U}{\partial \dot{q}_j}\right) - \frac{\partial U}{\partial q_j} = \frac{dV_j}{dt} - \frac{\partial V_i}{\partial q_j}\dot{q}_i - \frac{\partial V}{\partial q_j} = -\frac{\partial V}{\partial q_j} + \frac{\partial V_j}{\partial t} + \left(\frac{\partial V_j}{\partial q_i} - \frac{\partial V_i}{\partial q_j}\right)\dot{q}_i \qquad (2.49)$$

Suppose further that the coefficients $V_j$ do not depend explicitly on time. Then

$$Q_j = -\frac{\partial V}{\partial q_j} + \left(\frac{\partial V_j}{\partial q_i} - \frac{\partial V_i}{\partial q_j}\right)\dot{q}_i$$

that is, under these circumstances, the generalised force can be thought of as a sum of an ordinary
type of potential force $-\partial V/\partial q_j$ and a gyroscopic force $Q_j = A_{ji}\dot{q}_i$, with
$A_{ji} = \partial V_j/\partial q_i - \partial V_i/\partial q_j$, since the second term
on the RHS is antisymmetric with respect to $i$ and $j$. Thus all such gyroscopic forces can
be included in the definition of a generalised potential energy function $U$, leaving only the
dissipative forces to be included in the category of the nonpotential forces $Q'_j$. The latter
can be handled by defining a suitable Rayleigh dissipation function, provided the drag
forces are linear with respect to velocity.


Charged Particle in an Electromagnetic Field 

A classic example of this case is the Lorentz force experienced by a charged particle in an
electromagnetic field, given by

$$\mathbf{F} = e(\mathbf{E} + \mathbf{v} \times \mathbf{B}) \qquad (2.50)$$

where

$$\mathbf{E} = -\nabla\phi - \frac{\partial \mathbf{A}}{\partial t} \quad \text{and} \quad \mathbf{B} = \nabla \times \mathbf{A} \qquad (2.51)$$

Here, $\phi = \phi(\mathbf{r}, t)$ is the scalar electromagnetic potential and $\mathbf{A} = \mathbf{A}(\mathbf{r}, t)$ is the vector
electromagnetic potential. We leave it as an exercise to show that with the choice of a
generalised potential energy function of the form

$$U = e\phi - e(\mathbf{v} \cdot \mathbf{A}) \qquad (2.52)$$

that is, by setting $V_i = -eA_i$ and $V = e\phi$, we get the correct equations of motion, namely
Eq. (2.50). The key point here is that the magnetic force is gyroscopic in nature, and is of
the type discussed in this section, so it can be incorporated in a suitably defined generalised
potential $U$, such as the one given by Eq. (2.52).
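The exercise can be spot-checked numerically. The sketch below evaluates the generalised force of Eq. (2.49) by central finite differences, using $V_i = -eA_i$ and $V = e\phi$, and compares it with the Lorentz force of Eq. (2.50). The particular potentials (a uniform magnetic field along $z$ and uniform electric field components along $x$ and $z$), the numerical values and all function names are assumptions chosen here for illustration.

```python
# Generalised force from U = V_i*qdot_i + V, Eq. (2.49), with V_i = -e*A_i and
# V = e*phi, compared against the Lorentz force e(E + v x B).
# The potentials below are illustrative choices, not taken from the text.

E_CHARGE = 1.6
B0, E0, ET = 2.0, 0.5, 0.3

def phi(r, t):
    return -E0 * r[0]                              # gives E_x = E0

def vec_a(r, t):
    x, y, z = r
    return [-0.5 * B0 * y, 0.5 * B0 * x, -ET * t]  # gives B_z = B0 and E_z = ET

def v_scalar(r, t):
    return E_CHARGE * phi(r, t)

def v_comp(i):
    return lambda r, t: -E_CHARGE * vec_a(r, t)[i]

def d_dx(f, r, t, j, h=1e-5):
    """Central difference of f with respect to coordinate j."""
    rp, rm = list(r), list(r)
    rp[j] += h
    rm[j] -= h
    return (f(rp, t) - f(rm, t)) / (2 * h)

def d_dt(f, r, t, h=1e-5):
    """Central difference of f with respect to explicit time."""
    return (f(r, t + h) - f(r, t - h)) / (2 * h)

def generalised_force(r, v, t):
    force = []
    for j in range(3):
        q = -d_dx(v_scalar, r, t, j) + d_dt(v_comp(j), r, t)
        for i in range(3):
            q += (d_dx(v_comp(j), r, t, i) - d_dx(v_comp(i), r, t, j)) * v[i]
        force.append(q)
    return force

def lorentz_force(v):
    e_field = [E0, 0.0, ET]
    v_cross_b = [v[1] * B0, -v[0] * B0, 0.0]       # v x B for B = (0, 0, B0)
    return [E_CHARGE * (e_field[j] + v_cross_b[j]) for j in range(3)]

r, v, t = [0.3, -1.1, 0.7], [1.2, 0.4, -0.9], 2.0
from_potential = generalised_force(r, v, t)
direct = lorentz_force(v)
print(from_potential, direct)
```

Because the chosen potentials are linear in position and time, the finite differences are exact up to rounding, so the two force vectors agree closely.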


2.9 GENERALISED MOMENTA AND ENERGY 


In Lagrangian dynamics, it would be proper to define both momentum and energy in
terms of the given Lagrangian. We know that the Newtonian momentum is, by definition,
$p_i = mv_i = \partial T/\partial \dot{x}_i$ in Cartesian coordinates. In terms of the Lagrangian, this $p_i$ would
correspond to $\partial L/\partial \dot{x}_i$, provided the potential energy part of $L$ is independent of both
the $\dot{x}_i$'s and $t$. By this analogy, the generalised momentum $p_i$ corresponding to a generalised
coordinate $q_i$ is defined as

$$p_i = \frac{\partial L}{\partial \dot{q}_i} \qquad (2.53)$$




The generalised momentum defined in the above way is also known as the canonical
momentum. Now suppose $L = T - V$, where $V$ does not depend on the $\dot{q}_i$'s, then

$$p_i = \frac{\partial L}{\partial \dot{q}_i} = \frac{\partial T}{\partial \dot{q}_i} = \frac{\partial}{\partial \dot{q}_i}(T_1 + T_2) \qquad (2.54)$$

giving

$$\dot{q}_i p_i = 2T_2 + T_1 = 2T - T_1 - 2T_0 \qquad (2.55)$$

Therefore for scleronomic systems having ordinary potential forces,

$$\dot{q}_i p_i = 2T \qquad (2.56)$$

If, on the other hand, $L = T - U$, where $U = V_i \dot{q}_i + V$, we get

$$p_i = \frac{\partial (T - U)}{\partial \dot{q}_i} = \frac{\partial T}{\partial \dot{q}_i} - V_i \qquad (2.57)$$

Thus the generalised momentum does not arise totally from the kinetic energy term, but also
from the generalised potential energy term, which, being velocity dependent in the most
general case, carries some amount of canonical momentum. If the generalised potential energy
function is linear in $\dot{q}_i$, as noted in the previous section (see Eq. (2.48)), then from Eq. (2.57),

$$\dot{q}_i p_i = 2T_2 + T_1 - U + V = 2T - U + V - T_1 - 2T_0 \qquad (2.58)$$

Hence, for a scleronomic system having both gyroscopic and ordinary potential forces,

$$\dot{q}_i p_i = 2T - U + V \qquad (2.59)$$

Obviously, the two members of each of the pairs Eqs (2.55) and (2.58), and Eqs (2.56) and (2.59),
are not identical, the difference clearly arising from the two different definitions of $L$.


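We have already seen in section 2.5 that energy is a well-defined, conserved quantity for
systems which are conservative in nature. A system is conservative when it is non-dissipative
and scleronomic in nature and its ordinary potential energy function has no explicit time
dependence. Let us now see whether under the same conditions, a given
Lagrangian can also specify the same energy integral. For a given $L = L(q_i, \dot{q}_i, t)$, using the
most general form of Euler-Lagrange's equations of motion, its total time derivative is

$$\frac{dL}{dt} = \frac{\partial L}{\partial t} + \frac{\partial L}{\partial q_i}\dot{q}_i + \frac{\partial L}{\partial \dot{q}_i}\ddot{q}_i = \frac{\partial L}{\partial t} + \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\dot{q}_i\right) - \left[\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) - \frac{\partial L}{\partial q_i}\right]\dot{q}_i = \frac{\partial L}{\partial t} + \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\dot{q}_i\right) - Q'_i \dot{q}_i$$

If the system is non-dissipative ($Q'_i \dot{q}_i = 0$) and the Lagrangian is explicitly time independent,
then

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\dot{q}_i - L\right) = 0$$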




Thus the quantity

$$J = \frac{\partial L}{\partial \dot{q}_i}\dot{q}_i - L = p_i \dot{q}_i - L = \text{const.} = J_0 \ \text{(say)} \qquad (2.60)$$

is a constant of motion. This integral of motion $J_0$ is by definition called the Jacobi integral
of the system, and the function $J$, we shall see later, corresponds to nothing else but the
Hamiltonian, provided all $\dot{q}_i$'s are substituted properly by functions of the $p_i$'s. From Eqs (2.55)
and (2.58), the Jacobi integral in general corresponds to

$$J_0 = \frac{\partial L}{\partial \dot{q}_i}\dot{q}_i - L = T + V - T_1 - 2T_0 = E - T_1 - 2T_0 \qquad (2.61)$$

and is identical for both the definitions of $L$, namely $L = T - V$ and $L = T - U$.
Furthermore, $\partial L/\partial t = 0$ would also imply $\partial T/\partial t = 0$, and as an antecedent, $T_1 = T_0 = 0$.
Thus, $J_0 = E_0$, if $L$ is defined as either $T - V$ or $T - U$, with the only required condition
that $\partial L/\partial t = 0$.

However, the Lagrangian $L$ need not always be defined as $T - V$ or $T - U$, and the
existence of a Jacobi integral would not necessarily mean that the system be conservative.
In fact, there are examples of non-dissipative rheonomic systems (see problem nos. 2.11 and
2.12), which have their Lagrangians explicitly independent of time, thus defining a Jacobi
integral instead of an energy integral. However, for any conservative system, irrespective
of its potential energy functions ($U$ or $V$), its total energy is always given by $T + V$, never
by $T + U$, which is justified from the expression (2.61). The velocity dependent part of the
potential energy $U$ does not enter into the expression for the total energy, simply because
the work done by gyroscopic forces is zero.
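The difference between a Jacobi integral and an energy integral can be seen in a small numerical sketch. It assumes a bead sliding freely on a straight wire that rotates in a horizontal plane at constant angular speed; this standard rheonomic example, its parameter values and the function names are chosen here for illustration and are not necessarily the systems of the cited problems.

```python
import math

# Bead on a straight wire rotating in a horizontal plane at constant angular
# speed OMEGA (an assumed illustrative system).
# L = MASS*(rdot**2 + r**2*OMEGA**2)/2 has no explicit time dependence.

MASS, OMEGA, R0 = 0.5, 1.5, 0.2

def bead_state(t):
    """Solution of r'' = OMEGA**2 * r with r(0) = R0 and rdot(0) = 0."""
    return R0 * math.cosh(OMEGA * t), R0 * OMEGA * math.sinh(OMEGA * t)

def jacobi(r, rdot):
    # J = rdot * dL/drdot - L = T_2 - T_0
    return 0.5 * MASS * rdot ** 2 - 0.5 * MASS * (OMEGA * r) ** 2

def kinetic_energy(r, rdot):
    # E = T because V = 0, with T = T_2 + T_0
    return 0.5 * MASS * rdot ** 2 + 0.5 * MASS * (OMEGA * r) ** 2

times = [0.25 * i for i in range(9)]              # t = 0 ... 2
jacobi_values = [jacobi(*bead_state(t)) for t in times]
energy_values = [kinetic_energy(*bead_state(t)) for t in times]

print(jacobi_values)
print(energy_values)
```

Here $T_1 = 0$ and $T_0 = \tfrac{1}{2}m\omega^2 r^2$, so Eq. (2.61) gives $J = E - 2T_0$: the Jacobi integral stays fixed while the energy grows, because the constraint does work on the bead.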

Thus for a charged particle moving in a static magnetic field, the generalised momentum
of the particle can be obtained from Eqs (2.52) and (2.53):

$$\mathbf{p} = m\mathbf{v} + e\mathbf{A} \qquad (2.62)$$

The first term is the mechanical or the Newtonian momentum of the charged particle and
the second term is a contribution from the value of the vector potential $\mathbf{A}$ at the point where
the particle is presently located. Moreover, the vectors $\mathbf{p}$ and $\mathbf{v}$ are no longer collinear. For
this system, $\mathbf{p} \cdot \mathbf{v} = mv^2 + e(\mathbf{v} \cdot \mathbf{A})$ and the total energy $E = T + V = \tfrac{1}{2}mv^2 + e\phi$ is an
integral of motion. Electromagnetism is thus well described by the Lagrangian scheme.

However, for a relativistically moving free particle, $T = (m - m_0)c^2$, where
$m = m_0/\sqrt{1 - v^2/c^2}$, $m_0$ being the rest mass. If we proceed to define $L$ as $L = T - V$ with $V = 0$,
the canonical momentum $\mathbf{p} = \partial L/\partial \dot{\mathbf{r}} \neq m\mathbf{v}$ leads to a point of contradiction. It is therefore
advisable to relax the idea of defining $L$ as $T - V$ or $T - U$ as a necessary condition, and
one is left free to choose a Lagrangian which would produce the correct set of equations of
motion and the canonical momentum (see Eq. (5.15) for example).

2.10 GAUGE FUNCTION FOR LAGRANGIAN 

Let $L(q_i, \dot{q}_i, t)$ be the Lagrangian of a system and $F(q_i, t)$ be any differentiable function.




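Then $L + dF/dt$ also satisfies Euler-Lagrange's equations of motion. For,

$$\begin{aligned}
\frac{d}{dt}\left(\frac{\partial (dF/dt)}{\partial \dot{q}_i}\right) - \frac{\partial}{\partial q_i}\left(\frac{dF}{dt}\right)
&= \frac{d}{dt}\left[\frac{\partial}{\partial \dot{q}_i}\left(\frac{\partial F}{\partial t} + \frac{\partial F}{\partial q_j}\dot{q}_j\right)\right] - \frac{\partial}{\partial q_i}\left(\frac{\partial F}{\partial t} + \frac{\partial F}{\partial q_j}\dot{q}_j\right) \\
&= \frac{\partial}{\partial t}\left(\frac{\partial F}{\partial q_i}\right) + \frac{\partial}{\partial q_j}\left(\frac{\partial F}{\partial q_i}\right)\dot{q}_j - \frac{\partial^2 F}{\partial q_i \partial t} - \frac{\partial^2 F}{\partial q_i \partial q_j}\dot{q}_j \\
&= 0
\end{aligned}$$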


$F(q_i, t)$ is called the gauge function for the Lagrangian. This introduces an arbitrariness
in the form of the Lagrangian that can preserve the same form of the equations of motion
written in terms of $L$. So if $L = L(q_i, \dot{q}_i, t)$ can produce a set of equations of motion
through the Euler-Lagrange equations of motion, then any other Lagrangian $L' = L(q_i, \dot{q}_i, t) +
dF(q_i, t)/dt$ formed from any arbitrary choice of $F(q_i, t)$ can be plugged into the same
Euler-Lagrange equations of motion, replacing the original $L$. The explicit forms of the equations
of motion in $q_i$ and $t$ would not be different for these two different Lagrangians.

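However, the generalised momenta corresponding to $L' = L + dF/dt$ are

$$p'_i = \frac{\partial L'}{\partial \dot{q}_i} = \frac{\partial L}{\partial \dot{q}_i} + \frac{\partial}{\partial \dot{q}_i}\left(\frac{\partial F}{\partial t} + \frac{\partial F}{\partial q_j}\dot{q}_j\right) = p_i + \frac{\partial F}{\partial q_i} \qquad (2.63)$$

Similarly for the Jacobi integral (or the energy in case the system is conservative) we have

$$J' = p'_i \dot{q}_i - L' = J + \dot{q}_i \frac{\partial F}{\partial q_i} - \frac{dF}{dt} = J - \frac{\partial F}{\partial t} \qquad (2.64)$$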


Thus, under Lagrangian gauge transformation, the canonical momenta and the Jacobi 
integral change in the above manner. The momenta change due to explicit spatial variation 
of the gauge function F and the energy-like Jacobi integral changes due to the explicit time 
variation of F. In the process the new Lagrangian effectively acquires a new gyroscopic 
potential energy and its ordinary potential energy is also modified. 
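A short worked example may make Eqs (2.63) and (2.64) concrete. It assumes a one-dimensional harmonic oscillator and the gauge function $F = \lambda q t$ with $\lambda$ an arbitrary constant; both are choices made here for illustration and are not taken from the text.

```latex
\begin{aligned}
L &= \tfrac{1}{2} m \dot{q}^2 - \tfrac{1}{2} k q^2, \qquad
F(q,t) = \lambda q t, \qquad
\frac{dF}{dt} = \lambda q + \lambda t \dot{q}, \\
L' &= L + \lambda q + \lambda t \dot{q}, \\
\frac{d}{dt}\frac{\partial L'}{\partial \dot{q}} - \frac{\partial L'}{\partial q}
  &= \frac{d}{dt}\left(m \dot{q} + \lambda t\right) - \left(-kq + \lambda\right)
   = m \ddot{q} + k q, \\
p' &= m \dot{q} + \lambda t = p + \frac{\partial F}{\partial q}, \\
J' &= p' \dot{q} - L'
   = \tfrac{1}{2} m \dot{q}^2 + \tfrac{1}{2} k q^2 - \lambda q
   = J - \frac{\partial F}{\partial t}.
\end{aligned}
```

The equation of motion is unchanged, while the momentum and the Jacobi integral shift exactly as stated. In this example $J$ is conserved but $J'$ is not, since $L'$ depends explicitly on time.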


2.11 INVARIANCE OF THE EULER-LAGRANGE EQUATIONS OF MOTION 
UNDER GENERALISED COORDINATE TRANSFORMATIONS 


Let $q_1, \ldots, q_n$ be the old set of generalised coordinates and $Q_1, \ldots, Q_n$ be the new set
(keeping the same time parameter $t$ for both) so that an admissible transformation between
these two sets of coordinates is given by

$$\begin{aligned}
q_1 &= q_1(Q_1, \ldots, Q_n, t) \\
&\ \,\vdots \\
q_n &= q_n(Q_1, \ldots, Q_n, t)
\end{aligned} \qquad (2.65)$$


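So we can still retain the same $n$-dimensional ($R^n$) configuration space, but the new set
of the $n$ curvilinear coordinate axes $Q_j$'s must all be suitably reoriented according to the
inverse of the transformation Eqs (2.65). Now we have, from Eq. (2.65),

$$\dot{q}_i = \frac{\partial q_i}{\partial Q_j}\dot{Q}_j + \frac{\partial q_i}{\partial t}$$

which gives

$$\frac{\partial \dot{q}_i}{\partial \dot{Q}_j} = \frac{\partial q_i}{\partial Q_j}$$

Further,

$$\frac{d}{dt}\left(\frac{\partial q_i}{\partial Q_j}\right) = \frac{\partial \dot{q}_i}{\partial Q_j}$$

and

$$0 = \frac{\partial t}{\partial Q_j} = \frac{\partial t}{\partial \dot{Q}_j} = \frac{\partial q_i}{\partial \dot{Q}_j}$$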

All these conditions are derived from Eqs (2.65), and they merely restate the same
transformation Eqs (2.65) in the differential form. Actually, these differential conditions will be
used to carry out the necessary transformations for the Euler-Lagrange equations of motion.
Being differential in nature, any generalised coordinate transformation representing a
constant translation in the coordinates will in no way affect the Euler-Lagrange equations of
motion. Usually such uniform coordinate translations are ignored. This is the reason why
one restricts oneself to a class of admissible transformations devoid of any uniform
translation. Such transformations are called point transformations, by definition. Translations
are eliminated by demanding that at least one point in the configuration space must remain
unchanged.


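Now on substitution of $q_i = q_i(Q_1, \ldots, Q_n, t)$ and $\dot{q}_i = \dot{q}_i(Q_1, \ldots, Q_n, \dot{Q}_1, \ldots, \dot{Q}_n, t)$ in
$L(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$ we get the transformed Lagrangian as $\bar{L}(Q_1, \ldots, Q_n, \dot{Q}_1, \ldots, \dot{Q}_n, t)$,
which retains the old value of $L$ at the corresponding points. Only the functional form of
$L$ changes to $\bar{L}$ in the new coordinates. Now consider,

$$\begin{aligned}
\frac{d}{dt}\left(\frac{\partial \bar{L}}{\partial \dot{Q}_j}\right) - \frac{\partial \bar{L}}{\partial Q_j}
&= \frac{d}{dt}\left(\frac{\partial L}{\partial q_i}\frac{\partial q_i}{\partial \dot{Q}_j} + \frac{\partial L}{\partial \dot{q}_i}\frac{\partial \dot{q}_i}{\partial \dot{Q}_j} + \frac{\partial L}{\partial t}\frac{\partial t}{\partial \dot{Q}_j}\right)
 - \left(\frac{\partial L}{\partial q_i}\frac{\partial q_i}{\partial Q_j} + \frac{\partial L}{\partial \dot{q}_i}\frac{\partial \dot{q}_i}{\partial Q_j} + \frac{\partial L}{\partial t}\frac{\partial t}{\partial Q_j}\right) \\
&= \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\frac{\partial q_i}{\partial Q_j}\right) - \left[\frac{\partial L}{\partial q_i}\frac{\partial q_i}{\partial Q_j} + \frac{\partial L}{\partial \dot{q}_i}\frac{d}{dt}\left(\frac{\partial q_i}{\partial Q_j}\right)\right] \\
&= \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right)\frac{\partial q_i}{\partial Q_j} + \frac{\partial L}{\partial \dot{q}_i}\frac{d}{dt}\left(\frac{\partial q_i}{\partial Q_j}\right) - \frac{\partial L}{\partial q_i}\frac{\partial q_i}{\partial Q_j} - \frac{\partial L}{\partial \dot{q}_i}\frac{d}{dt}\left(\frac{\partial q_i}{\partial Q_j}\right) \\
&= \left[\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) - \frac{\partial L}{\partial q_i}\right]\frac{\partial q_i}{\partial Q_j}
\end{aligned}$$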


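Thus if the most general Euler-Lagrange's equations of motion with Cartesian components
of nonpotential forces as $F'_k$ are valid in terms of the old set of coordinates, that is, if

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) - \frac{\partial L}{\partial q_i} = F'_k \frac{\partial x_k}{\partial q_i}$$

then

$$\frac{d}{dt}\left(\frac{\partial \bar{L}}{\partial \dot{Q}_j}\right) - \frac{\partial \bar{L}}{\partial Q_j} = F'_k \frac{\partial x_k}{\partial q_i}\frac{\partial q_i}{\partial Q_j} = F'_k \frac{\partial x_k}{\partial Q_j}$$

is also valid for every $Q_j$ coordinate.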

Therefore, Euler-Lagrange's equations of motion, in their most general form, remain
invariant in form under the most general (that is, time dependent) generalised coordinate
transformation.

This is a remarkable result. While introducing the idea of the generalised
coordinates towards the beginning of this chapter, we could not prescribe any method by which
one can choose a set of generalised coordinates. Now it is obvious that if $n$ is the number
of the DOF of a holonomic (and bilateral) system, one can choose any set of $n$ independent
coordinate variables with some explicit prescribed relations given in the form of Eq. (2.1),
express the Lagrangian in terms of these $n$ independent coordinates and time, and yet
write the same Euler-Lagrange's equations of motion (given in the form of Eq. (2.14)) in
order to derive the equations of motion in these coordinates. So far as the basic form of
Euler-Lagrange's equations of motion is concerned, it really does not matter at all what
set of generalised coordinates is chosen for describing the motion. But if we want to write
Newton's laws of motion in a given set of generalised coordinates, there is no standard
or unique form of the equation of motion available except for the rectangular Cartesian
coordinates, where we ought to worry about the constraint forces. In the Euler-Lagrange
formalism, the basic equations of motion have a unique form given by Eq. (2.14),
which must remain strictly valid for any complete set of generalised coordinates. The only major
task is then to formulate the explicit form of $L$ in terms of the chosen set of generalised
coordinates.

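Next consider the quantity

$$\begin{aligned}
P_j \dot{Q}_j = \frac{\partial \bar{L}}{\partial \dot{Q}_j}\dot{Q}_j
&= \left(\frac{\partial L}{\partial \dot{q}_i}\frac{\partial \dot{q}_i}{\partial \dot{Q}_j}\right)\left(\frac{\partial Q_j}{\partial q_k}\dot{q}_k + \frac{\partial Q_j}{\partial t}\right) \\
&= \frac{\partial L}{\partial \dot{q}_i}\frac{\partial q_i}{\partial Q_j}\frac{\partial Q_j}{\partial q_k}\dot{q}_k + \frac{\partial L}{\partial \dot{q}_i}\frac{\partial q_i}{\partial Q_j}\frac{\partial Q_j}{\partial t} \\
&= \frac{\partial L}{\partial \dot{q}_i}\delta_{ik}\dot{q}_k - \frac{\partial L}{\partial \dot{q}_i}\frac{\partial q_i}{\partial t} \\
&= p_i \dot{q}_i - p_i \frac{\partial q_i}{\partial t}
\end{aligned}$$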


Therefore the quantity $P_j \dot{Q}_j$ remains invariant only if $q_i = q_i(Q_j)$, that is, if the
transformation does not depend explicitly on time. Again, since $P_j = p_i(\partial q_i/\partial Q_j)$, the transformed
momenta differ from the original ones even for a pure (time independent) coordinate
transformation, whereas the transformed energy-like Jacobi integral $P_j \dot{Q}_j - \bar{L}$ can differ from
the original Jacobi integral $J = p_i \dot{q}_i - L$ only if the coordinate transformations are
explicitly time dependent (that is, $\partial q_i/\partial t \neq 0$). So this provides a way by which one
can preserve the value of $L$ and the form of Euler-Lagrange's equations of motion, while the
values of the Jacobi integral and canonical momenta may change simply due to the imposed
coordinate transformation, which leads to reorientation and differential expansion/contraction of
the coordinate axes drawn in the same old $n$-dimensional configuration space.
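A minimal worked example, assuming a free particle in one dimension and the time dependent transformation $q = Q + ut$ with constant $u$ (both chosen here for illustration), shows these changes explicitly:

```latex
\begin{aligned}
L &= \tfrac{1}{2} m \dot{q}^2, \qquad q = Q + ut, \qquad
\dot{q} = \dot{Q} + u, \qquad \frac{\partial q}{\partial t} = u, \\
\bar{L} &= \tfrac{1}{2} m (\dot{Q} + u)^2, \qquad
P = \frac{\partial \bar{L}}{\partial \dot{Q}} = m(\dot{Q} + u)
  = p\,\frac{\partial q}{\partial Q} = p, \\
P \dot{Q} &= p(\dot{q} - u) = p\dot{q} - p\,\frac{\partial q}{\partial t}, \\
\bar{J} &= P\dot{Q} - \bar{L} = \tfrac{1}{2} m (\dot{Q}^2 - u^2) = J - pu,
\qquad J = p\dot{q} - L = \tfrac{1}{2} m \dot{q}^2 .
\end{aligned}
```

Both $J$ and $\bar{J}$ are constants of this motion, but their values differ by $pu$, which comes entirely from the explicit time dependence of the transformation.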


2.12 CYCLIC OR IGNORABLE COORDINATES 


The Lagrangian of any physical system is generally expected to have explicit dependence
on all the generalised coordinates $q_i$, all the generalised velocities $\dot{q}_i$ and time $t$, that is,

$$L = L(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$$

where $n$ is the total number of generalised coordinates. If, for some reason, some of the
generalised coordinates do not appear explicitly in the expression for the Lagrangian, these
coordinates are by definition called cyclic or ignorable coordinates. Any change in these
coordinates cannot affect the Lagrangian.

Example: The Lagrangian for a projectile moving under the earth's approximately constant
field of gravity is given by

$$L = \tfrac{1}{2}mv^2 - mgz = \tfrac{1}{2}m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2) - mgz$$

where $g$ is the constant acceleration due to gravity. Here $(x, y, z)$ is the set of the Cartesian
coordinates of the projectile of mass $m$, $z$ being the vertical component (upward positive).
Obviously, in this example, $x$ and $y$ are the cyclic coordinates.


We now prove the following theorem. 

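Theorem: In the absence of any nonpotential forces, the generalised momentum
corresponding to any cyclic coordinate is a conserved quantity.

Proof: Let $q_i$ be a cyclic coordinate; then by its definition

$$\frac{\partial L}{\partial q_i} = 0$$

Euler-Lagrange's equation for the same coordinate reduces to

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) = 0$$

Therefore,

$$p_i = \frac{\partial L}{\partial \dot{q}_i}$$

is a constant of motion.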




In the above example, $p_y = \partial L/\partial \dot{y} = m\dot{y}$ and $p_x = \partial L/\partial \dot{x} = m\dot{x}$ are the two
conserved components of linear momentum, corresponding to the cyclic coordinates $y$ and
$x$ respectively. As expected, the horizontal components of the linear momentum of the
projectile are conserved.
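Whether a coordinate is cyclic can also be probed numerically, by testing whether $\partial L/\partial q_i$ vanishes at a number of sample points. The sketch below does this for the projectile Lagrangian; the sampling approach, the tolerance and the function names are assumptions made here, and such a test only suggests (it does not prove) that a coordinate is cyclic.

```python
import random

# Projectile Lagrangian L = m*(vx^2 + vy^2 + vz^2)/2 - m*g*z.
# Mass and g are illustrative values.

def lagrangian(q, qdot, m=2.0, g=9.81):
    x, y, z = q
    vx, vy, vz = qdot
    return 0.5 * m * (vx * vx + vy * vy + vz * vz) - m * g * z

def cyclic_indices(L, n, samples=20, h=1e-6, tol=1e-6, seed=0):
    """Return indices i for which dL/dq_i is numerically zero at all samples."""
    rng = random.Random(seed)
    cyclic = []
    for i in range(n):
        is_cyclic = True
        for _ in range(samples):
            q = [rng.uniform(-5.0, 5.0) for _ in range(n)]
            qdot = [rng.uniform(-5.0, 5.0) for _ in range(n)]
            q_plus, q_minus = list(q), list(q)
            q_plus[i] += h
            q_minus[i] -= h
            dL_dq = (L(q_plus, qdot) - L(q_minus, qdot)) / (2 * h)
            if abs(dL_dq) > tol:
                is_cyclic = False
                break
        if is_cyclic:
            cyclic.append(i)
    return cyclic

cyclic = cyclic_indices(lagrangian, 3)
print(cyclic)   # indices 0 and 1 stand for x and y
```

For this Lagrangian the indices of $x$ and $y$ are reported as cyclic, while $z$ is not, since $\partial L/\partial z = -mg$.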

2.13 INTEGRALS OF MOTION 

For any mechanical system there exist some functions of the $q_i$'s and $\dot{q}_i$'s, whose values remain
constant throughout the motion of the system, in spite of the fact that the values of the $q_i$'s
and $\dot{q}_i$'s are all changing with time. Such functions are by definition called the
integrals of motion.

The general solution of Euler-Lagrange's equations of motion, for any mechanical system
having the number of DOF $= n$, has the form

$$q_i = q_i(t, c_1, \ldots, c_{2n}), \qquad i = 1, \ldots, n$$

and

$$\dot{q}_i = \dot{q}_i(t, c_1, \ldots, c_{2n}), \qquad i = 1, \ldots, n$$

where $c_1, \ldots, c_{2n}$ are the $2n$ constants of integration. One can arbitrarily choose any one
of these $2n$ functions and construct an expression for $t$ by inverting the relation and then
substitute this expression for $t$ in the remaining $2n - 1$ functions. Therefore, one can have a
maximum number of $2n - 1$ independent integrals of motion for any mechanical system
having $n$ degrees of freedom. To see that these $2n - 1$ expressions are the constants of
motion, note that these expressions can be inverted to express the constants $c_1, \ldots, c_{2n-1}$
as functions of $(q_i, \dot{q}_i,\ i = 1, \ldots, n)$, which are also the integrals of motion. There can be
infinitely many ways of expressing the constants of motion, but only a maximum of $2n - 1$
would be independent of each other and have no explicit dependence on time. Of course,
there would always remain one more independent constant of motion, which will have an
explicit dependence on time.

The significance of the existence of 2n - 1 constants of motion will become clear when 
we talk about the phase space of 2n dimensions. The specification of 2n - 1 functions 
in that space would simply mean that only one degree of freedom will be left, which will 
of course be a curve in the 2n -dimensional space. Obviously, this curve ought to represent 
the unique trajectory of the system in the phase space subject to the specification of all the 
2n - 1 integrals of motion. Along the trajectory time will change, and this is going to be 
fixed by the one remaining, namely the explicitly time dependent constant of motion. In 
the n -dimensional configuration space, the system will also describe a unique trajectory, 
but for a given trajectory, there would be n independent choices of the initial conditions 
that would lead to the same final trajectory. This tells us how important it would be to 
consider phase space for it uniquely (or canonically, meaning providentially destined) defines 
the trajectories. 
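For $n = 1$ this counting gives $2n - 1 = 1$ time independent integral plus one explicitly time dependent constant. The sketch below illustrates this for a harmonic oscillator, an example assumed here for illustration: the energy plays the role of the time independent integral, and the initial phase, recovered from $q$, $\dot{q}$ and $t$, is the time dependent one. Parameter values and function names are hypothetical.

```python
import math

# Harmonic oscillator q(t) = AMPLITUDE * sin(OMEGA*t + PHASE); values illustrative.
MASS, K = 1.0, 4.0
OMEGA = math.sqrt(K / MASS)
AMPLITUDE, PHASE = 1.5, 1.0

def state(t):
    q = AMPLITUDE * math.sin(OMEGA * t + PHASE)
    qdot = AMPLITUDE * OMEGA * math.cos(OMEGA * t + PHASE)
    return q, qdot

def energy(q, qdot):
    # time independent integral of motion
    return 0.5 * MASS * qdot ** 2 + 0.5 * K * q ** 2

def phase_constant(q, qdot, t):
    # explicitly time dependent constant; equals PHASE up to a multiple of 2*pi
    return math.atan2(OMEGA * q, qdot) - OMEGA * t

times = [0.1 * i for i in range(50)]
energies = [energy(*state(t)) for t in times]
phases = [phase_constant(*state(t), t) for t in times]

print(min(energies), max(energies))
```

Along the trajectory the energy stays at $\tfrac{1}{2}kA^2$, and `phase_constant` returns the same angle (modulo $2\pi$) at every instant even though it depends explicitly on $t$.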

2.14 CONCEPT OF SYMMETRY: HOMOGENEITY AND ISOTROPY 

If any system or any function representing a property of the system does not change under 




some operation (defined on the system), the system is said to possess a symmetry with 
respect to the given operation. 


2.14.1 Examples 

1. When a cylinder is rotated about its axis by an arbitrary angle its apparent shape does 
not change. The cylinder is said to have rotational symmetry about its axis. 

2. The size, shape and position of a homogeneous sphere remains invariant under any 
arbitrary rotation about any axis passing through its centre. By measuring any property of 
the sphere, it is impossible to detect whether such a rotation has at all taken place. As a 
consequence, the form of the equation of a sphere with respect to an origin coinciding with 
its centre does not change no matter how we choose the directions of the x,y,z axes. 

3. We wish to give this example of a symmetry operation expressed in an abstract manner. We
know that Euler-Lagrange's equations of motion change their form neither under point
transformations nor when we add to the Lagrangian a total time derivative
of any arbitrary function $F(q, t)$. The latter is called the Lagrangian gauge symmetry of
Euler-Lagrange's equations of motion.

Homogeneity of space: If for any arbitrary displacement of the origin of any reference frame 
the physical properties of all closed systems remain unaffected, we say that space is homoge¬ 
neous. Thus every point in space is equivalent to every other point for the description of the 
state of motion of any closed system, such as the universe. A closed system , by definition is 
the one that is not acted upon by any field of force, whose source is external to the system. 

Isotropy of space: If for any arbitrary rotation about the origin of any reference frame the 
physical properties of any closed system remain unaffected, we say that space is isotropic. 
Thus every direction in space is as good as any other direction for the description of any 
closed system, and the choice of the Cartesian axes can be arbitrary in direction. 

Homogeneity of time: If for any arbitrary displacement of the origin of time, the physical 
properties of any closed system remain unaffected we say that time is homogeneous. Every 
moment of time is as good as any other moment of time for the description of a closed 
system. 

Note that the above properties of space and time mean the invariance of physical prop¬ 
erties under certain kinds of symmetry operations. The configuration and states of motion 
related by these operations are equivalent. These symmetries correspond to invariance under 
arbitrary translation, arbitrary rotation about arbitrary axis and the operation of ‘waiting’, 
that is, the passage of time, respectively. Conversely, space can be said to be homogeneous 
and isotropic, and time as homogeneous, only if the states of motion of all closed systems 
are found to be invariant under these operations. Thus the homogeneity of space and time 
and isotropy of space are guaranteed only for a system which is not acted upon by any 
external force. 

A closed system is described as any other system by its Lagrangian. In particular, we 
require that the Lagrangian of a closed system should be invariant under the operations of 




translation and rotation in space and that it should not change just by waiting, that is, it 
should not depend explicitly on time. 

Consider a frame of reference in which space and time are not homogeneous and space is 
not isotropic. Such a frame may be realised if it is accelerating with respect to some fixed 
frame. In this frame, the physical state of motion of a closed system may change by mere 
translation in space or by mere passage of time. A change in the physical state of motion of a 
system will, in general, involve a change in its acceleration. Thus a system which was at rest 
at some time may suddenly start moving after some time, seemingly without experiencing 
any action of the external force of any kind. This means that Newton’s laws of motion are 
not valid in such a frame. We have defined inertial frame of reference to be the frame in 
which Newton’s laws of motion are valid. Thus we require that in an inertial frame space 
and time must be homogeneous and isotropic, for the description of any closed system. This 
can be taken to be an alternative definition of the inertial frame. 

We have seen that the Lagrangian of a closed system must be invariant under the trans¬ 
lations and rotations in space. In other words, these are the symmetry operations for the 
Lagrangian. These symmetries in Lagrangian have very important consequences. Each of 
these symmetries corresponds to a conservation law, giving rise to a conserved quantity or 
an integral of motion which is additive. Additivity means that the value of the integral of 
motion for the whole system is the sum of its values for various parts of the system. 

It turns out that every symmetry in the Lagrangian corresponds to a conservation law. 
This statement was rigorously proved by Emmy Noether in 1918, and is called Noether's 
theorem. We shall prove Noether’s theorem later in chapter 6. At the moment we wish to 
obtain the conservation laws bred by the homogeneity of time and space, and the isotropy 
of space. 

(a) Homogeneity of Time and Conservation of Energy 

The Lagrangian of a closed system should not have any explicit dependence on time so
that $\partial L/\partial t = 0$ and hence, using Lagrange's equations involving nonpotential forces of
external origin,

$$\frac{dJ}{dt} = \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\dot{q}_i - L\right) = Q'_i \dot{q}_i$$

However, for a closed system, all the components of the external forces are zero, including
the nonpotential ones, giving $Q'_i = 0$, so that the Jacobi integral becomes a constant of
motion.

Now since there is no external force, $V$ (and/or $U$) reduce(s) to a constant, say $V_0$, and
$L = T - V_0$. Since $L$ is independent of time, so is $T$ as $V$ is a constant, and therefore,
$T_0 = T_1 = 0$ giving $T = T_2$. The system is therefore totally scleronomic, implying that

$$J = E = T + V_0 = \text{const.}$$


Hence total energy is conserved for any closed system due to homogeneity of time. 

(b) Homogeneity of Space and Conservation of Linear Momentum

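The Lagrangian of a closed system should not change due to any arbitrarily small uniform
translation of all particles. In Cartesian rectangular coordinates this translation can be
written, for the $i$th particle, as

$$\mathbf{r}_i \to \mathbf{r}_i + \delta\mathbf{r}_i$$

where $\delta\mathbf{r}_i = \boldsymbol{\epsilon}$ is a constant vector, infinitesimally small in magnitude. We have

$$\delta L = \sum_i \frac{\partial L}{\partial \mathbf{r}_i} \cdot \delta\mathbf{r}_i = \boldsymbol{\epsilon} \cdot \sum_i \frac{\partial L}{\partial \mathbf{r}_i}$$

We require that $\delta L = 0$ for any arbitrary translation $\boldsymbol{\epsilon}$, which implies, for the whole
system,

$$\sum_i \frac{\partial L}{\partial \mathbf{r}_i} = 0$$

Adding the corresponding terms in Euler-Lagrange's equations of motion for all particles,

$$\sum_i \frac{d}{dt}\left(\frac{\partial L}{\partial \mathbf{v}_i}\right) - \sum_i \frac{\partial L}{\partial \mathbf{r}_i} = 0 \quad \text{so that} \quad \frac{d}{dt}\left(\sum_i \mathbf{p}_i\right) = 0$$

Therefore, the total linear momentum

$$\sum_i \mathbf{p}_i = \mathbf{P} \ \text{(say)} \qquad (2.66)$$

is the quantity that is conserved. Thus the total linear momentum of any closed system is
conserved due to the homogeneity of space.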

(c) Isotropy of Space and the Conservation of Angular Momentum 

The Lagrangian for a closed system should not change due to any arbitrary small rotation
of the reference frame about some arbitrary direction.

An arbitrary small rotation $\delta\theta$ about some direction $\hat{\mathbf{n}}$ passing through the origin brings
about a change in any vector $\mathbf{A}$ given by

$$\delta\mathbf{A} = (\delta\theta\, \hat{\mathbf{n}}) \times \mathbf{A} \qquad (2.67)$$

Therefore all position vectors $\mathbf{r}_i$ will change by

$$\delta\mathbf{r}_i = (\delta\theta\, \hat{\mathbf{n}}) \times \mathbf{r}_i$$

and the velocity vectors $\mathbf{v}_i$ by

$$\delta\mathbf{v}_i = (\delta\theta\, \hat{\mathbf{n}}) \times \mathbf{v}_i$$

Since all vectors ought to change in the above fashion, the following proof is going to be
valid only if $L$ for the closed system does not contain any vectors other than $\mathbf{r}$ and $\mathbf{p}$; not
even a constant vector such as the dipole moment vector is allowed to be present in $L$,
though $L$ always remains a scalar.




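Therefore for any closed system

$$\begin{aligned}
\delta L &= \sum_i \left(\frac{\partial L}{\partial \mathbf{r}_i} \cdot \delta\mathbf{r}_i + \frac{\partial L}{\partial \mathbf{v}_i} \cdot \delta\mathbf{v}_i\right) \\
&= \sum_i \left(\dot{\mathbf{p}}_i \cdot \delta\mathbf{r}_i + \mathbf{p}_i \cdot \delta\mathbf{v}_i\right) \\
&= \delta\theta \sum_i \left[\dot{\mathbf{p}}_i \cdot (\hat{\mathbf{n}} \times \mathbf{r}_i) + \mathbf{p}_i \cdot (\hat{\mathbf{n}} \times \mathbf{v}_i)\right] \\
&= \delta\theta \sum_i \left[\frac{d}{dt}(\mathbf{r}_i \times \mathbf{p}_i) \cdot \hat{\mathbf{n}}\right] \\
&= \delta\theta\, \hat{\mathbf{n}} \cdot \frac{d}{dt} \sum_i (\mathbf{r}_i \times \mathbf{p}_i)
\end{aligned}$$

Since $\delta\theta$ is arbitrary and $\delta L = 0$ due to the required isotropy of space for the description
of a closed system, we have

$$\frac{d}{dt} \sum_i (\mathbf{r}_i \times \mathbf{p}_i) = 0$$

that is,

$$\sum_i \mathbf{r}_i \times \mathbf{p}_i = \mathbf{L} \ \text{(say)}$$

is the conserved quantity which, by definition, is the total moment of momentum or the
total angular momentum about the origin of the closed system.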
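Both conservation laws can be observed in a small simulation of a closed system. The sketch below assumes two particles coupled only by an internal spring force directed along their separation, integrated with the velocity-Verlet scheme; the system, the numbers and the helper names are illustrative assumptions.

```python
# Two particles coupled by an internal spring force along their separation.
# No external force acts, so the system is closed. All numbers are illustrative.

MASSES = [1.0, 3.0]
SPRING_K = 2.0

def add(a, b):
    return [x + y for x, y in zip(a, b)]

def sub(a, b):
    return [x - y for x, y in zip(a, b)]

def scale(a, s):
    return [x * s for x in a]

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def forces(pos):
    f0 = scale(sub(pos[0], pos[1]), -SPRING_K)   # force on particle 0
    return [f0, scale(f0, -1.0)]                 # equal and opposite on particle 1

def total_momentum(vel):
    total = [0.0, 0.0, 0.0]
    for m, v in zip(MASSES, vel):
        total = add(total, scale(v, m))
    return total

def total_angular_momentum(pos, vel):
    total = [0.0, 0.0, 0.0]
    for m, r, v in zip(MASSES, pos, vel):
        total = add(total, cross(r, scale(v, m)))
    return total

def simulate(pos, vel, h, steps):
    """Velocity-Verlet integration of the two-particle system."""
    f = forces(pos)
    for _ in range(steps):
        vel = [add(v, scale(fi, 0.5 * h / m)) for v, fi, m in zip(vel, f, MASSES)]
        pos = [add(r, scale(v, h)) for r, v in zip(pos, vel)]
        f = forces(pos)
        vel = [add(v, scale(fi, 0.5 * h / m)) for v, fi, m in zip(vel, f, MASSES)]
    return pos, vel

pos0 = [[1.0, 0.0, 0.0], [-0.5, 0.2, 0.3]]
vel0 = [[0.0, 0.8, -0.1], [0.1, -0.3, 0.4]]
pos1, vel1 = simulate(pos0, vel0, 0.001, 2000)

p_before, p_after = total_momentum(vel0), total_momentum(vel1)
l_before = total_angular_momentum(pos0, vel0)
l_after = total_angular_momentum(pos1, vel1)
print(p_before, p_after)
print(l_before, l_after)
```

The interaction depends only on the relative position of the particles, so the Lagrangian is invariant under uniform translations and rotations, and the total linear and angular momenta are unchanged up to rounding.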


2.15 INVARIANCE UNDER GALILEAN TRANSFORMATIONS 


We have defined inertial frames as those in which Newton's laws of motion are valid, or
alternatively, in which the space and time are homogeneous and space is isotropic. Consider 
two inertial frames S and S' in relative motion. In general, various kinematical and physical 
quantities pertaining to the system will have different values in these two inertial frames. 
The problem is to obtain relations between the values of a given quantity measured with 
respect to these two inertial frames. 


An immediate consequence of the requirement that Newton's laws be valid in both the
frames is that the forces acting on the system must be the same in both the frames, that is,
for the $i$th particle

$$m_i \frac{d^2\mathbf{r}_i}{dt^2} = \mathbf{F}_i = m_i \frac{d^2\mathbf{r}'_i}{dt^2}$$

where $i$ is not summed over and we have assumed that the time is universal, that is, it
always has the same value in both the frames. $\mathbf{r}_i$ and $\mathbf{r}'_i$ are the position vectors of the
$i$th particle in S and S' respectively, that is, $\mathbf{r}_i$ becomes $\mathbf{r}'_i$ as we go from S to S'. Thus we
must have

$$\frac{d^2(\mathbf{r}_i - \mathbf{r}'_i)}{dt^2} = 0$$

or

$$\mathbf{r}_i - \mathbf{r}'_i = \mathbf{u}_o (t - t_o) = \mathbf{u}_o t$$

if the origins of S and S' coincided at $t = 0$, rather than at $t = t_o$. The constant of
integration, $\mathbf{u}_o$, stands for the constant relative velocity of S' with respect to S. Thus we
get the basic transformation equations

$$\mathbf{r}'_i = \mathbf{r}_i - \mathbf{u}_o t \tag{2.68}$$

with the implicit assumption $t' = t$. The frames S and S' are said to be connected by a
Galilean transformation if the transformation equations are by definition given by Eq.
(2.68).

Let us now see how the Lagrangian of any system transforms between S and S'. In the
case of the motion of a single particle moving under an external field of force due to an
ordinary potential $V$, the Lagrangian in S' is given by

$$\begin{aligned}
L' &= \tfrac{1}{2} m v'^2 - V \\
&= \tfrac{1}{2} m |\mathbf{v} - \mathbf{u}_o|^2 - V \\
&= \tfrac{1}{2} m v^2 - V - m \mathbf{v} \cdot \mathbf{u}_o + \tfrac{1}{2} m u_o^2 \\
&= L + \frac{d}{dt}\left(\tfrac{1}{2} m u_o^2 t - m \mathbf{u}_o \cdot \mathbf{r}\right) \\
&= L + \frac{dF}{dt} \text{ (say)}
\end{aligned}$$

Note that $V$ has remained the same because $V$ is normally a function of $\mathbf{r}_2 - \mathbf{r}_1$ and
$\mathbf{r}_2 - \mathbf{r}_1 = \mathbf{r}'_2 - \mathbf{r}'_1$ at all instants by Eq. (2.68). Thus through the above gauge function
$F(\mathbf{r},t)$ (see section 2.9) both $L$ and $L'$ must satisfy the same Euler-Lagrange's equations
of motion, that is, the latter preserve their form in S'. This is effected by the Lagrangian
gauge function

$$F(\mathbf{r},t) = \tfrac{1}{2} m u_o^2 t - m \mathbf{u}_o \cdot \mathbf{r} \tag{2.69}$$

provided $\mathbf{u}_o$ is the constant relative velocity of S' with respect to S. So the coordinate
transformation corresponding to any Galilean transformation can be viewed as a Lagrangian
gauge transformation. Thus instead of performing the coordinate transformation given by
Eq. (2.68) we can also directly transform the Lagrangian using the gauge function given by
Eq. (2.69) and obtain the same results.
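
The gauge relation can be spot-checked with numbers. The sketch below assumes a single particle with an arbitrary instantaneous velocity $\mathbf{v}$, a constant boost velocity $\mathbf{u}_o$, and a potential value $V$ that is the same in both frames; it compares the Lagrangian built from $\mathbf{v}' = \mathbf{v} - \mathbf{u}_o$ with $L + dF/dt$, where $dF/dt = \tfrac{1}{2} m u_o^2 - m \mathbf{u}_o \cdot \mathbf{v}$ follows from Eq. (2.69). The function names are illustrative only.

```python
# Sketch: check L' = L + dF/dt for a Galilean boost with constant u_o.

def dot(a, b):
    """Scalar product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))

def lagrangian(m, v, V):
    """L = m v^2 / 2 - V for one particle."""
    return 0.5 * m * dot(v, v) - V

def gauge_rate(m, u, v):
    """dF/dt for F = m u^2 t / 2 - m u.r, with u constant and dr/dt = v."""
    return 0.5 * m * dot(u, u) - m * dot(u, v)

def lagrangian_in_boosted_frame(m, v, u, V):
    """L' evaluated with the boosted velocity v' = v - u."""
    v_prime = [vi - ui for vi, ui in zip(v, u)]
    return lagrangian(m, v_prime, V)

if __name__ == "__main__":
    m, V = 2.0, 0.37
    v = [0.3, -1.2, 0.8]
    u = [1.5, 0.4, -0.6]
    print(lagrangian_in_boosted_frame(m, v, u, V),
          lagrangian(m, v, V) + gauge_rate(m, u, v))
```

The two printed numbers agree for any choice of $\mathbf{v}$ and constant $\mathbf{u}_o$. If $\mathbf{u}_o$ is allowed to depend on time, `gauge_rate` is no longer the total time derivative of a function of $\mathbf{r}$ and $t$ alone, which is the situation taken up for noninertial frames.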

It is also easy to see that in Eq. (2.68) one can eliminate $\mathbf{u}_o$ to obtain

$$\mathbf{r}'_i - \mathbf{r}_i = (\mathbf{v}'_i - \mathbf{v}_i) t$$

so that

$$\mathbf{r}'_i - \mathbf{v}'_i t = \mathbf{r}_i - \mathbf{v}_i t$$

For the whole system this appears to suggest that the quantity $\sum_i m_i(\mathbf{r}_i - \mathbf{v}_i t)$ has the
same value in all inertial frames, which are, of course, connected by Galilean transformations.
Or in other words,

$$\sum_i m_i \mathbf{r}_i - \sum_i m_i \mathbf{v}_i t = M\mathbf{R} - \mathbf{P} t \tag{2.70}$$

is a constant of motion arising out of the symmetry implied by the Galilean invariance of
Newton's equations of motion. Here $M = \sum_i m_i$ is the total mass, $\mathbf{R}$ is the position
vector of the centre of mass and $\mathbf{P}$ is the total linear momentum of the system.

Thus for a closed system we have constructed in all ten additive constants of motion, of 
which three are due to linear momentum, three are due to angular momentum, three are 
due to Galilean invariance and one due to energy. 
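
As a small illustration of Eq. (2.70), the sketch below evaluates $\sum_i m_i(\mathbf{r}_i - \mathbf{v}_i t)$ for a set of particles in a frame S and again in a frame S' obtained by the Galilean transformation $\mathbf{r}' = \mathbf{r} - \mathbf{u}_o t$, $\mathbf{v}' = \mathbf{v} - \mathbf{u}_o$. The particle data and helper names are made up for the example.

```python
# Sketch: sum_i m_i (r_i - v_i t) takes the same value in S and in S'.

def galilean_invariant(masses, positions, velocities, t):
    """Return sum_i m_i (r_i - v_i t) as a 3-component list."""
    out = [0.0, 0.0, 0.0]
    for m, r, v in zip(masses, positions, velocities):
        for c in range(3):
            out[c] += m * (r[c] - v[c] * t)
    return out

def boost(positions, velocities, u, t):
    """Apply r' = r - u t and v' = v - u at time t (constant u)."""
    new_r = [[r[c] - u[c] * t for c in range(3)] for r in positions]
    new_v = [[v[c] - u[c] for c in range(3)] for v in velocities]
    return new_r, new_v

if __name__ == "__main__":
    masses = [1.0, 2.0, 0.5]
    positions = [[0.0, 1.0, 2.0], [1.5, -0.5, 0.0], [-2.0, 0.3, 1.1]]
    velocities = [[0.2, 0.0, -0.1], [-0.4, 0.6, 0.3], [1.0, -1.0, 0.5]]
    u, t = [0.7, -0.2, 0.9], 3.0
    r_p, v_p = boost(positions, velocities, u, t)
    print(galilean_invariant(masses, positions, velocities, t))
    print(galilean_invariant(masses, r_p, v_p, t))
```

The two printed vectors coincide, since the $\mathbf{u}_o t$ terms coming from the positions and from the velocities cancel particle by particle.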

Now if $\mathbf{u}_o$ were a function of $t$, so that frame S' becomes noninertial, and if we insist on
writing for single particle motion

$$L' = \tfrac{1}{2} m |\mathbf{v}'|^2 - V$$

then the equations of motion in S' are given by

$$\frac{d}{dt}\left(\frac{\partial L'}{\partial \mathbf{v}'}\right) - \frac{\partial L'}{\partial \mathbf{r}'} = 0$$

which means

$$m\dot{\mathbf{v}}' + \nabla V = 0 \tag{2.71}$$

But at any instant $t$, $\mathbf{v}' = \mathbf{v} - \mathbf{u}_o$, so that Eq. (2.71) implies

$$m\dot{\mathbf{v}} - m\dot{\mathbf{u}}_o + \nabla V = 0 \tag{2.72}$$

So we do not get back the original equations of motion of the system in S if we define $L'$
in a noninertial frame in the same way as we define it in the inertial frames. We have an
extra force term $-m\dot{\mathbf{u}}_o$ added to the system, consequently violating Newton's second law,
which must apply only to the inertial frames such as S. Equation (2.72) is different from the
original $m\dot{\mathbf{v}} + \nabla V = 0$. Thus the form of Euler-Lagrange's equation is not preserved in
a noninertial frame. Since Newton's laws are not preserved, D'Alembert's principle is also
not valid in the noninertial frames and hence the Lagrangian cannot be constructed simply
from $T = m v'^2/2$, even if $V$ remains unchanged.

It is then important to note that the Lagrangian must always be constructed with respect
to an inertial frame, although it may involve quantities referring to a noninertial frame. An
expression for kinetic energy must be constructed in an inertial frame first, and then by
substituting for inertial velocities in terms of the velocities and coordinates of the noninertial
frame, one would finally obtain a Lagrangian, to be used in terms of the coordinates and
velocities with reference to the noninertial frame. This point will be more fully discussed in
chapter 3, for a particular class of noninertial frames, called the rotating frames of reference.

2.16 LAGRANGIAN FOR FREE PARTICLE MOTION 

A free particle is one which is not acted upon by any external force at all. Hence the
potential functions $V$ or $U$ can be taken as zero, or at most a constant. Therefore, the
Lagrangian $L$ for any free particle is essentially equal to its kinetic energy $T$. Let us see
how the expression for $T$ takes different forms in different coordinate systems. In this case,
the number of DOF is three, so the number of independent integrals of motion is five.

2.16.1 Rectangular Cartesian Coordinate System 
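The Lagrangian is

$$L = T = \tfrac{1}{2} m (\dot{x}^2 + \dot{y}^2 + \dot{z}^2)$$

All the three coordinates are cyclic giving

$$p_x = \frac{\partial L}{\partial \dot{x}} = m\dot{x}, \qquad p_y = \frac{\partial L}{\partial \dot{y}} = m\dot{y}, \qquad p_z = \frac{\partial L}{\partial \dot{z}} = m\dot{z}$$

as the constants of motion. The energy integral is

$$E = \frac{\partial L}{\partial \dot{x}}\dot{x} + \frac{\partial L}{\partial \dot{y}}\dot{y} + \frac{\partial L}{\partial \dot{z}}\dot{z} - L = \frac{1}{2m}\left(p_x^2 + p_y^2 + p_z^2\right)$$

and is obviously a constant of motion. But if $p_x$, $p_y$, $p_z$ are assumed to be independent
integrals of motion, then $E$ is not. The other two independent integrals of motion, though
not obvious, are any two components of the angular momentum.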


2.16.2 Cylindrical Polar Coordinate System 
In this system the Lagrangian is

$$L = T = \tfrac{1}{2} m v^2 = \tfrac{1}{2} m \left(\frac{ds}{dt}\right)^2 = \tfrac{1}{2} m (\dot{r}^2 + r^2\dot{\theta}^2 + \dot{z}^2)$$

Here, for the same free particle, only $\theta$ and $z$ are cyclic coordinates giving $p_\theta = \partial L/\partial\dot{\theta} =
m r^2 \dot{\theta}$ = angular momentum about the $z$-axis passing through the origin and $p_z = m\dot{z}$ as
two independent constants of motion. Therefore this coordinate system is not as good as
the Cartesian system for the description of a free particle motion so far as the number of
the cyclic coordinates is concerned, but the Cartesian frame did not make it obvious that
the angular momentum about the $z$-axis can be a constant of motion.


2.16.3 Spherical Polar Coordinate System 
With respect to this frame of reference the Lagrangian is

$$L = T = \tfrac{1}{2} m \left(\frac{ds}{dt}\right)^2 = \tfrac{1}{2} m (\dot{r}^2 + r^2\dot{\theta}^2 + r^2 \sin^2\theta\, \dot{\phi}^2)$$

Here, only $\phi$ is cyclic giving $p_\phi = m r^2 \sin^2\theta\, \dot{\phi}$ = constant, which is the angular momentum
about the $z$-axis. Other constants of motion are hidden.

One can write down the equations of motion in each case and solve them. Obviously,
these will be the parametric equations of straight lines in different coordinate systems, with
$t$ as the parameter.
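
The statement about cyclic coordinates can be made concrete with a short numerical sketch. It takes a free particle moving on a straight line (initial position and velocity chosen arbitrarily for the example), converts the position to cylindrical polar coordinates, and estimates $p_r = m\dot{r}$, $p_\theta = m r^2\dot{\theta}$ and $p_z = m\dot{z}$ by central differences. The step size `h` and all names are assumptions of this sketch.

```python
import math

# Sketch: generalised momenta of a free particle in cylindrical coordinates.

def cartesian_state(r0, v0, t):
    """Straight-line motion r(t) = r0 + v0 t."""
    return [r0[c] + v0[c] * t for c in range(3)]

def cylindrical(pos):
    """Convert (x, y, z) to (r, theta, z)."""
    x, y, z = pos
    return math.hypot(x, y), math.atan2(y, x), z

def generalised_momenta(m, r0, v0, t, h=1e-5):
    """Return (p_r, p_theta, p_z), with velocities from central differences."""
    r_p, th_p, z_p = cylindrical(cartesian_state(r0, v0, t + h))
    r_m, th_m, z_m = cylindrical(cartesian_state(r0, v0, t - h))
    r, _, _ = cylindrical(cartesian_state(r0, v0, t))
    r_dot = (r_p - r_m) / (2 * h)
    th_dot = (th_p - th_m) / (2 * h)
    z_dot = (z_p - z_m) / (2 * h)
    return m * r_dot, m * r * r * th_dot, m * z_dot

if __name__ == "__main__":
    m, r0, v0 = 2.0, (1.0, -1.0, 0.0), (0.5, 1.0, 0.3)
    for t in (0.0, 0.5, 1.0, 1.5):
        print(t, generalised_momenta(m, r0, v0, t))
```

Along the trajectory $p_\theta$ and $p_z$ stay fixed while $p_r$ changes, in line with $\theta$ and $z$ being cyclic and $r$ not. The chosen path keeps $x > 0$, so the angle returned by `atan2` does not jump across its branch cut.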

Throughout the book, we shall come across the expressions for Lagrangians for a number 
of physical situations. So we refrain from working out examples in this chapter. The 
problems suggested at the end of the chapter can be worked out with the help of the hints 
given at the end of the book. 


2.17 LAGRANGE'S EQUATIONS OF MOTION FOR NONHOLONOMIC SYSTEMS


We have noted in section 2.3 that for a nonholonomic bilateral system having $k'$ nonholonomic
constraint relations and $n$ degrees of freedom, the number of quasi-generalised
coordinates required is $n + k'$, where $n = 3N - k - k'$, $N$ being the number of
particles and $k$ being the number of holonomic constraints.

If all the nonholonomic constraints are given by

$$g_i(q_1, \ldots, q_{n+k'}, \dot{q}_1, \ldots, \dot{q}_{n+k'}, t) = 0 \tag{2.73}$$

where $i = 1, \ldots, k'$, following the argument given in section 1.5, the generalised constraint
forces arising out of Eq. (2.73) would be given by

$$(Q_j)_{\text{constraint}} = \sum_{i=1}^{k'} \lambda_i \frac{\partial g_i}{\partial \dot{q}_j} \tag{2.74}$$

where $\lambda_i$ are the Lagrange multipliers. Equations ((2.14) or (2.28)) have so far been the
most general form of Euler-Lagrange equations of motion for a holonomic system having
nonpotential forces $Q'_j(q, \dot{q}, t)$. Now Eqs ((2.14) or (2.28)) can further be generalised to
include any nonholonomic system experiencing forces of both the potential and nonpotential
types to take the form,

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j} = Q'_j + (Q_j)_{\text{constraint}} = Q'_j + \sum_{i=1}^{k'} \lambda_i \frac{\partial g_i}{\partial \dot{q}_j} \tag{2.75}$$

Here $L$ is as usual defined to be either $T - V$ or $T - U$ depending on the nature of the
potential forces with

$$T = \frac{1}{2} \sum_{i=1}^{3N} m_i \dot{x}_i^2$$

$$x_i = x_i(q_1, \ldots, q_{3N-k}, t) \quad \text{for } i = 1, \ldots, 3N$$

$$V = V(q_1, \ldots, q_{3N-k}, t) \quad \text{or}$$

$$U = U(q_1, \ldots, q_{3N-k}, \dot{q}_1, \ldots, \dot{q}_{3N-k}, t) \quad \text{and}$$

$$g_i = 0 \quad \text{for } i = 1, \ldots, k' \text{ as given by Eq. (2.73)}$$

Equation (2.75) is valid for any type of nonholonomic constraints. But we know that certain
classes of nonholonomic constraint forces do not do any virtual work, provided the constraints are
expressible as homogeneous functions of velocities except for an additive arbitrary function
of coordinates and time (see section 1.7). For such simple nonholonomic D'Alembertian
systems, Eq. (2.75) takes the following form:

$$\sum_j \left[\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j} - Q'_j\right] \delta q_j = 0 \tag{2.76}$$

where not all $\delta q_j$'s but only $n$ of them can be arbitrary. This is of course the most general
form of D'Alembert's principle expressed in terms of the quasi-generalised coordinates.
Obviously, in absence of any nonpotential generalised forces, $\delta W = 0$. This extension of
Lagrange's formalism to nonholonomic systems was first done by Ferrers in 1871.

A worked out example of a case of the nonholonomic constraints:

The motion of a village cart wheel that is rolling on an incline without slipping.

Let us assume that the wheels of the cart have radius $b$, the separation between the wheels
is $a$, the angle of inclination of the incline is $\alpha$, the incline runs down along the $-y$ axis, and the $x$ axis
is horizontal.

Now the coordinates to describe the motion are as follows:

$(x, y)$ = the rectangular Cartesian coordinates for the location of the centre of mass of
the entire cart wheel system projected on the plane of the incline,

$\theta$ = the angle between the axle and the $y$ axis, and
$\phi_1, \phi_2$ = angles of rotation in the planes of the wheels.

So these constitute five coordinates.

Then come the constraints: The vector condition for no slipping gives rise to the following
constraints

(i) an integrable or holonomic constraint given by $a\,d\theta = b(d\phi_1 + d\phi_2)$, or

$$\theta = \frac{b}{a}(\phi_1 + \phi_2)$$

(ii) the differential displacement $ds$ of the centre of mass has a direction always perpendicular
to the axle, or

$$ds = \frac{b}{2}(d\phi_1 - d\phi_2)$$

This second constraint is not integrable as $ds$ is not a coordinate. Actually, this constraint
is equivalent to two nonholonomic constraints, given by

$$dx = \frac{b}{2}(d\phi_1 - d\phi_2)\cos\theta \quad \text{and} \quad dy = \frac{b}{2}(d\phi_1 - d\phi_2)\sin\theta$$

So there are three constraints in five coordinates, giving the number of DOF = 2. But
because of 2 nonholonomic constraints the minimum number of quasi-generalised coordinates
required is 4.

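The gravitational force on the system is $F_y = -Mg\sin\alpha$, $M$ being the total mass of
the system, and the potential energy is $Mgy\sin\alpha$. The kinetic energy is

$$T = \tfrac{1}{2} M (\dot{x}^2 + \dot{y}^2) + \tfrac{1}{2} I_1 \dot{\theta}^2 + \tfrac{1}{2} I_c (\dot{\phi}_1^2 + \dot{\phi}_2^2)$$

where $I_1 = I_a + 2I_b + ma^2/2$, $I_c = mb^2/2$, $m$ being the mass of individual wheels.

The constraint relations are found to satisfy $a^2\dot{\theta}^2 + 4\dot{s}^2 = 2b^2(\dot{\phi}_1^2 + \dot{\phi}_2^2)$, with $\dot{s}^2 = \dot{x}^2 + \dot{y}^2$; thus
eliminating $\dot{\phi}_1^2 + \dot{\phi}_2^2$, we have,

$$T = \tfrac{1}{2} \mu (\dot{x}^2 + \dot{y}^2) + \tfrac{1}{2} \beta \dot{\theta}^2$$

where $\mu = M + 2I_c/b^2$, $\beta = I_1 + a^2 I_c/2b^2$, and hence, using $\dot{\theta} = (b/a)(\dot{\phi}_1 + \dot{\phi}_2)$, the Lagrangian

$$L = \tfrac{1}{2} \mu (\dot{x}^2 + \dot{y}^2) + \frac{\beta b^2}{2a^2} (\dot{\phi}_1 + \dot{\phi}_2)^2 - Mgy\sin\alpha$$

now being describable in terms of 4 quasi-generalised coordinates $x, y, \phi_1, \phi_2$.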

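Writing Lagrange's equation in the form of D'Alembert's principle in quasi-generalised
coordinates,

$$\sum_j \left[\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j}\right] \delta q_j = 0$$

or, on rewriting,

$$\mu\ddot{x}\,\delta x + (\mu\ddot{y} + Mg\sin\alpha)\,\delta y + \frac{\beta b^2}{a^2} (\ddot{\phi}_1 + \ddot{\phi}_2)(\delta\phi_1 + \delta\phi_2) = 0$$

Now if we eliminate $\delta x$ and $\delta y$ using the nonholonomic constraint relations, $\delta\phi_1$ and $\delta\phi_2$
become arbitrary, and hence their coefficients vanish, giving

$$\dot{\phi}_1 + \dot{\phi}_2 = \frac{a}{b}\dot{\theta} = \text{const.}, \quad \text{with } \dot{\theta} = \omega \text{ (say)}$$

and

$$\ddot{s} = -\frac{Mg}{\mu} \sin\alpha \sin\theta$$

where $\dot{x} = \dot{s}\cos\theta$ and $\dot{y} = \dot{s}\sin\theta$ are used. The second equation has a solution

$$\dot{s} = \frac{Mg}{\mu\omega} \sin\alpha \cos\theta + v_o$$

$v_o$ being the constant of integration, implying the speed of the centre of mass when the
axle is moving parallel to the $x$ axis. Thus the solutions for $\dot{x}$ and $\dot{y}$ are

$$\dot{x} = \dot{s}\cos\theta = \frac{Mg\sin\alpha}{2\mu\omega}(1 + \cos 2\theta) + v_o\cos\theta$$

and

$$\dot{y} = \dot{s}\sin\theta = \frac{Mg\sin\alpha}{2\mu\omega}\sin 2\theta + v_o\sin\theta$$

So the motion in $y$ is purely oscillatory, but that in $x$ has a constant time-averaged velocity,
$Mg\sin\alpha/2\mu\omega$. Therefore, the cartwheel will be drifting horizontally but oscillating along
the slope of the incline. Recall that $\theta = \omega t$ + constant.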


The general solution for $x$ and $y$ (taking $v_o = 0$) now becomes

$$x = \frac{Mg\sin\alpha}{4\mu\omega^2}(2\theta + \sin 2\theta) + x_o$$

and

$$y = -\frac{Mg\sin\alpha}{4\mu\omega^2}\cos 2\theta + y_o$$


These equations describe a cycloid with cusps pointing along the $+y$ axis, the total vertical
extent (from trough to cusp) being $Mg\sin\alpha/2\mu\omega^2$ and the horizontal separation between the consecutive cusps
being $\pi$ times this vertical extent.


Now if a = 0, that is, the cartwheel is moving in a horizontal plane instead of an incline 
without allowing its wheels to slip, the centre of mass will execute a purely circular motion. 
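
The cart-wheel solution lends itself to a quick numerical cross-check. The sketch below assumes the case $v_o = 0$ and writes $A = Mg\sin\alpha/4\mu\omega^2$, so that $x = A(2\theta + \sin 2\theta)$, $y = -A\cos 2\theta$ and $\dot{s} = 4A\omega\cos\theta$ with $\theta = \omega t$; the sign of the $y$ expression is the one obtained by integrating $\dot{y}$. All parameter values and function names are illustrative.

```python
import math

# Sketch: cycloidal path of the cart's centre of mass for v_o = 0.

def amplitude(M, g, alpha, mu, omega):
    """A = M g sin(alpha) / (4 mu omega^2)."""
    return M * g * math.sin(alpha) / (4.0 * mu * omega ** 2)

def position(theta, A, x0=0.0, y0=0.0):
    """(x, y) on the incline, parametrised by theta = omega t."""
    return (A * (2 * theta + math.sin(2 * theta)) + x0,
            -A * math.cos(2 * theta) + y0)

def velocity(theta, A, omega):
    """(xdot, ydot) = s_dot (cos theta, sin theta), s_dot = 4 A omega cos theta."""
    s_dot = 4.0 * A * omega * math.cos(theta)
    return s_dot * math.cos(theta), s_dot * math.sin(theta)

if __name__ == "__main__":
    omega = 2.0
    A = amplitude(M=10.0, g=9.8, alpha=0.3, mu=12.0, omega=omega)
    for theta in (0.0, math.pi / 4, math.pi / 2, math.pi):
        print(theta, position(theta, A), velocity(theta, A, omega))
    # horizontal distance between consecutive cusps (theta = pi/2, 3 pi/2)
    print(position(1.5 * math.pi, A)[0] - position(0.5 * math.pi, A)[0])
```

The velocity always satisfies the rolling condition $\dot{x}\sin\theta = \dot{y}\cos\theta$ and vanishes at $\theta = \pi/2$, where the path has a cusp at its highest point. The cusp spacing comes out as $2\pi A$, that is, $\pi$ times the full vertical extent $2A$.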


2.18 LAGRANGE’S EQUATIONS OF MOTION FOR IMPULSIVE FORCES 

Consider a holonomic system containing $N$ particles, described by $n$ generalised coordinates
$q_1, \ldots, q_n$. Let a large external force $\mathbf{F}(t)$ act on this system for a very short time, say
between $t$ and $t + \Delta t$. $\mathbf{F}(t)$ may also vary rapidly within the time interval $\Delta t$ over which
it acts. This situation is realised during collisions of material bodies. Instead of dealing
with the impulsive force $\mathbf{F}(t)$, whose variation with time during $\Delta t$ is generally unknown, it
is advantageous to deal with a quantity called the impulse of $\mathbf{F}(t)$, defined as

$$\mathbf{P}(t, t + \Delta t) = \int_t^{t + \Delta t} \mathbf{F}(t')\, dt' \tag{2.77}$$

Since the duration of impact is very short we can assume displacements to remain unchanged
during the impact whereas the velocities can be assumed to change almost instantaneously.
This is because, in response to finite changes in velocities, displacements take a
finite time to develop. This means that the following limit exists

$$\mathbf{P}(t) = \lim \mathbf{P}(t, t + \Delta t) \tag{2.78}$$

where

lim = limit as $\Delta t \to 0$, $|\mathbf{F}(t)| \to \infty$ and $\mathbf{P}(t, t + \Delta t)$ is held constant.

The quantity $\mathbf{P}(t)$ is called the instantaneous impulse (or impulse for short) and has the
dimension [F] [T].
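
The limiting process can be illustrated with the simplest possible model. The sketch below assumes a particle of mass $m$, initially at rest, acted on by a constant force of magnitude $P/\Delta t$ for a time $\Delta t$, so that the impulse $P$ is held fixed while $\Delta t$ shrinks. The rectangular force profile and the function name are assumptions made only for this example.

```python
# Sketch: fixed impulse delivered over shorter and shorter times.

def response_to_pulse(m, impulse, dt):
    """Constant force impulse/dt acting for dt on a particle at rest.

    Returns (force, change in velocity, displacement during the pulse).
    """
    force = impulse / dt
    dv = force * dt / m                 # equals impulse / m for every dt
    dx = 0.5 * (force / m) * dt ** 2    # equals (impulse / m) dt / 2
    return force, dv, dx

if __name__ == "__main__":
    for dt in (1e-1, 1e-3, 1e-6):
        print(dt, response_to_pulse(m=2.0, impulse=4.0, dt=dt))
```

As $\Delta t \to 0$ the force grows without bound, the velocity change stays at $P/m$, and the displacement acquired during the pulse tends to zero, which is the behaviour assumed in the limit of Eq. (2.78).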

The formulation of the above problem in terms of generalised coordinates can be done as
follows. Let the holonomic system mentioned above be acted upon by a force between the
time $t$ and $t + \Delta t$ and let $Q_i$ be the generalised component of the force corresponding to
the generalised coordinate $q_i$. Then the generalised component of the impulse corresponding
to the generalised coordinate $q_i$ applied to the system between $t$ and $t + \Delta t$ is defined as

$$\hat{Q}_i(t, t + \Delta t) = \int_t^{t + \Delta t} Q_i(t')\, dt' \tag{2.79}$$

Again we define an instantaneous impulse associated with $\hat{Q}_i(t, t + \Delta t)$ as

$$\hat{Q}_i(t) = \lim \hat{Q}_i(t, t + \Delta t) \tag{2.80}$$

where

$$\lim = \text{limit as } \Delta t \to 0,\ |Q_i(t)| \to \infty \text{ and } \hat{Q}_i(t, t + \Delta t) \text{ is held constant} \tag{2.81}$$


We can now modify Euler-Lagrange's equations to allow for the impulsive forces. Euler-Lagrange's
equations of motion for a holonomic system can be written as

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_i}\right) - \frac{\partial T}{\partial q_i} = Q_i \tag{2.82}$$

If we integrate Eq. (2.82) from $t$ to $t + \Delta t$ and take the limit defined in Eq. (2.81) we get

$$\lim \int_t^{t + \Delta t} \frac{d}{dt'}\left(\frac{\partial T}{\partial \dot{q}_i}\right) dt' - \lim \int_t^{t + \Delta t} \frac{\partial T}{\partial q_i}\, dt' = \lim \int_t^{t + \Delta t} Q_i\, dt'$$

The first term can be reduced to

$$\lim \int_t^{t + \Delta t} d\left(\frac{\partial T}{\partial \dot{q}_i}\right) = \lim \left[\frac{\partial T}{\partial \dot{q}_i}\right]_t^{t + \Delta t} = \Delta p_i \tag{2.83}$$

where $p_i = \partial T/\partial \dot{q}_i$ is the generalised momentum corresponding to the generalised coordinate
$q_i$.

Now the integrand in the second term in Eq. (2.82) remains finite during the impulse,
therefore

$$\lim \int_t^{t + \Delta t} \frac{\partial T}{\partial q_i}\, dt' = 0$$

and from Eq. (2.80) the term on the RHS is just the instantaneous impulse $\hat{Q}_i(t)$, or $\hat{Q}_i$ for
brevity. Thus Eq. (2.82) reduces to

$$\Delta p_i = \hat{Q}_i \tag{2.84}$$

Equation (2.84) states that the incremental change in the generalised momentum is equal
to the generalised impulse.

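The value of the generalised impulse can be obtained from the expression for virtual work
done by the impulsive forces. Let $\hat{\mathbf{F}}_1, \ldots, \hat{\mathbf{F}}_N$ be the impulsive forces on the system of $N$
particles. Then

$$\sum_{k=1}^{n} \hat{Q}_k\, \delta q_k = \sum_{j=1}^{N} \hat{\mathbf{F}}_j \cdot \delta\mathbf{r}_j = \sum_{j=1}^{N} \hat{\mathbf{F}}_j \cdot \sum_{k=1}^{n} \frac{\partial \mathbf{r}_j}{\partial q_k}\, \delta q_k$$

where $n$ is the number of degrees of freedom. Comparing term by term we get

$$\hat{Q}_k = \sum_{j=1}^{N} \hat{\mathbf{F}}_j \cdot \frac{\partial \mathbf{r}_j}{\partial q_k} \tag{2.85}$$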

In the above formulation we have assumed that the constraint forces are not impulsive. 

Finally we note that systems subjected to impulsive forces are not generally conservative 
since energy is dissipated during impact of bodies. 

Example 

Consider a double pendulum in which the masses $m_1$ and $m_2$ are connected by massless
rigid links of lengths $l_1$ and $l_2$ respectively. We obtain Euler-Lagrange's equation of motion
for the case in which a source of an impulsive force $P$ strikes horizontally at a distance $d$
from the support when the links are at rest in the vertical position. Consider the case in
which $l_1 < d < l_1 + l_2$.

The value of the kinetic energy corresponding to the vertical position is

$$T = \tfrac{1}{2} m_1 (l_1\dot{\theta}_1)^2 + \tfrac{1}{2} m_2 (l_1\dot{\theta}_1 + l_2\dot{\theta}_2)^2 \tag{2.86}$$

Denoting by $\hat{Q}_1$ and $\hat{Q}_2$ the impulsive forces associated with the generalised coordinates
$\theta_1$ and $\theta_2$, we can write the expression for the virtual work

$$P\,\delta[l_1\theta_1 + (d - l_1)\theta_2] = P[l_1\delta\theta_1 + (d - l_1)\delta\theta_2] = \hat{Q}_1\delta\theta_1 + \hat{Q}_2\delta\theta_2 \tag{2.87}$$

which gives

$$\hat{Q}_1 = P l_1 \quad \text{and} \quad \hat{Q}_2 = P(d - l_1) \tag{2.88}$$

and we note that $\hat{Q}_1$, $\hat{Q}_2$ are impulsive moments rather than forces. Since the momentum
before the application of $P$ is zero, we have

$$\Delta p_i = p_i = \frac{\partial T}{\partial \dot{\theta}_i}, \quad i = 1, 2 \tag{2.89}$$

so that using Eq. (2.84) in conjunction with Eqs (2.86) and (2.88) we obtain

$$\begin{aligned}
m_1 l_1^2 \dot{\theta}_1 + m_2 l_1 (l_1\dot{\theta}_1 + l_2\dot{\theta}_2) &= P l_1 \\
m_2 l_2 (l_1\dot{\theta}_1 + l_2\dot{\theta}_2) &= P(d - l_1)
\end{aligned} \tag{2.90}$$

which are the desired equations.
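
Equations (2.90) are two linear equations for the angular velocities just after the blow, so they can be solved directly. The sketch below does this with Cramer's rule; the function name and the sample numbers are illustrative, and $P$ stands for the impulse delivered at the distance $d$ from the support.

```python
# Sketch: solve Eqs (2.90) for the angular velocities after the impulse.

def angular_velocities_after_impulse(m1, m2, l1, l2, d, P):
    """Return (theta1_dot, theta2_dot) immediately after the impulse P."""
    # Coefficients of theta1_dot and theta2_dot in the two equations
    a11 = (m1 + m2) * l1 * l1
    a12 = m2 * l1 * l2
    a21 = m2 * l1 * l2
    a22 = m2 * l2 * l2
    # Generalised impulses, Eq. (2.88)
    b1 = P * l1
    b2 = P * (d - l1)
    det = a11 * a22 - a12 * a21      # = m1 m2 l1^2 l2^2, never zero
    theta1_dot = (b1 * a22 - a12 * b2) / det
    theta2_dot = (a11 * b2 - a21 * b1) / det
    return theta1_dot, theta2_dot

if __name__ == "__main__":
    print(angular_velocities_after_impulse(
        m1=1.0, m2=2.0, l1=0.5, l2=0.8, d=0.9, P=3.0))
```

Eliminating $l_1\dot{\theta}_1 + l_2\dot{\theta}_2$ between the two equations gives $\dot{\theta}_1 = P(l_1 + l_2 - d)/(m_1 l_1 l_2)$, so the upper link is set in motion less and less as the point of impact approaches the lower mass.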


2.19 SUMMARY 

D'Alembert's principle is not of much use unless the possible displacements $\delta x_i$ are made
absolutely independent of each other. In the presence of constraints, the constraint relations
have to be satisfied by the $\delta x_i$'s, making them dependent on one another. If there are $k$
holonomic constraints and $N$ particles, $3N - k$ should be the maximum number of
possible independent coordinate displacements. $3N - k$ is called the number of degrees of
freedom of the system. Any set of $3N - k$ independent coordinates, called the generalised
coordinates for the holonomic system, were defined and used by Lagrange to break up the
single equation of D'Alembert's principle into $3N - k$ independent equations of motion.
These equations are called Lagrange's equations of motion of the second kind. At about
the same time Euler also developed variational calculus and obtained Lagrange's equations
of motion from general principles, and hence these equations are often referred to as
Euler-Lagrange's equations of motion.

The Lagrangian $L$ is a scalar point function described in a $(3N - k)$-dimensional configuration
space, but it cannot be uniquely specified because of the functional dependence of $L$
on the coordinate velocities and time. The generalised momenta $p_i$ were defined in terms
of the Lagrangian, which is either given or constructed from its definitions $L = T - V$,
or $L = T - U$, where $T$ is the kinetic energy, $V$ is the ordinary potential energy and $U$
is the generalised potential energy, as the case may be, for defining $L$.

For every particular generalised coordinate that is absent or cyclic in the expression for 
the Lagrangian, Euler-Lagrange’s equation of motion for that particular coordinate leads to 
the conservation of the generalised momentum, conjugate to the cyclic coordinate. If the 
Lagrangian is time independent, it must conserve an energy-like quantity called the Jacobi 
integral, which can be identified with the actual energy provided both the ordinary potential 
energy function and the kinetic energy function are independent of both velocity and time. 
Such systems are called conservative systems. It is shown that the generalised potential 
energy functions can in most cases be conveniently represented by a sum of ordinary potential 
energy function and a term originating due to some gyroscopic forces of antisymmetric 
nature. 

The preservation of the explicit form of the Euler-Lagrange equations of motion in terms
of the Lagrangian does not require that the Lagrangian be unique. In fact one can add a
total time derivative of any scalar point function of coordinates and time to the physical
Lagrangian and still retain the same form of the Euler-Lagrange equations of motion in
terms of the new Lagrangian. Such an invariance, duly recognised as a symmetry property
of the Lagrangian functions, is sometimes referred to as invariance under Lagrangian gauge
transformation. Such changed Lagrangians, when explicitly written out in terms of the
coordinates, do not give equations of motion differing from those written out for the
original Lagrangian. It is also shown that Euler-Lagrange's form of the equations of motion
remains invariant under any generalised coordinate transformation connecting one set to the
transformed set.

Expected symmetries of the Lagrangian for any closed system under, say, infinitesimal 
translation, rotation, shift in the origin of time or Galilean transformation between two 
moving inertial frames, by Noether’s theorem, result in the conservation of ten quantities 
— linear momentum, angular momentum, energy and centre of mass motion. 

The Lagrangian formalism is extended to incorporate dissipation of the Rayleigh type 
and further to include the nonholonomic systems by Ferrers. For nonholonomic systems, 
the generalised coordinates are not sufficient. The total number of coordinates required is 
still 3 N — k, even though an extra number of k' nonholonomic constraints are present. 
Such coordinates are called quasi-generalised coordinates, or simply quasi-coordinates. 


PROBLEMS 

2.1 Find the number of degrees of freedom of the dynamical systems described in problem 
number 1.1. What are the suitable generalised coordinates that one can select for 
these? 

2.2 During the boiling of any liquid, a phase transition takes place from its liquid to
the vapour state. Assume that during the process of boiling individual molecules
do not change their vibrational state of motion, but all their possible translational
and rotational degrees of freedom are suddenly restored. Using Boltzmann's law of
equipartition of energy, namely an increase in energy by $\tfrac{1}{2}kT_b$ per molecule per
degree of freedom, calculate the contributions to the latent heat of vaporisation of
water, liquid nitrogen and liquid helium at their respective boiling points, $T_b$ = 373
K, 77 K and 4.2 K, $k$ being the Boltzmann's constant = $1.38 \times 10^{-23}$ J/K.

2.3 Show that

(i) $\delta L = \frac{d}{dt}(p_i\,\delta q_i)$ (ii) and (iii) $\delta T + \delta V = 0$

for conservative systems.

2.4 Show that Appell's equations of motion are valid for generalised coordinates, that is,
they can take the form

$$\frac{\partial S}{\partial \ddot{q}_i} = Q_i = F_j \frac{\partial x_j}{\partial q_i} \quad \text{where } S = \frac{1}{2}\sum_k m_k (\ddot{x}_k)^2 \text{ is the energy of acceleration of the system.}$$

Consider a bead of mass $m$ sliding under gravity on a uniform smooth circular wire of
mass $M$ and radius $r_o$, which rolls on a horizontal plane keeping its plane always vertical.
Deduce Appell's equations of motion for the system and confirm that they are none
other than the Euler-Lagrange equations of motion.




2.5 Consider in detail the transformation $x = r\cos\theta$ and $y = r\sin\theta$. $F_x$ and
$F_y$ are the Cartesian components of the external force $\mathbf{F}$. Derive the components of
the generalised forces $Q_r$ and $Q_\theta$ in terms of the generalised coordinates $r$ and $\theta$,
and compare with the force components $F_r$ and $F_\theta$, the latter being the radial and
transverse components of $\mathbf{F}$. Show that $F_r = Q_r$, but $F_\theta \neq Q_\theta$. Explain why this
is so. Using the spherical polar coordinates, interpret the meanings of $Q_\theta$ and $Q_\phi$.

2.6 Find the generalised forces for the generalised coordinates describing the small amplitude
oscillation of a double pendulum. Use D'Alembert's principle to find the
equations of motion.

2.7 Construct the Lagrangians for the following dynamical systems and find the first integrals
for all the cyclic coordinates:

(i) A system of two particles having masses $m_1$ and $m_2$, connected by an inextensible,
massless string of length $l$ passing through a small hole in a horizontal table.

(ii) A block of mass $M$ constrained to slide along a smooth horizontal bar, with another
mass $m$ ($< M$) connected to $M$ by a massless, stretchless, flexible string of
length $l$. The second mass can swing freely in any direction.

(iii) A rigid and smooth circular wire of radius $R$ is constrained to rotate in its plane
(horizontal) about a fixed point on the wire with constant angular speed $\omega$. Consider
the motion of a bead of mass $m$ sliding freely on the wire.

(iv) A disc of radius $R$ rolling on a perfectly rough horizontal plane and constrained
to remain vertical. This is a nonholonomic case. Take it as a separate problem and
solve it.

(v) A charged particle moving under the Lorentz force law given by $\mathbf{f} = e\mathbf{E} + e\mathbf{v} \times \mathbf{B}$,
where $\mathbf{E} = -\nabla\phi - \partial\mathbf{A}/\partial t$ and $\mathbf{B} = \nabla \times \mathbf{A}$ are the electric field and magnetic
induction respectively, $\mathbf{A}(\mathbf{r}, t)$ and $\phi(\mathbf{r}, t)$ being the vector and scalar electromagnetic
potentials.

2.8 A flexible tape of length L and thickness k is tightly wound and is then allowed 
to unwind as it rolls down on an incline that makes an angle a with the horizontal. 
Form the Lagrangian, determine the energy integral of the system. Solve the equation 
of motion to find the time to completely unwind the tape. 

2.9 A pendulum bob of radius r is rolling on a circular track of radius R ( > r). 
Construct the Lagrangian, derive the equation of motion and compare its period of 
oscillation (of small amplitude only) with that of a simple pendulum of string length 
R - r. 

2.10 Assume the Lagrangian for a relativistic pendulum motion to be $L(x, v) = m_o c^2\{(1 -
v^2/c^2)^{-1/2} - 1\} - kx^2/2$, $k$ being the spring constant. How is its period modified
for a semi-relativistic speed $v \sim 0.1c$?

2.11 We know that the energy-like Jacobi integral $\sum_i \dot{q}_i(\partial L/\partial \dot{q}_i) - L$ exists whenever
$t$ does not explicitly occur in $L$. Show that the following rheonomic system is nonconservative
although it still has a Jacobi integral. A bead of unit mass is moving
under gravity on a smooth rigid circular wire of radius $r_o$, the wire being driven with
a steady angular speed $\omega$ about a vertical diameter. Show that in this case the Jacobi
integral is not equal to $T + V$. Prove that only for conservative systems does the
Jacobi integral equal $T + V$.

2.12 Sometimes a rheonomic system can be conservative. Consider the following case.
Two particles are connected by a rigid massless rod of length $l$ which rotates in a
horizontal plane with a constant angular velocity $\omega$. Knife-edge supports at the two
particles prevent either particle from having a velocity component along the rod, but
the particles can slide without friction in a direction perpendicular to the rod. Find
the equations of motion. Solve for $x$ and $y$, the coordinates of the centre of mass,
and the constraint force as functions of time, if the centre of mass is initially at the
origin and has a velocity $v_o$ in the positive $y$-direction. Show that the system is
conservative, even though it is rheonomic. Find the Jacobi integral.


2.13 Find the equation of motion corresponding to the Lagrangian 


L(x,x) = e ** ** + 2x J e 0,3 daj 

Find the energy integral for the system. Construct another Lagrangian which can 
give rise to the same equation of motion. 


2.14 A rough and heavy horizontal disc rotates with a constant angular velocity $\omega$ about
a stationary vertical axis passing through its centre. A spherical ball of mass $m$ and
radius $R$ is let loose on the rotating disc and the ball starts rolling on the disc without
slipping. Find the trajectory of the ball with respect to the outside fixed frame of
reference.


2.15 A child's swing of variable string length $l(t)$, the length being manipulated by pulling
or releasing it through the hinge point, is oscillating in a vertical plane. Using the
Lagrangian method, make a study of this non-conservative system. Under what circumstances
may the energy of the system accumulate with time?

2.16 Show that the total number of independent integrals of motion ($I$) for a closed system
embedded in an $n$-dimensional Euclidean space is given by $I = (n^2 + 3n + 2)/2$.
How many of these are additive in nature? Now for 3-D Euclidean space, $n = 3$
gives $I = 10$. Even if we consider the (3 + 1)-dimensional space time continuum
from the point of view of special theory of relativity, justify that $I$ should still be 10
only. How many more integrals of motion are needed for a closed system of 3 particles
interacting with one another gravitationally or for that of a freely moving rigid body?

2.17 Show that the following are the integrals of motion:

(i) $\mathbf{A} = \mathbf{v} \times \mathbf{L} - K\mathbf{r}/r$ for the motion of a particle in the potential $V = -K/r$, $\mathbf{L}$
being the angular momentum $= \mathbf{r} \times \mathbf{p}$, and $K$ = const.

(ii) $\mathbf{L} \cdot \mathbf{B} + e|\mathbf{r} \times \mathbf{B}|^2/2$ for a charged particle moving in a uniform field of magnetic
induction $\mathbf{B}$.

(iii) $\mathbf{L} \cdot \hat{\mathbf{k}} + eM/\sqrt{r^2 - (\mathbf{r} \cdot \hat{\mathbf{k}})^2}$ for the motion of a charged particle in the field of
a magnetic dipole having a constant moment $\boldsymbol{\mu} = M\hat{\mathbf{k}}$, which can be produced by
a vector potential $\mathbf{A} = M(\hat{\mathbf{k}} \times \mathbf{r})/r^3$.

(iv) $\mathbf{F} \cdot (\mathbf{v} \times \mathbf{L}) - K(\mathbf{F} \cdot \mathbf{r})/r + (\mathbf{F} \times \mathbf{r})^2/2$ for a particle moving in the combined
fields due to a uniform force field $\mathbf{F}$ and a Newtonian field, given by the potential
$V(r) = -K/r - \mathbf{F} \cdot \mathbf{r}$.

2.18 How do the energy and momenta change under

(i) the Galilean transformation between two inertial frames of reference which have a
constant relative velocity $\mathbf{V}$, that is, $\mathbf{r} = \mathbf{r}' + \mathbf{V}t$,

(ii) a rotating coordinate transformation given by

$x = x'\cos\omega t - y'\sin\omega t$, $y = x'\sin\omega t + y'\cos\omega t$ and $z = z'$
where $\omega$ is a constant,

(iii) translation with constant acceleration given by $\mathbf{r} = \mathbf{r}' + \mathbf{u}_o t + \mathbf{g}t^2/2$, $\mathbf{u}_o$
and $\mathbf{g}$ being constant vectors. Could all such changes be accommodated by some
suitable Lagrangian gauge terms $dF/dt$?





3 

Rotating Frames of Reference


3.0 INTRODUCTION 

We have seen earlier that the equations of motion due to Newton, D’Alembert, or Lagrange 
are valid only if the forces or the Lagrangian is formulated in some inertial frame of reference. 
However, many systems in nature are found to be naturally rotating or accelerating and 
it may be more convenient to use the coordinates that directly refer to such noninertial 
frames. For example, the earth is rotating about an axis passing through its geographical 
north pole. So any reference frame that is firmly attached to earth is a rotating (noninertial) 
frame. Therefore, it is necessary to develop methods for writing down the Lagrangian or 
the Newtonian equations of motion using the coordinates of a rotating frame. Taking the 
rotating frames as an example, we have in this chapter discussed in detail the consequences of
changing over from an inertial to a noninertial frame. The transformation laws for rotating
frames were given by a French engineer Gustave Gaspard de Coriolis (1792 - 1843) in his
book entitled Traité de la mécanique des corps solides published in 1831. It was Coriolis
who changed Leibniz's definition of vis viva, $mv^2$, to $\tfrac{1}{2}mv^2$, which is today's kinetic energy.


3.1 INERTIAL FORCES IN THE ROTATING FRAME 

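Let there be a fixed frame $S$ with $\mathbf{i}, \mathbf{j}, \mathbf{k}$ as the fixed unit vectors forming a rectangular
Cartesian triad, and let the frame $S'$ with its triad $(\mathbf{i}', \mathbf{j}', \mathbf{k}')$, originally coincident with $S$, be
rotating with angular velocity $\boldsymbol{\omega}\ (= \omega\mathbf{n})$ about their common origin $O$ (see Fig. 3.1). Given any
vector $\mathbf{G}$, how do we see it and its time derivative in the two frames? Physically, the vector
$\mathbf{G}$ is the same in both the frames so that

$$\mathbf{G}|_{\text{fixed}} = \mathbf{G}|_{\text{rot}} \tag{3.1}$$

But the components of $\mathbf{G}$ in $S$ and $S'$ are different because the unit vectors in the two
frames point differently. Hence,

$$\mathbf{G}|_{\text{fixed}} = G_1\mathbf{i} + G_2\mathbf{j} + G_3\mathbf{k} = G'_1\mathbf{i}' + G'_2\mathbf{j}' + G'_3\mathbf{k}' = \mathbf{G}|_{\text{rot}}$$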





Fig. 3.1 Cartesian axes of an inertial ($S$) and a rotating ($S'$) frame of
reference, $S'$ rotating about their common origin $O$ with an instantaneous
angular velocity of rotation $\boldsymbol{\omega}$ with respect to $S$


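The time derivatives of $\mathbf{G}|_{\text{fixed}}$ and $\mathbf{G}|_{\text{rot}}$ must also be equal, because they are also one and
the same physical vector. Thus,

$$\frac{d}{dt}\left(\mathbf{G}|_{\text{fixed}}\right) = \frac{d}{dt}\left(\mathbf{G}|_{\text{rot}}\right)$$

One usually denotes $d(\mathbf{G}|_{\text{fixed}})/dt$ by $[d\mathbf{G}/dt]_{\text{fixed}}$,
because this is the rate of variation of $\mathbf{G}$ as measured by an observer in the fixed frame.
Since the unit vectors $\mathbf{i}, \mathbf{j}, \mathbf{k}$ of the fixed frame $S$ do not change with time, we can write

$$\left[\frac{d\mathbf{G}}{dt}\right]_{\text{fixed}} = \frac{dG_1}{dt}\mathbf{i} + \frac{dG_2}{dt}\mathbf{j} + \frac{dG_3}{dt}\mathbf{k}$$

which must be equal to $d(\mathbf{G}|_{\text{rot}})/dt$ given by

$$\frac{d}{dt}\left(\mathbf{G}|_{\text{rot}}\right) = \frac{d}{dt}\left(G'_1\mathbf{i}' + G'_2\mathbf{j}' + G'_3\mathbf{k}'\right)
= \frac{dG'_1}{dt}\mathbf{i}' + \frac{dG'_2}{dt}\mathbf{j}' + \frac{dG'_3}{dt}\mathbf{k}' + G'_1\frac{d\mathbf{i}'}{dt} + G'_2\frac{d\mathbf{j}'}{dt} + G'_3\frac{d\mathbf{k}'}{dt}$$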




One denotes the first three terms by $[d\mathbf{G}/dt]_{\text{rot}}$ because this is precisely the time derivative
of $\mathbf{G}$ as measured by an observer in the rotating frame $S'$. We are defining all these concepts
explicitly for reasons of clarity.


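Now using Eq. (2.67) for any arbitrary vector $\mathbf{A}$, we have

$$\frac{\Delta\mathbf{A}}{\Delta t} = \frac{\Delta\theta}{\Delta t}\,\mathbf{n}\times\mathbf{A}$$

Taking the limit as $\Delta t \to 0$,

$$\frac{d\mathbf{A}}{dt} = \omega\,\mathbf{n}\times\mathbf{A} = \boldsymbol{\omega}\times\mathbf{A}$$

Applying this to $\mathbf{i}', \mathbf{j}', \mathbf{k}'$ we get

$$\frac{d\mathbf{i}'}{dt} = \boldsymbol{\omega}\times\mathbf{i}', \qquad \frac{d\mathbf{j}'}{dt} = \boldsymbol{\omega}\times\mathbf{j}', \qquad \frac{d\mathbf{k}'}{dt} = \boldsymbol{\omega}\times\mathbf{k}' \tag{3.2}$$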


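Using these in the expression for $[d\mathbf{G}/dt]_{\text{fixed}}$ we get

$$\left[\frac{d\mathbf{G}}{dt}\right]_{\text{fixed}} = \left[\frac{d\mathbf{G}}{dt}\right]_{\text{rot}} + \boldsymbol{\omega}\times\left(G'_1\mathbf{i}' + G'_2\mathbf{j}' + G'_3\mathbf{k}'\right)$$

or

$$\left[\frac{d\mathbf{G}}{dt}\right]_{\text{fixed}} = \left[\frac{d\mathbf{G}}{dt}\right]_{\text{rot}} + \boldsymbol{\omega}\times\mathbf{G} \tag{3.3}$$

Thus we have an operator identity

$$\left[\frac{d}{dt}\right]_{\text{fixed}} = \left[\frac{d}{dt}\right]_{\text{rot}} + \boldsymbol{\omega}\times \tag{3.4}$$

where the meanings of the brackets and suffixes are as stated above. This result is valid for
any arbitrary vector $\mathbf{G}$ and any arbitrary $\boldsymbol{\omega}$ passing through the common origin.

Putting $\mathbf{G} = \mathbf{r}$, the position vector of a particle, we get

$$\left[\frac{d\mathbf{r}}{dt}\right]_{\text{fixed}} = \left[\frac{d\mathbf{r}}{dt}\right]_{\text{rot}} + \boldsymbol{\omega}\times\mathbf{r}$$

or

$$[\mathbf{v}]_{\text{fixed}} = [\mathbf{v}]_{\text{rot}} + \boldsymbol{\omega}\times\mathbf{r}$$

Denoting $[\mathbf{v}]_{\text{fixed}} = \mathbf{v}_0$ and $[\mathbf{v}]_{\text{rot}} = \mathbf{v}$ we write

$$\mathbf{v}_0 = \mathbf{v} + \boldsymbol{\omega}\times\mathbf{r} \tag{3.5}$$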




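Again, putting $\mathbf{G} = \mathbf{v}_0$ in Eq. (3.3),

$$\left[\frac{d\mathbf{v}_0}{dt}\right]_{\text{fixed}} = \left[\frac{d\mathbf{v}_0}{dt}\right]_{\text{rot}} + \boldsymbol{\omega}\times\mathbf{v}_0
= \frac{d}{dt}(\mathbf{v} + \boldsymbol{\omega}\times\mathbf{r}) + \boldsymbol{\omega}\times(\mathbf{v} + \boldsymbol{\omega}\times\mathbf{r})
= \frac{d\mathbf{v}}{dt} + \dot{\boldsymbol{\omega}}\times\mathbf{r} + 2\boldsymbol{\omega}\times\mathbf{v} + \boldsymbol{\omega}\times(\boldsymbol{\omega}\times\mathbf{r})$$

where $d\mathbf{v}/dt$ is the acceleration as measured in the $S'$ frame. If we are to consider the motion
of a particle of mass $m$, its equations of motion from the point of view of an observer who
is at rest with respect to $S'$ can be expressed as

$$m\frac{d\mathbf{v}}{dt} = m\frac{d\mathbf{v}_0}{dt} + m\,\mathbf{r}\times\dot{\boldsymbol{\omega}} + 2m\,\mathbf{v}\times\boldsymbol{\omega} + m(\boldsymbol{\omega}\times\mathbf{r})\times\boldsymbol{\omega} \tag{3.6}$$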


The first term on the RHS of Eq. (3.6) corresponds to the product of mass and the
acceleration in the fixed (inertial) frame, that is, the actual forces applied on the system, or the true
external forces. The other terms correspond to the pseudoforces, or the fictitious or inertial
forces, that arise due to the fact that the rotating frame is noninertial. They are called, from
left to right, the Euler force, the Coriolis force (named after Gustave Coriolis, who derived formulae
for the rotating frame of reference in 1829) and the centrifugal force respectively. These forces
appear to exist only in a rotating frame of reference.
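The three inertial terms of Eq. (3.6) can be evaluated directly for given vectors. The following is a minimal sketch, not part of the original text; the function name `inertial_forces` and the numerical values are illustrative assumptions.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def scale(s, a):
    """Multiply a 3-vector by a scalar."""
    return tuple(s * x for x in a)


def inertial_forces(m, r, v, omega, omega_dot=(0.0, 0.0, 0.0)):
    """Return the (Euler, Coriolis, centrifugal) terms of Eq. (3.6).

    r and v are the position and velocity measured in the rotating frame.
    """
    euler = scale(m, cross(r, omega_dot))                   # m r x d(omega)/dt
    coriolis = scale(2.0 * m, cross(v, omega))              # 2 m v x omega
    centrifugal = scale(m, cross(cross(omega, r), omega))   # m (omega x r) x omega
    return euler, coriolis, centrifugal


# Assumed example: frame spinning uniformly about z, particle on the x-axis moving along +x.
euler, coriolis, centrifugal = inertial_forces(
    m=2.0, r=(3.0, 0.0, 0.0), v=(1.0, 0.0, 0.0), omega=(0.0, 0.0, 0.5))
print("Euler:", euler)
print("Coriolis:", coriolis)
print("centrifugal:", centrifugal)
```

In this example the centrifugal term points radially outward along +x and the Coriolis term points to the right of the motion (along -y) for anticlockwise rotation, as expected from the signs in Eq. (3.6).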

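Equation (3.6) can also be obtained in the Lagrangian formulation. Consider the Lagrangian
of the system, which has to be evaluated in the inertial frame only and then
expressed in terms of quantities defined in the rotating frame. Thus one can write

$$L = \tfrac{1}{2}m|\mathbf{v}_0|^2 - V = \tfrac{1}{2}m|\mathbf{v} + \boldsymbol{\omega}\times\mathbf{r}|^2 - V
= \tfrac{1}{2}m|\mathbf{v}|^2 + m\mathbf{v}\cdot(\boldsymbol{\omega}\times\mathbf{r}) + \tfrac{1}{2}m(\boldsymbol{\omega}\times\mathbf{r})\cdot(\boldsymbol{\omega}\times\mathbf{r}) - V \tag{3.7}$$

To construct the various terms occurring in the Euler-Lagrange equations of motion, consider

$$\frac{\partial L}{\partial\mathbf{v}} = m\mathbf{v} + m(\boldsymbol{\omega}\times\mathbf{r}) = m(\mathbf{v} + \boldsymbol{\omega}\times\mathbf{r}) = \mathbf{p}$$

where $\mathbf{p}$ is the generalised momentum corresponding to $\mathbf{r}$. So the true momentum of the
particle is not just $m\mathbf{v}$ but $m\mathbf{v}_0$, which is the momentum observed in a fixed frame, say $\mathbf{p}_0$.
Next consider

$$\frac{\partial L}{\partial\mathbf{r}} = m(\mathbf{v}\times\boldsymbol{\omega}) - m\boldsymbol{\omega}\times(\boldsymbol{\omega}\times\mathbf{r}) - \nabla V$$

Substituting in the Euler-Lagrange equations of motion

$$\frac{d}{dt}\left(\frac{\partial L}{\partial\mathbf{v}}\right) - \frac{\partial L}{\partial\mathbf{r}} = 0$$

we get

$$m\frac{d\mathbf{v}}{dt} = -\nabla V + m\,\mathbf{r}\times\dot{\boldsymbol{\omega}} + 2m\,\mathbf{v}\times\boldsymbol{\omega} + m(\boldsymbol{\omega}\times\mathbf{r})\times\boldsymbol{\omega}$$

which is the same as Eq. (3.6), provided we identify $-\nabla V$ with the externally applied
forces, which must be equal to $m(d\mathbf{v}_0/dt)$.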

The total energy of the particle in the rotating frame can be calculated as follows:

$$E = \mathbf{p}\cdot\mathbf{v} - L = m\mathbf{v}_0\cdot(\mathbf{v}_0 - \boldsymbol{\omega}\times\mathbf{r}) - \tfrac{1}{2}mv_0^2 + V
= \tfrac{1}{2}mv_0^2 + V - m\mathbf{v}_0\cdot(\boldsymbol{\omega}\times\mathbf{r}) = E_0 - \mathbf{p}\cdot(\boldsymbol{\omega}\times\mathbf{r})
= E_0 - \boldsymbol{\omega}\cdot\mathbf{L} \tag{3.8}$$

where $E_0$ is the total energy measured in the fixed frame and $\mathbf{L}$ is the angular momentum
of the system measured in the rotating frame. (Note however that $\mathbf{L} = \mathbf{L}_0$.) Both $E$ and
$E_0$ are constants of motion in $S'$ and $S$ respectively, as the Lagrangian does not have
any explicit dependence on time, but $E$ can be greater than or less than $E_0$ depending on
the relative orientations of $\mathbf{L}$ and $\boldsymbol{\omega}$.
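The chain of equalities in Eq. (3.8) can be checked numerically for arbitrary vectors. The sketch below is an illustration only; the values of `m`, `r`, `v`, `omega` and `V` are assumed, and `V` is treated as the value of the potential energy at the particle's position.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


# Assumed illustrative values (rotating-frame position and velocity).
m = 1.5
r = (1.0, -2.0, 0.5)
v = (0.3, 0.4, -0.2)
omega = (0.0, 0.2, 0.7)
V = 3.0

w_x_r = cross(omega, r)
v0 = tuple(a + b for a, b in zip(v, w_x_r))   # Eq. (3.5): v0 = v + omega x r
p = tuple(m * c for c in v0)                  # generalised momentum p = m v0

lagrangian = 0.5 * m * dot(v0, v0) - V        # Eq. (3.7)
E_rot = dot(p, v) - lagrangian                # E = p.v - L
E_fixed = 0.5 * m * dot(v0, v0) + V           # E0 in the fixed frame
ang_mom = cross(r, p)                         # angular momentum r x p

print(E_rot, E_fixed - dot(omega, ang_mom))   # the two numbers should agree
```

The agreement rests on the identity $\mathbf{p}\cdot(\boldsymbol{\omega}\times\mathbf{r}) = \boldsymbol{\omega}\cdot(\mathbf{r}\times\mathbf{p})$, which is the step from the second to the third line of Eq. (3.8).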


3.2 ELECTROMAGNETIC ANALOGY OF THE INERTIAL FORCES 

We note in Eq. (3.6) that the Coriolis force $\mathbf{f}_{\text{cr}} = 2m(\mathbf{v}\times\boldsymbol{\omega})$ is a velocity dependent
force and it does not do any work, as the instantaneous velocity $\mathbf{v}$ is always perpendicular
to the force $\mathbf{f}_{\text{cr}}$. This is a gyroscopic force and can, therefore, be included in a generalised
potential energy $U$ defined in Eqs (2.47) and (2.48) which satisfies the Euler-Lagrange
equation of motion (2.26). One can, in fact, rewrite Eq. (3.7) as $L = T - U$ with

$$T = \tfrac{1}{2}mv^2 \quad\text{and}\quad U = V - \tfrac{1}{2}m|\boldsymbol{\omega}\times\mathbf{r}|^2 - m\mathbf{v}\cdot(\boldsymbol{\omega}\times\mathbf{r}) \tag{3.9}$$

The first two terms in $U$ are velocity independent potential energies and can be put together
in the form of an effective ordinary potential energy $V_{\text{eff}} = V - \tfrac{1}{2}m|\boldsymbol{\omega}\times\mathbf{r}|^2$. The
new term $-\tfrac{1}{2}m|\boldsymbol{\omega}\times\mathbf{r}|^2$ is called the centrifugal potential energy (say $V_{\text{cf}}$) because the
centrifugal force is simply equal to $-\nabla V_{\text{cf}}$. Now, from the definition of energy $E$ in the
rotating frame as given in Eq. (3.8) it can easily be shown that

$$E = T + V_{\text{eff}} = \tfrac{1}{2}mv^2 - \tfrac{1}{2}m|\boldsymbol{\omega}\times\mathbf{r}|^2 + V \tag{3.10}$$

The velocity dependent part of $U$ does not enter the expression for energy. On comparison
with the generalised electromagnetic potential energy given in Eq. (2.52) one can verify the
following correspondences (the reader may find out what the electric field would correspond to):

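(1) The scalar potential energy $e\phi \leftrightarrow V_{\text{eff}}$

(2) The velocity-dependent potential energy $e(\mathbf{v}\cdot\mathbf{A}) \leftrightarrow m\mathbf{v}\cdot(\boldsymbol{\omega}\times\mathbf{r})$, implying further,

(3) The vector potential (momentum) $e\mathbf{A} \leftrightarrow m(\boldsymbol{\omega}\times\mathbf{r})$,

(4) The magnetic induction $\mathbf{B} = (\nabla\times\mathbf{A}) \leftrightarrow (m/e)\nabla\times(\boldsymbol{\omega}\times\mathbf{r}) = (2m/e)\boldsymbol{\omega}$, if $\boldsymbol{\omega}$
does not vary from point to point,

(5) Magnetic force $e(\mathbf{v}\times\mathbf{B}) \leftrightarrow 2m(\mathbf{v}\times\boldsymbol{\omega})$ = Coriolis force,

(6) Canonical momentum $m\mathbf{v} + e\mathbf{A} \leftrightarrow m(\mathbf{v} + \boldsymbol{\omega}\times\mathbf{r})$ and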




(7) Energy $\tfrac{1}{2}mv^2 + e\phi \leftrightarrow \tfrac{1}{2}mv^2 + V_{\text{eff}}$

The analogy seems to be quite appropriate; particularly striking is the analogy between
the magnetic field $\mathbf{B}$ and the angular velocity vector $\boldsymbol{\omega}$. The magnetic field results, in fact,
from the vortices of the charge motions; and surprisingly enough, the cyclotron frequency
of revolution of a charged particle in a uniform magnetic induction $\mathbf{B}$ is given by $\omega_c = eB/m$
($= 2|\boldsymbol{\omega}|$, from the above analogy)!
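Correspondence (4) relies on $\nabla\times(\boldsymbol{\omega}\times\mathbf{r}) = 2\boldsymbol{\omega}$ for a uniform $\boldsymbol{\omega}$. As a quick check, the sketch below evaluates the curl by central finite differences; the helper `curl`, the step size and the sample values are assumptions made for illustration.

```python
def cross(a, b):
    """Cross product of two 3-vectors (any indexable sequences)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def curl(field, point, h=1e-3):
    """Central-difference estimate of the curl of a vector field at a point."""
    def d(i, j):
        # Partial derivative of component i with respect to coordinate j.
        plus, minus = list(point), list(point)
        plus[j] += h
        minus[j] -= h
        return (field(plus)[i] - field(minus)[i]) / (2.0 * h)
    return (d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1))


omega = (0.1, -0.2, 0.3)          # assumed uniform angular velocity


def omega_cross_r(r):
    """The field omega x r, the analogue of (e/m) A in correspondence (3)."""
    return cross(omega, r)


print(curl(omega_cross_r, (1.0, 2.0, 3.0)))   # compare with 2 * omega
```

Because the field $\boldsymbol{\omega}\times\mathbf{r}$ is linear in $\mathbf{r}$, the central difference is exact up to rounding, so the printed vector should match $2\boldsymbol{\omega}$ at any point.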


3.3 EFFECTS OF CORIOLIS FORCE 

First we make a few general comments. The earth rotates from west towards east, so that
at any place on earth the angular velocity vector $\boldsymbol{\omega}$ is directed towards the north and is
parallel to the axis of rotation of the earth, that is, the polar axis (see Fig. 3.2). In any
frame attached to the earth, an object moving with velocity $\mathbf{v}$ will experience a Coriolis
force $2m\mathbf{v}\times\boldsymbol{\omega}$. The direction of this force is perpendicular to both $\mathbf{v}$ and $\boldsymbol{\omega}$, and the
magnitude of the Coriolis acceleration (in m/s$^2$, with $v$ in m/s) can never exceed

$$2\omega v = 1.46\times10^{-4}\,v \tag{3.11}$$

where we have used the value of $\omega$ for the earth (one complete rotation takes place in
23 h 56 m 4.01 s) given by

$$\omega = \frac{2\pi}{23.934\times3600} = 7.292\times10^{-5}\ \text{rad/s} \tag{3.12}$$

Fig. 3.2 The direction of $\boldsymbol{\omega}$ at any arbitrary point on the surface of the
earth

For an object moving even with a speed of 1 km/s, this upper limit is only 0.15 m/s$^2$ $\simeq$
$0.015\,g$. Thus the magnitude of the effect of the Coriolis force is extremely small compared
to $g$, the acceleration due to earth’s gravity. However, in many natural circumstances, the
period of time over which this tiny acceleration acts can be quite long, leading to substantial
deflections. We now describe a few such instances.
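The numbers in Eqs (3.11) and (3.12) follow from the length of the sidereal day. A minimal sketch of the arithmetic, assuming the rounded value 23.934 h used in Eq. (3.12) and $g = 9.8$ m/s$^2$:

```python
import math

SIDEREAL_DAY_S = 23.934 * 3600.0               # one rotation of the earth, in seconds
OMEGA_EARTH = 2.0 * math.pi / SIDEREAL_DAY_S   # Eq. (3.12), in rad/s


def max_coriolis_acceleration(speed):
    """Upper bound 2*omega*v of Eq. (3.11); speed in m/s, result in m/s^2."""
    return 2.0 * OMEGA_EARTH * speed


a_max = max_coriolis_acceleration(1000.0)      # object moving at 1 km/s
print(f"omega = {OMEGA_EARTH:.4e} rad/s")
print(f"bound at 1 km/s = {a_max:.3f} m/s^2 = {a_max / 9.8:.4f} g")
```

The bound is reached only when $\mathbf{v}$ is perpendicular to $\boldsymbol{\omega}$; for other directions the Coriolis acceleration is smaller by the sine of the angle between them.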

3.3.1 When A River Flows on the Surface of the Earth 

Rivers flow approximately in a horizontal plane. However, a nominal downward slope in the
direction of their flow is important for maintaining the speed of the flow. So the gravitational
force, acting primarily downward, also has a small component in the forward direction of flow
depending on the magnitude of the downstream slope. But there is absolutely no
component of $\mathbf{g}_{\text{eff}}$ (see below) acting along the breadth of a river. Hence the component of
the Coriolis force, however small, can act freely on the moving water across the direction of
the stream.

In order to find out this transverse horizontal component of the Coriolis acceleration, let
us choose the direction of the flow to be the $x$-axis, the transverse horizontal axis to the
left of the flow direction as the $y$-axis and the local vertical (up) as the $z$-axis. Let the
direction of the flow at a place having geographical latitude $\lambda$ make an angle $\phi$ (in the
anticlockwise sense) with respect to the geographical north direction. Therefore the earth’s
angular velocity vector $\boldsymbol{\omega}$, with respect to the above reference frame, can be expressed as

$$\boldsymbol{\omega} = \omega(\sin\lambda\,\mathbf{k} + \cos\lambda\cos\phi\,\mathbf{i} - \cos\lambda\sin\phi\,\mathbf{j})$$

The velocity of the flow is

$$\mathbf{v} = v\,\mathbf{i}$$

giving the Coriolis acceleration

$$\mathbf{a}_c = 2\mathbf{v}\times\boldsymbol{\omega} = -2v\omega[\sin\lambda\,\mathbf{j} + \cos\lambda\sin\phi\,\mathbf{k}] \tag{3.13}$$

The small vertical component of $\mathbf{a}_c$ is lost in comparison with $\mathbf{g}_{\text{eff}}$, as $\mathbf{g}_{\text{eff}} = -g_{\text{eff}}\mathbf{k}$,
but along the $y$-axis the only acceleration is $-2v\omega\sin\lambda$. This quantity is independent of $\phi$
and hence does not depend on whether the flow is towards north or south or east or west.
But it depends on $\lambda$. It is negative (that is, to the right of the flow) for $\lambda > 0$ (that is,
in the northern hemisphere), positive (that is, to the left of the flow) for $\lambda < 0$ (that is,
in the southern hemisphere) and vanishes for $\lambda = 0$. Thus the effect is absent for a river
flowing past or along the geographical equator.
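The independence of the transverse component on the flow direction can be confirmed by evaluating Eq. (3.13) for several values of $\phi$. This is a sketch under the axis convention stated above ($x$ along the flow, $y$ to its left, $z$ up); the function name, flow speed and latitudes are assumed for illustration.

```python
import math

OMEGA_EARTH = 7.292e-5   # rad/s, Eq. (3.12)


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def river_coriolis(speed, lat_deg, phi_deg, omega=OMEGA_EARTH):
    """Coriolis acceleration 2 v x omega in the (flow, left, up) frame."""
    lam, phi = math.radians(lat_deg), math.radians(phi_deg)
    w = (omega * math.cos(lam) * math.cos(phi),    # component along the flow
         -omega * math.cos(lam) * math.sin(phi),   # component to the left of the flow
         omega * math.sin(lam))                    # vertical component
    return tuple(2.0 * c for c in cross((speed, 0.0, 0.0), w))


for phi in (0.0, 90.0, 200.0):
    a = river_coriolis(2.0, 30.0, phi)
    print(f"phi = {phi:5.1f} deg: a_y = {a[1]:+.3e}, a_z = {a[2]:+.3e} m/s^2")
```

Only the vertical component changes with $\phi$; the transverse component stays at $-2v\omega\sin\lambda$ and flips sign when the latitude is made negative.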

Thus, a Coriolis force will be experienced by the water in rivers flowing in any direction,
causing a deviation towards the right of the flow direction in the northern hemisphere
and to the left of the flow direction in the southern hemisphere. As a result, the corresponding
banks of the river will be denuded more, which is actually observed. The other effect
of this $y$-component of the Coriolis force is to raise the water at the right banks of rivers in the
northern hemisphere to a slightly higher level than at the left banks. The opposite is true for
rivers flowing in the southern hemisphere.




3.3.2 Air Flow on the Surface of the Earth 
(a) Cyclones 

When a low pressure zone is created at any place on earth, pressure gradients are set
up and air flows towards the low pressure zone to equalise the pressure difference. In
the absence of the Coriolis force the direction of wind velocity would be perpendicular to the
isobars (the equipressure lines) as shown in Fig. 3.3. However, in the northern hemisphere,
the Coriolis force acts on the wind to deviate its direction towards its right and the wind now
flows spirally towards the centre of low pressure in a counter-clockwise direction. Since the
Coriolis force acts perpendicular to the trajectory, it will provide the centripetal force, that
is, $v^2/R \sim 2\omega v$, where $R$ is the radius of curvature of the wind trajectory. This gives
$R \sim v/2\omega$. For wind speed $v = 30$ m/s (that is, 108 km/hr), $R \sim 210$ km. The radius
of the cyclonic activity is directly proportional to the speed of the wind. Of course, the eye
of the cyclone also moves along with the centre of low pressure. In the southern hemisphere
the cyclonic direction about the eye of the cyclone is clockwise and the magnitudes of the
cyclonic activity are all similar.


Fig. 3.3 Coriolis deflection of the direction of wind motion between
two atmospheric isobars


(b) Trade Winds and Tropical Winds 

These occur, again, due to moderate (as compared to the cyclonic case) pressure gradients in the
atmosphere set up over large distances. Again, the winds would tend to flow perpendicular
to the isobars but the Coriolis force makes the flow direction deviate towards the right or
left depending upon whether the phenomenon is occurring in the northern or southern
hemisphere. Since in this case the Coriolis force is comparable in magnitude to that due to the
pressure gradients, the winds continue to deviate until their flow is parallel to the isobars and the
resulting Coriolis force just balances the pressure gradient force. The winds then continue
to flow parallel to the isobars, circulating counter-clockwise in the northern hemisphere
around the centre of low pressure. The corresponding winds flow clockwise in the southern
hemisphere. The same type of analysis goes for the oceanographic water currents.

It must be emphasised that this is a simplified picture of the real phenomenon where, 
for example, we have not bothered about the way the pressure gradients are set up which 
involves complicated hydrodynamic equations. We have also neglected the effect of viscosity. 

3.3.3 Projectile Motion 

We now analyse the effect of the Coriolis force on a projectile. 

The acceleration of a projectile with respect to the earth, measured at any place on the
earth, is given by Eq. (3.6), where $\boldsymbol{\omega}$ is the constant angular velocity of earth’s rotation
about its polar axis. In general, we have

$$\dot{\mathbf{v}} = 2\mathbf{v}\times\boldsymbol{\omega} + (\boldsymbol{\omega}\times\mathbf{r})\times\boldsymbol{\omega} + \mathbf{g} \tag{3.14}$$

where $\mathbf{g}$ is the acceleration purely due to gravity of the earth ($m\mathbf{g} = -\nabla V$, $V$ being the
actual gravitational potential energy of the projectile due to the earth at the point under
consideration). We define

$$\mathbf{g}_{\text{eff}} = (\boldsymbol{\omega}\times\mathbf{r})\times\boldsymbol{\omega} + \mathbf{g} \tag{3.15}$$

to be the effective local acceleration (due to the combined gravity and centrifugal force of the
earth’s rotation), which can be derived from the effective potential

$$V_{\text{eff}} = V - \frac{m}{2}|\boldsymbol{\omega}\times\mathbf{r}|^2$$

in the form of $m\mathbf{g}_{\text{eff}} = -\nabla V_{\text{eff}}$.

Before cooling down to the present form, the earth was once in a fluid state rotating about
its axis. The free surface of any fluid in a stationary state assumes an equipotential surface
corresponding to the superposition of all the potentials due to, say, the gravitational force
and the centrifugal force due to rotation, if any. If it were not an equipotential surface,
the forces tangent to the surface would have caused the actual transport of the fluid mass
in order to nullify them and generate a mechanical equilibrium. On such a surface, called
the geoid, $\mathbf{g}_{\text{eff}}$ defined through Eq. (3.15) must be normal at every point. In fact, the
present surface of the earth is, to a very good approximation, a geoid. Therefore at every
point $\mathbf{g}_{\text{eff}}$ is normal to the earth’s surface defined by the mean sea level, which is, by the
property of water, an equipotential surface. The contribution due to centrifugal acceleration
is maximum on the equator of the earth (magnitude = 0.03392 m/s$^2$) and acts directly
against the local gravity, giving $g_{\text{eff}} = 9.7803$ m/s$^2$ on the equator. At the poles, the
centrifugal acceleration vanishes, but $g_{\text{eff}}$ at the poles increases by 0.05173 m/s$^2$, the
excess over the centrifugal correction being a contribution coming from the oblateness of
the geoid, which amounts to 0.0178 m/s$^2$.

Equation (3.14) now reads as

$$\dot{\mathbf{v}} = 2\mathbf{v}\times\boldsymbol{\omega} + \mathbf{g}_{\text{eff}} \tag{3.16}$$

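To solve this equation we note that the magnitude of the Coriolis acceleration is much smaller
than $g_{\text{eff}}$. Therefore we can adopt the method of successive approximation (iterative process)
as our strategy to solve Eq. (3.16).

Suppose $\mathbf{v}(t)$ satisfies Eq. (3.16). Then we write

$$\mathbf{v}(t) = \mathbf{v}_1(t) + \mathbf{v}_2(t)$$

such that

$$|\mathbf{v}_1(t)| \gg |\mathbf{v}_2(t)|, \qquad \dot{\mathbf{v}}_1 = \mathbf{g}_{\text{eff}} \quad\text{and}\quad \dot{\mathbf{v}}_2 = 2\mathbf{v}_1\times\boldsymbol{\omega}$$

Assuming $\mathbf{g}_{\text{eff}}$ = constant over the trajectory of the projectile, the first of these equations
gives, on integration,

$$\mathbf{v}_1 = \mathbf{g}_{\text{eff}}t + \mathbf{v}_0$$

with $\mathbf{v}_0$ as the constant of integration. Then, on substitution of this $\mathbf{v}_1$, the second
equation gives, on further integration,

$$\mathbf{v}_2 = (\mathbf{g}_{\text{eff}}\times\boldsymbol{\omega})t^2 + 2(\mathbf{v}_0\times\boldsymbol{\omega})t$$

Therefore,

$$\mathbf{v}(t) = \mathbf{v}_1 + \mathbf{v}_2 = \mathbf{v}_0 + (\mathbf{g}_{\text{eff}} + 2\mathbf{v}_0\times\boldsymbol{\omega})t + (\mathbf{g}_{\text{eff}}\times\boldsymbol{\omega})t^2 \tag{3.17}$$

Since $\mathbf{v} = d\mathbf{r}/dt$, we integrate this equation with respect to time to get an approximate
solution correct up to the order $g\omega t^2$,

$$\mathbf{r}(t) = \mathbf{r}_0 + \mathbf{v}_0t + \tfrac{1}{2}(\mathbf{g}_{\text{eff}} + 2\mathbf{v}_0\times\boldsymbol{\omega})t^2 + \tfrac{1}{3}(\mathbf{g}_{\text{eff}}\times\boldsymbol{\omega})t^3 \tag{3.18}$$

where $\mathbf{r}_0$ and $\mathbf{v}_0$ are the initial position and velocity of the projectile.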
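To see how good the first-order solution (3.18) is, one can integrate Eq. (3.16) numerically and compare. The sketch below uses a hand-written fourth-order Runge-Kutta step; the local axes (east, north, up), the latitude, the initial conditions and $g_{\text{eff}} = 9.8$ m/s$^2$ are assumptions chosen for illustration.

```python
import math

OMEGA_EARTH = 7.292e-5   # rad/s
G_EFF = 9.8              # m/s^2, assumed constant over the trajectory


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def add(*vs):
    """Component-wise sum of any number of 3-vectors."""
    return tuple(sum(c) for c in zip(*vs))


def scale(s, a):
    return tuple(s * x for x in a)


def approx_position(t, r0, v0, g_eff, omega):
    """First-order solution, Eq. (3.18)."""
    quad = scale(0.5 * t * t, add(g_eff, scale(2.0, cross(v0, omega))))
    cubic = scale(t ** 3 / 3.0, cross(g_eff, omega))
    return add(r0, scale(t, v0), quad, cubic)


def integrate(t_end, r0, v0, g_eff, omega, steps=2000):
    """RK4 integration of dr/dt = v, dv/dt = 2 v x omega + g_eff (Eq. 3.16)."""
    def acc(v):
        return add(scale(2.0, cross(v, omega)), g_eff)
    dt = t_end / steps
    r, v = r0, v0
    for _ in range(steps):
        k1r, k1v = v, acc(v)
        v2 = add(v, scale(dt / 2, k1v)); k2r, k2v = v2, acc(v2)
        v3 = add(v, scale(dt / 2, k2v)); k3r, k3v = v3, acc(v3)
        v4 = add(v, scale(dt, k3v)); k4r, k4v = v4, acc(v4)
        r = add(r, scale(dt / 6, add(k1r, scale(2, k2r), scale(2, k3r), k4r)))
        v = add(v, scale(dt / 6, add(k1v, scale(2, k2v), scale(2, k3v), k4v)))
    return r


lam = math.radians(40.0)                                        # assumed latitude
omega = (0.0, OMEGA_EARTH * math.cos(lam), OMEGA_EARTH * math.sin(lam))
g_eff = (0.0, 0.0, -G_EFF)
r0, v0 = (0.0, 0.0, 100.0), (5.0, 10.0, 20.0)                   # assumed launch data

r_num = integrate(4.0, r0, v0, g_eff, omega)
r_app = approx_position(4.0, r0, v0, g_eff, omega)
print("largest component difference (m):",
      max(abs(a - b) for a, b in zip(r_num, r_app)))
```

The residual difference is of second order in $\omega$, which is why it is far below the first-order Coriolis displacement itself for flight times of a few seconds.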

For constant $\mathbf{g}_{\text{eff}}$, an exact solution to Eq. (3.16) has, however, been provided by F. R.
Gantmakher (1960); it is called Gantmakher’s formula for deflection due to the Coriolis force and is
given by


r(i) = r a + v„t + l -g KK t 2 + --}(« x * 0 ) 


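Let us now consider the projectile motion over a given geographical latitude $\lambda$ (see Fig.
3.4). There can be various cases:

(a) The projectile dropped from a height $h$ with initial velocity zero:

Thus $\mathbf{v}_0 = 0$, $\mathbf{r}_0 = h\mathbf{k}$, $\mathbf{g}_{\text{eff}} = -g_{\text{eff}}\mathbf{k}$, and $\boldsymbol{\omega} = \omega(\cos\lambda\,\mathbf{j} + \sin\lambda\,\mathbf{k})$, where $\mathbf{i}$ is
towards east, $\mathbf{j}$ towards north and $\mathbf{k}$ vertically upwards over the place under consideration.
Substituting these in the approximate solution (3.18), one obtains

$$\mathbf{r}(t) = h\mathbf{k} - \tfrac{1}{2}g_{\text{eff}}t^2\,\mathbf{k} + \tfrac{1}{3}\omega\cos\lambda\,g_{\text{eff}}t^3\,\mathbf{i}$$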


Fig. 3.4 Natural set of rectangular Cartesian coordinates at any
point on the surface of the rotating earth, representing
the relevant rotating frame of reference


Now the projectile will hit the earth as soon as the condition $h = g_{\text{eff}}t^2/2$ is satisfied and
at that moment

$$\mathbf{r} = \frac{1}{3}\,\omega\cos\lambda\,g_{\text{eff}}\left(\frac{2h}{g_{\text{eff}}}\right)^{3/2}\mathbf{i}$$

It does not reach the origin, but is shifted from the origin by the above amount to the
east irrespective of the sign of $\lambda$. The Coriolis deflection is therefore towards east at all
latitudes in both the hemispheres (as $\cos\lambda > 0$ for $-\pi/2 < \lambda < \pi/2$) and the amount
of deflection from the local vertical is given by

$$d = \frac{1}{3}\,\omega g_{\text{eff}}\cos\lambda\left(\frac{2h}{g_{\text{eff}}}\right)^{3/2} \tag{3.20}$$

For $h = 300$ m and $\lambda = 40^\circ$, $d \simeq 9$ cm, but for $h = 30$ m, $d$ is smaller by a factor of
$10\sqrt{10}$, giving only about 3 mm, which is very difficult to detect.
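The two numerical estimates quoted above follow directly from Eq. (3.20). A minimal sketch of the arithmetic, assuming $g_{\text{eff}} = 9.8$ m/s$^2$ and the value of $\omega$ from Eq. (3.12); the function name is illustrative.

```python
import math

OMEGA_EARTH = 7.292e-5   # rad/s


def eastward_deflection(h, lat_deg, g_eff=9.8, omega=OMEGA_EARTH):
    """Easterly deflection d of Eq. (3.20) for a drop from height h (metres)."""
    t_fall = math.sqrt(2.0 * h / g_eff)   # from h = g_eff t^2 / 2
    return omega * g_eff * math.cos(math.radians(lat_deg)) * t_fall ** 3 / 3.0


d_300 = eastward_deflection(300.0, 40.0)
d_30 = eastward_deflection(30.0, 40.0)
print(f"h = 300 m: d = {100 * d_300:.1f} cm")
print(f"h =  30 m: d = {1000 * d_30:.1f} mm")
print(f"ratio = {d_300 / d_30:.2f}")
```

Since $d \propto h^{3/2}$, reducing the height tenfold reduces the deflection by $10^{3/2} = 10\sqrt{10}$, independent of latitude.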

Newton was the first to propose an experiment of this kind to detect the rotation of the
earth. After about 100 years, in 1791, Guglielmini of Bologna made an attempt to carry
out the experiment from a 300 ft high tower but the results were disappointing due to sway
in the wind. Meanwhile Gauss and Coriolis did extensive calculations on the Coriolis deflection.
Reich’s experiments in 1831 - 33 in the mines of Freiberg, in fact, resulted in $d = 2.58$ cm
for $h = 158$ m. Later on, in 1903, E. H. Hall made 948 trials at the 23 metre high tower
at Harvard University and got the expected result $d \simeq 1.50$ mm. But he also got a
southerly deflection of about 0.045 mm. In 1912, J. G. Hagen used Atwood’s machine to
artificially reduce the effective acceleration of fall by a factor of about 10 and proved the
Coriolis effect beyond doubt; he also observed no appreciable southerly deflection.

One, in fact, arrives at a very small southerly deflection of order $\tfrac{1}{6}g_{\text{eff}}\omega^2t^4\sin\lambda\cos\lambda$ from
the formula in Eq. (3.19) together with the usual easterly deflection given by $d$ in Eq.
(3.20). But this is too small to be observed and also the effect of non-uniformity in $g_{\text{eff}}$
becomes important at that level.

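(b) A projectile is sent vertically up with velocity $v_0$ to reach a height $h$ above the
ground and it returns to the ground:

In this case, we have

$$\mathbf{v}_0 = v_0\mathbf{k} = \sqrt{2g_{\text{eff}}h}\,\mathbf{k}, \quad \mathbf{r}_0 = 0, \quad \mathbf{g}_{\text{eff}} = -g_{\text{eff}}\mathbf{k} \quad\text{and}\quad \boldsymbol{\omega} = \omega(\cos\lambda\,\mathbf{j} + \sin\lambda\,\mathbf{k})$$

Then Eq. (3.18) gives us

$$\mathbf{r}(t) = \left(v_0t - \tfrac{1}{2}g_{\text{eff}}t^2\right)\mathbf{k} - \sqrt{2g_{\text{eff}}h}\;t^2\omega\cos\lambda\,\mathbf{i} + \tfrac{1}{3}\omega\cos\lambda\,g_{\text{eff}}t^3\,\mathbf{i} \tag{3.21}$$

The coefficient of $\mathbf{k}$ in Eq. (3.21) must vanish as the projectile returns to the ground, that is,
at $t = 2v_0/g_{\text{eff}}$. Therefore,

$$\mathbf{r}(t) = -\frac{4}{3}\,\frac{v_0^3}{g_{\text{eff}}^2}\,\omega\cos\lambda\,\mathbf{i} \tag{3.22}$$

Thus the deflection is to the west in both the hemispheres and vanishes at the poles.

When compared with Eq. (3.20), the deflection given by Eq. (3.22) is exactly $4d$ and its
sign is just the opposite. Qualitatively this can be understood in the following way.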

The earth is rotating from west to east with an equatorial speed of rotation of about 465 
m/sec. This speed reduces by a factor of cos A as we go to any latitude A, nevertheless the 
sense of rotation remains the same everywhere. Now as we climb a tall tower, the speed of 
rotation of the tip of the tower is slightly higher than that of its base. So when we drop a 
stone from the top of a tower, its initial easterly speed is higher than that of anything at 
the base. With this slightly higher speed any particle dropped from a height reaches the 
ground with a net easterly deflection. This is true for all latitudes. Now, when we project 
a stone up from the ground its initial horizontal speed due to earth’s rotation is lower than 
that for any point above the ground. Hence the projectile gradually lags behind the earth’s
rotation, and returns to the ground with a net westerly deflection, and it is larger because
it has spent a longer time in flight and has lagged behind the earth’s rotation all the time.
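The factor of 4 between the two deflections, and the opposite sign, can be confirmed by evaluating the east component of Eq. (3.21) at the landing time and comparing it with Eq. (3.20) for the same height. This is a minimal sketch with assumed values of $h$, latitude and $g_{\text{eff}}$; the function names are illustrative.

```python
import math

OMEGA_EARTH = 7.292e-5   # rad/s


def drop_deflection(h, lat_deg, g_eff=9.8, omega=OMEGA_EARTH):
    """Easterly deflection of Eq. (3.20) for a body dropped from height h."""
    t_fall = math.sqrt(2.0 * h / g_eff)
    return omega * g_eff * math.cos(math.radians(lat_deg)) * t_fall ** 3 / 3.0


def throw_deflection(h, lat_deg, g_eff=9.8, omega=OMEGA_EARTH):
    """East component of Eq. (3.21) when a body thrown up to height h lands."""
    v0 = math.sqrt(2.0 * g_eff * h)       # launch speed needed to reach h
    t_land = 2.0 * v0 / g_eff             # vertical coefficient of Eq. (3.21) vanishes
    c = omega * math.cos(math.radians(lat_deg))
    return -v0 * c * t_land ** 2 + c * g_eff * t_land ** 3 / 3.0


h, lat = 50.0, 20.0                       # assumed illustrative values
print("drop  :", drop_deflection(h, lat), "m (east positive)")
print("throw :", throw_deflection(h, lat), "m (east positive)")
print("ratio :", throw_deflection(h, lat) / drop_deflection(h, lat))
```

The ratio does not depend on $h$, $\lambda$ or $g_{\text{eff}}$, because both results scale as $\omega\cos\lambda\,v_0^3/g_{\text{eff}}^2$ with $v_0^2 = 2g_{\text{eff}}h$.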

3.3.4 Coriolis Effect in Atomic Nuclei 

We have seen that in a rotating frame the energy $E$ of any particle is less than its inertial
value $E_0$ by an amount $\boldsymbol{\omega}\cdot\mathbf{L}$ (see Eq. (3.8)), where $\boldsymbol{\omega}$ is the angular velocity of rotation
of the rotating frame and $\mathbf{L}$ is the angular momentum of the particle as defined in the
rotating frame. So a particle in the rotating frame has the lowest energy when the angular
momentum vector $\mathbf{L}$ aligns with the angular velocity vector $\boldsymbol{\omega}$ of the rotating frame.
Usually in nuclei having an odd number of nucleons, the odd nucleon is found to rotate
about the even numbered nucleon with a high orbital angular momentum. In such cases
the $\boldsymbol{\omega}\cdot\mathbf{L}$ term becomes quite important in their Hamiltonian and the lowest energy states
are found to be associated with maximally aligned angular momenta, giving the rotationally
aligned bands of nuclear states. This has been observationally verified.

3.3.5 Coriolis Phenomenon in the Planetary Atmospheres 

For a fast rotating planet, the Coriolis effect can be quite prominent and may give rise to 
gross atmospheric structures on the planet. Jupiter and Saturn are more than or about 10 
times bigger in size than the earth, and rotate with an angular speed of about two and a half 
times that of the earth and show a number of bands parallel to their respective equators. 
The vertical currents of air due to convection are subjected to a large Coriolis force which 
acts in the horizontal direction, and they finally start encircling the planet in the rotationally 
aligned orbits. 


3.4 FOUCAULT’S PENDULUM 


The French physicist Léon Foucault noticed that the small effect of the Coriolis force could
be greatly amplified by using a pendulum, an idea which had escaped the notice of Gauss,
Laplace, D’Alembert, Poisson and others. He noticed that the rightward Coriolis deflection
on one swing of the pendulum could not be undone in the return swing: the effect would
accumulate! Thus the effect of the Coriolis force of terrestrial origin moved from the domain
of theory and outdoor observations to that of observation in a laboratory experiment. A
simple set up is shown in Fig. 3.5.

The equation of motion for the pendulum including the Coriolis term is

$$\ddot{\mathbf{r}} + k^2\mathbf{r} = 2(\mathbf{v}\times\boldsymbol{\omega}) \tag{3.23}$$

where $k^2 = g_{\text{eff}}/l$, $l$ being the effective length of swing. Written in terms of Cartesian
components, these are

$$\ddot{x} + k^2x = 2(\dot{y}\omega_z - \dot{z}\omega_y) \simeq 2\dot{y}\omega_z$$
$$\text{and}\quad \ddot{y} + k^2y = 2(\dot{z}\omega_x - \dot{x}\omega_z) \simeq -2\dot{x}\omega_z \tag{3.24}$$

The last two approximations are justified because $\dot{z}$ and $z$ are negligible compared to $\dot{x}$, $\dot{y}$
and $x$, $y$. Equations (3.24) are coupled equations of motion in $x$ and $y$. To solve them, choose
the complex variable $u = x + iy$, $i = \sqrt{-1}$. Then multiplying the second of Eqs (3.24) by
$i$ and adding to the first, we get

$$\ddot{u} + k^2u = -2i\omega_z\dot{u}$$

or

$$\ddot{u} + 2i\omega_z\dot{u} + k^2u = 0$$


Fig. 3.5 Foucault’s pendulum: its plane of oscillation, while
passing always through the origin, rotates slowly
in the clockwise sense


The general solution of the above equation is

$$u = \exp(-i\omega_zt)\left[A_1\exp(ik't) + A_2\exp(-ik't)\right]$$

where $k'^2 = k^2 + \omega_z^2$. Thus we have

$$x + iy = (x_0 + iy_0)\exp(-i\omega_zt) \tag{3.25}$$

where $x_0$ and $y_0$ are the solutions when the Coriolis force is absent; $k' \simeq k$ is assumed as
$k \gg \omega_z$.

Equation (3.25) tells us that the plane of oscillation of the pendulum rotates with an
angular velocity $-\omega_z\mathbf{k}$, that is, opposite to the sense of rotation of the earth. The period
for a complete rotation of the plane of oscillation is

$$T = \frac{2\pi}{\omega_z} = \frac{2\pi}{\omega\sin\lambda} \tag{3.26}$$

where $\lambda$ is the geographical latitude of the place. At the poles, $T$ = 24 hrs, while at the
equator $T = \infty$ so that no rotation of the plane of oscillation is observed. Foucault
demonstrated the truth of Eqs (3.25) and (3.26) through his historic pendulum experiment
in 1851, which goes by his name.
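Equation (3.26) is easy to tabulate. The sketch below is illustrative only; it uses the sidereal value of $\omega$ from Eq. (3.12), so the polar period comes out slightly under 24 hours, and the function name and sample latitudes are assumptions.

```python
import math

OMEGA_EARTH = 7.292e-5   # rad/s, Eq. (3.12)


def foucault_period_hours(lat_deg, omega=OMEGA_EARTH):
    """Period of rotation of the plane of oscillation, Eq. (3.26), in hours."""
    omega_z = omega * math.sin(math.radians(lat_deg))   # vertical component of omega
    if omega_z == 0.0:
        return math.inf                                 # no rotation at the equator
    return 2.0 * math.pi / abs(omega_z) / 3600.0


for lat in (90.0, 60.0, 30.0, 0.0):
    print(f"latitude {lat:4.1f} deg: T = {foucault_period_hours(lat):.2f} h")
```

The absolute value makes the period the same in both hemispheres; only the sense of rotation of the plane changes with the sign of $\lambda$.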

Note that by measuring T Foucault measured the period of rotation of the earth. Thus we 
get the terrestrial demonstration of the earth’s axial rotation. This was the first experimental 
proof that the earth is in fact rotating with respect to the inertial frame in which Newton’s 
laws are to be valid, with an angular velocity which is precisely the same as that inferred 
from the apparent diurnal rotation of the sun, moon, and the star sphere. 




It is straightforward to understand the mathematical logic behind the appearance of $\omega_z$
rather than $\omega$ in Eqs (3.25) and (3.26), but it is not so easy to comprehend it intuitively.
If Foucault’s experiment is performed at the poles, $\omega_z = \omega$, and we can follow
why the plane of oscillation of the pendulum does not change in the inertial frame: it is
because the earth rotates beneath the support of the pendulum exactly once a day. But
what is difficult to understand is that while every other place on earth is also rotating back
to the same point at an interval of one day, Foucault’s pendulum hanging over the place
is found to return to its original plane of oscillation much later; for example, over a place
with geographical latitude $\lambda = 30^\circ$, $T$ = 48 hours. Why are these two events not
synchronised at all points on earth?

Apparently, the reason is that the pendulum does not understand the curvature of the
earth. It behaves as if its bob is always lying on a plane surface which can be constructed by
making a huge cone, with its base touching the small circle for a given $\lambda$ and having the
apex of the cone at some point above the nearer geographic pole. Only with a radial cut can
such a cone be flattened on to a plane to form a disc of radius $R\cot\lambda$ ($R$ being the
radius of the earth), with a missing segment of the disc. (The outer perimeter of this
cone-turned-into-a-flat-disc is not $2\pi R\cot\lambda$ but $2\pi R\cos\lambda$.) In the time the earth completes one
diurnal rotation, Foucault’s pendulum at latitude $\lambda$ completes a precession up to an angle
covered by the arc perimeter of the flattened disc, which is smaller than $2\pi$ by a factor of
$\sin\lambda$, which is what appears in Eq. (3.26).
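The geometric argument can be written out in one line. The symbol $\Theta$ for the angle subtended by the flattened cone is introduced here only for illustration; the rest follows the radii quoted in the paragraph above.

```latex
\Theta \;=\; \frac{\text{arc length of the latitude circle}}{\text{radius of the flattened disc}}
       \;=\; \frac{2\pi R\cos\lambda}{R\cot\lambda}
       \;=\; 2\pi\sin\lambda ,
\qquad\text{so that}\qquad
T \;=\; \frac{2\pi}{\Theta}\times(\text{one day}) \;=\; \frac{2\pi}{\omega\sin\lambda}.
```

For $\lambda = 30^\circ$ this gives $\Theta = \pi$ per day, that is, half a turn per day and hence $T$ = 48 hours, in agreement with the example above.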

Foucault’s precessing pendulum makes use of the horizontal component of the Coriolis
force. The existence of the vertical component of the Coriolis force was first demonstrated
by another ingenious experiment devised by the Hungarian physicist Roland Eötvös in
1922. He took a chemical balance, removed its pans and allowed the rotation of the beam
in the horizontal plane. The Coriolis force on the two arms of the beam acts up and down,
producing a horizontal torque on the beam which brings the balance into forced vibration.
Although this effect was very small, Eötvös was also able to demonstrate the rotation of the
earth.


3.5 VELOCITY AND ACCELERATION OF A PARTICLE WITH RESPECT 
TO A SYSTEM HAVING TWO INDEPENDENT ROTATIONS ABOUT 
A COMMON POINT 

This situation is realised in a precessing flywheel (see Fig. 3.6). Let $S_0$ be the inertial
frame with $(x_0, y_0, z_0)$ axes fixed in space and origin at the centre of the wheel. $z_0$ is the
vertical axis about which the precession of the rotating wheel as a whole takes place.

Let $S_1$ with axes $(x_1, y_1, z_1)$ be the intermediate rotating frame, rotating with angular
velocity $\boldsymbol{\omega}_0$ about the $z_0$ axis so that the $z_0$ and $z_1$ axes are common but the $x_1$ and $y_1$ axes are
rotating in the plane of the $x_0$ and $y_0$ axes with the angular speed $\omega_0$. The $x_1$ axis is chosen
to be along the normal to the plane of the wheel.

Let $S_2$ having axes $(x_2, y_2, z_2)$ be the body frame fixed to the body of the wheel. The $x_1$
and $x_2$ axes are common. The wheel rotates about this common $x_1$-$x_2$ axis with angular
velocity $\boldsymbol{\omega}_1$. Thus the axes $y_2$ and $z_2$ rotate in the plane of the axes $y_1$ and $z_1$ with the
same angular speed $\omega_1$. The origins of all the frames $S_0$, $S_1$ and $S_2$ coincide.

In order to deal with the most general case of rigid body rotations about a fixed point we
need to add one more rotation, namely about the $z_2$ axis, passing to a new frame, say
$S_3$, which will then be the body frame, and $S_1$ and $S_2$ will play the role of intermediate
frames. The resulting three rotations are called the Eulerian rotations.

However, for the present, the problem we wish to deal with is to calculate the velocity
and acceleration of any point fixed in the rotating body of the wheel with reference to the
inertial reference frame $S_0$. There are essentially two methods of doing this.

Method A

Calculate the velocity and acceleration in the frame $S_1$ and then transform to the velocity
and acceleration in the frame $S_0$.

Method B

Calculate the composite angular velocity and angular acceleration of the entire system and
then transform from $S_2$ to $S_0$ directly.

The relevant equations for the consideration of the above methods are Eqs (3.4), (3.5)
and (3.6).

Method A

The velocity and acceleration of any fixed point in the frame $S_2$ measured in the $S_1$ frame
(that is, in terms of the unit vectors of the $S_1$ frame) are

$$\mathbf{v}_1 = \boldsymbol{\omega}_1\times\mathbf{r}\,|_1$$

and

$$\mathbf{a}_1 = \dot{\boldsymbol{\omega}}_1\times\mathbf{r}\,|_1 + \boldsymbol{\omega}_1\times(\boldsymbol{\omega}_1\times\mathbf{r})|_1$$

where $|_1$ means that the quantities are expressed with respect to the unit vectors (triad) of
$S_1$. Note that $\mathbf{r}$ is the same in all the three frames.

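Now the velocity and acceleration of the same point with respect to the fixed $S_0$ frame
in terms of the unit vectors of the frame $S_1$ are

$$\mathbf{v}_0 = \mathbf{v}_1 + \boldsymbol{\omega}_0\times\mathbf{r}\,|_1 \tag{3.27}$$

and

$$\mathbf{a}_0 = \mathbf{a}_1 + \dot{\boldsymbol{\omega}}_0\times\mathbf{r}\,|_1 + 2\boldsymbol{\omega}_0\times\mathbf{v}_1|_1 + \boldsymbol{\omega}_0\times(\boldsymbol{\omega}_0\times\mathbf{r})|_1 \tag{3.28}$$

Method B

The composite angular velocity of the entire system expressed in terms of the unit vectors
of $S_1$ is

$$\boldsymbol{\Omega} = \boldsymbol{\omega}_0|_1 + \boldsymbol{\omega}_1|_1 \tag{3.29}$$

The angular acceleration of the composite system with respect to $S_0$ as expressed in terms
of the unit vectors of $S_1$ is

$$\boldsymbol{\alpha} = \dot{\boldsymbol{\Omega}}|_1 + \boldsymbol{\omega}_0\times\boldsymbol{\Omega}|_1 \tag{3.30}$$

Then the velocity and acceleration of any point fixed with respect to $S_2$ and measured
directly in $S_0$ but expressed in unit vectors of $S_1$ are

$$\mathbf{v}_0 = \boldsymbol{\Omega}\times\mathbf{r}\,|_1 \tag{3.31}$$

and

$$\mathbf{a}_0 = \boldsymbol{\alpha}\times\mathbf{r}\,|_1 + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r})|_1 \tag{3.32}$$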

It is now straightforward to check that both the methods give identical results. 
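The agreement of the two methods can also be spot-checked numerically. The following is a minimal sketch, not part of the original text: it assumes arbitrary illustrative component values for the angular velocities $\boldsymbol{\omega}_0$, $\boldsymbol{\omega}_1$, their rates of change and $\mathbf{r}$ (all in the $S_1$ basis), and the helper names `cross`, `add`, `scale` and `max_diff` are hypothetical.

```python
# Spot check that Method A (Eqs 3.27, 3.28) and Method B (Eqs 3.29 to 3.32)
# give the same velocity and acceleration. All triples are components in the
# S1 basis; the numerical values are arbitrary assumptions for illustration.

def cross(a, b):
    """Vector product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(*vectors):
    """Component-wise sum of any number of vectors."""
    return tuple(sum(components) for components in zip(*vectors))

def scale(k, a):
    """Multiply a vector by a scalar."""
    return tuple(k * x for x in a)

def max_diff(a, b):
    """Largest absolute component difference between two vectors."""
    return max(abs(x - y) for x, y in zip(a, b))

w0, w0_dot = (0.3, -1.2, 0.7), (0.05, 0.4, -0.2)   # omega_0 and its rate
w1, w1_dot = (1.1, 0.6, -0.9), (-0.3, 0.1, 0.25)   # omega_1 and its rate
r = (0.8, -0.5, 1.4)                               # point fixed in S2

# Method A: first S2 -> S1, then S1 -> S0
v1 = cross(w1, r)
a1 = add(cross(w1_dot, r), cross(w1, cross(w1, r)))
v0_A = add(v1, cross(w0, r))
a0_A = add(a1, cross(w0_dot, r), scale(2.0, cross(w0, v1)),
           cross(w0, cross(w0, r)))

# Method B: composite angular velocity and angular acceleration
Omega = add(w0, w1)
alpha = add(w0_dot, w1_dot, cross(w0, Omega))
v0_B = cross(Omega, r)
a0_B = add(cross(alpha, r), cross(Omega, cross(Omega, r)))

print("velocity mismatch:    ", max_diff(v0_A, v0_B))
print("acceleration mismatch:", max_diff(a0_A, a0_B))
```

Both mismatches should be at the level of floating-point rounding; the underlying identity is $\boldsymbol{\omega}_0 \times (\boldsymbol{\omega}_1 \times \mathbf{r}) = (\boldsymbol{\omega}_0 \times \boldsymbol{\omega}_1) \times \mathbf{r} + \boldsymbol{\omega}_1 \times (\boldsymbol{\omega}_0 \times \mathbf{r})$.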


3.6 MORE GENERAL CASE OF TWO ROTATIONS SEPARATED BY ONE 
TRANSLATION 

As shown in Fig. 3.7, let the frame $S_1$ rotate about the common origin $O$ of $S_1$ and the
fixed frame $S_0$ with angular velocity $\boldsymbol{\omega}_0$. Let $S_1'$ and $S_1$ be the frames having directions of
their axes identical but their origins $O$ and $O'$ separated by a translation $\mathbf{R}$. Let the frame
$S_2$ be rotating with respect to $S_1'$ about their common origin at $O'$ with angular velocity
$\boldsymbol{\omega}_1$. The problem is to relate the velocity and acceleration of any particle with respect to
the $S_2$ frame to those with respect to the $S_0$ frame in unit vectors of $S_1$ or $S_1'$, for example.

There exist, again, two methods of solution as described in the previous article. 





Fig. 3.7 Two rotating frames of reference connected by a translation between their origins 
O and O' 


Method A: 

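Velocity and acceleration in the $S_1'$ frame are (assuming $\mathbf{v}_2 \neq 0$ in general)

$$\mathbf{v}_1' = \mathbf{v}_2\,|_1 + \boldsymbol{\omega}_1 \times \mathbf{r}\,|_1$$

$$\mathbf{a}_1' = \mathbf{a}_2\,|_1 + \dot{\boldsymbol{\omega}}_1 \times \mathbf{r}\,|_1 - 2\mathbf{v}_2 \times \boldsymbol{\omega}_1\,|_1 + \boldsymbol{\omega}_1 \times (\boldsymbol{\omega}_1 \times \mathbf{r})\,|_1$$

In the $S_1$ frame these quantities are respectively

$$\mathbf{v}_1 = \dot{\mathbf{R}}\,|_1 + \mathbf{v}_1'\,|_1 \quad \text{and} \quad \mathbf{a}_1 = \ddot{\mathbf{R}}\,|_1 + \mathbf{a}_1'\,|_1$$

Finally these quantities with respect to the $S_0$ frame expressed in the unit vectors of $S_1$
will be

$$\mathbf{v}_0 = \mathbf{v}_1\,|_1 + \boldsymbol{\omega}_0 \times (\mathbf{r} + \mathbf{R})\,|_1 \tag{3.33}$$

and

$$\mathbf{a}_0 = \mathbf{a}_1\,|_1 + \dot{\boldsymbol{\omega}}_0 \times (\mathbf{r} + \mathbf{R})\,|_1 + 2\boldsymbol{\omega}_0 \times \mathbf{v}_1\,|_1 + \boldsymbol{\omega}_0 \times (\boldsymbol{\omega}_0 \times (\mathbf{r} + \mathbf{R}))\,|_1 \tag{3.34}$$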

Method B: 

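The composite angular velocity of the whole system with respect to $S_0$ is

$$\boldsymbol{\Omega} = \boldsymbol{\omega}_0\,|_1 + \boldsymbol{\omega}_1\,|_1 \tag{3.35}$$

The angular acceleration of the composite system with respect to $S_0$ is

$$\boldsymbol{\alpha} = \dot{\boldsymbol{\Omega}}\,|_1 + \boldsymbol{\omega}_0 \times \boldsymbol{\Omega}\,|_1 \tag{3.36}$$

Therefore,

$$\mathbf{v}_0 = \dot{\mathbf{R}}\,|_1 + \boldsymbol{\omega}_0 \times \mathbf{R}\,|_1 + \dot{\mathbf{r}}\,|_1 + \boldsymbol{\Omega} \times \mathbf{r}\,|_1 \tag{3.37}$$

Finally we can write

$$\begin{aligned}
\mathbf{a}_0 = {} & \ddot{\mathbf{R}}\,|_1 + \dot{\boldsymbol{\omega}}_0 \times \mathbf{R}\,|_1 + 2\boldsymbol{\omega}_0 \times \dot{\mathbf{R}}\,|_1 + \boldsymbol{\omega}_0 \times (\boldsymbol{\omega}_0 \times \mathbf{R})\,|_1 \\
& + \ddot{\mathbf{r}}\,|_1 + \boldsymbol{\alpha} \times \mathbf{r}\,|_1 + 2\boldsymbol{\Omega} \times \dot{\mathbf{r}}\,|_1 + \boldsymbol{\Omega} \times (\boldsymbol{\Omega} \times \mathbf{r})\,|_1
\end{aligned} \tag{3.38}$$

where $\ddot{\mathbf{r}} = \mathbf{a}_2$ and $\dot{\mathbf{r}} = \mathbf{v}_2$.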
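As a rough numerical illustration of the two routes for the case with a translation, the sketch below evaluates Method A (Eqs 3.33, 3.34) and Method B (Eqs 3.37, 3.38) for one set of arbitrary, assumed component values in the $S_1$ basis. The helper names and all numbers are hypothetical and serve only as a consistency check.

```python
# Consistency check of Method A against Method B when the origins O and O'
# are separated by a time varying translation R. Values are arbitrary.

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(*vectors):
    return tuple(sum(components) for components in zip(*vectors))

def scale(k, a):
    return tuple(k * x for x in a)

def max_diff(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

w0, w0_dot = (0.2, 0.9, -0.4), (0.1, -0.3, 0.05)
w1, w1_dot = (-0.6, 0.5, 1.3), (0.2, 0.15, -0.1)
R, R_dot, R_ddot = (2.0, -1.0, 0.5), (0.3, 0.1, -0.2), (-0.05, 0.4, 0.1)
r, v2, a2 = (0.7, 0.4, -1.1), (0.25, -0.35, 0.6), (0.1, 0.2, -0.3)

# Method A: S2 -> S1' -> S1 -> S0
v1p = add(v2, cross(w1, r))
a1p = add(a2, cross(w1_dot, r), scale(-2.0, cross(v2, w1)),
          cross(w1, cross(w1, r)))
v1 = add(R_dot, v1p)
a1 = add(R_ddot, a1p)
rR = add(r, R)
v0_A = add(v1, cross(w0, rR))
a0_A = add(a1, cross(w0_dot, rR), scale(2.0, cross(w0, v1)),
           cross(w0, cross(w0, rR)))

# Method B: composite angular velocity and acceleration
Omega = add(w0, w1)
alpha = add(w0_dot, w1_dot, cross(w0, Omega))
v0_B = add(R_dot, cross(w0, R), v2, cross(Omega, r))
a0_B = add(R_ddot, cross(w0_dot, R), scale(2.0, cross(w0, R_dot)),
           cross(w0, cross(w0, R)),
           a2, cross(alpha, r), scale(2.0, cross(Omega, v2)),
           cross(Omega, cross(Omega, r)))

print("velocity mismatch:    ", max_diff(v0_A, v0_B))
print("acceleration mismatch:", max_diff(a0_A, a0_B))
```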




Following are two physical examples conforming to the situations described above. 

1. A turnable table fan rotates about a vertical shaft with the blades rotating about an 
origin which does not lie on this shaft. 

2. The earth is rotating about its own axis and at the same time revolving around the sun. 
Any point on the surface of the earth has a complicated motion with respect to the centre 
of the solar system. Generalising further, how does the motion of any point on the surface 
of a rotating planet, such as Jupiter, appear to a person sitting on earth? Such problems 
involve complicated rotations of more than one rotating frame of reference with their origins 
also separated. The methods described in the present section are adequate to tackle such 
complicated problems. One has to follow either method A or method B. 


3.7 SUMMARY 

In order to go from an inertial to a noninertial frame, the initial position vectors $\mathbf{r}_i$, velocities
$\mathbf{v}_i$ in the Lagrangian scheme and $\mathbf{r}_i$, $\mathbf{v}_i$ and $\mathbf{a}_i$, the acceleration in the Newtonian scheme
are to be expressed in terms of the respective quantities in the noninertial frame.

For frames rotating about a fixed point, the position vectors like any other vector in the 
two frames are identical, except that the components of the vectors along the respective 
coordinate axes (Cartesian) would be different. But if we consider any given vector, say $\mathbf{G}$,
and then take the time derivative of the respective components in the two frames rather than
the time derivative of the vector as a whole, the answers would differ by a term $\boldsymbol{\omega} \times \mathbf{G}$
as shown in Eq. (3.3). It is this extra term that finally brings about differences in the 
expressions for the acceleration in the two frames. 

The fictitious force terms that appear with reference to an observer in the rotating frame 
of reference are readily classified into centrifugal force, Coriolis force and Euler force — the 
first one being position dependent, the second one being velocity dependent and the third 
one being due to nonuniformity of rotation of the rotating frame, if any. 

The centrifugal force can readily be grouped into the class of gravity as it can be derived 
from an effective ordinary potential energy function. The Coriolis force is however found to 
be gyroscopic in nature, and therefore is not capable of doing work. It can be derived from 
a vector potential instead. One can draw a nice analogy between these inertial force terms 
and the components of the Lorentz force on electrically charged particles. The centrifugal
force resembles the electric force, and the Coriolis force is the exact replica of the magnetic
force. In fact the link between the two is quite realistic in the sense that magnetic field itself
originates in circulation of electric charge, and that the Larmor frequency is directly linked
with our $\omega$.

The effect of the Coriolis force due to rotation of the earth on projectiles, large scale air 
circulation on the surface of the earth, flow of rivers, etc. is studied in detail. The general 
rule is that in the northern hemisphere, the Coriolis deflection on a horizontally moving 
object takes place always to the right of the instantaneous direction of motion. Foucault 
had constructed a huge pendulum and demonstrated for the first time in 1851 that the 
earth is rotating with the same angular speed of rotation as was already inferred from the 
apparent diurnal motion of the stars around the earth. 

More general cases of rotations, such as two rotations about a common point, or two 
rotations separated by a time varying translation are considered, and general methods of 
handling such, and even more complicated, situations are outlined. 


PROBLEMS 


3.1 A smooth disc is rotating in a horizontal plane with uniform angular speed $\omega$ about
a vertical axis passing through its centre. A particle is allowed to slide on the disc
with negligible friction. Since the motion is in two dimensions and takes place in a
rotating frame, analyse the motion from the point of view of the disc using a complex
variable $z = x + iy$ for denoting the position of the particle on the disc. Show that
the equation of motion in the rotating frame reduces to $\ddot{z} + 2i\omega\dot{z} - \omega^2 z = 0$.
Find the solution for the track if the initial position and velocities are supplied.


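3.2 Show that two infinitesimal rotations $\delta\boldsymbol{\theta}_1\ (= \delta\theta_1\hat{\mathbf{n}}_1)$ and $\delta\boldsymbol{\theta}_2\ (= \delta\theta_2\hat{\mathbf{n}}_2)$ commute,
but not the finite ones, say $\boldsymbol{\theta}_1$ and $\boldsymbol{\theta}_2$, where $\boldsymbol{\theta} = \theta\hat{\mathbf{n}}$ means rotation by an angle
$\theta$ about an axis implied by the direction of the unit vector $\hat{\mathbf{n}}$. Since the infinitesimal
rotations commute, construct a finite rotation $\boldsymbol{\theta}$ by superposing a large number $N$
of infinitesimal rotations $\Delta\boldsymbol{\theta} = \boldsymbol{\theta}/N$. Following this procedure show that in the limit
$N \to \infty$, a finite rotation of $\mathbf{r}_0$ by $\boldsymbol{\theta}$ leads to $\mathbf{r}$ such that

$$\mathbf{r} = \mathbf{r}_0 + \frac{\sin\theta}{\theta}(\boldsymbol{\theta} \times \mathbf{r}_0) + \frac{1 - \cos\theta}{\theta^2}(\boldsymbol{\theta} \times (\boldsymbol{\theta} \times \mathbf{r}_0))$$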


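or in matrix notation $\mathbf{r} = S\mathbf{r}_0$, where $S$ is given by a matrix

$$S_{ij} = \cos\theta\,\delta_{ij} + \frac{\sin\theta}{\theta}\,\epsilon_{ijk}\theta_k + \frac{1 - \cos\theta}{\theta^2}\,\theta_i\theta_j$$

Show further that $S$ is orthogonal, that is, $S^{T}S = S^{-1}S$, and that $\dot{S}S^{-1} = \Omega$ is
an antisymmetric matrix. Since all $3 \times 3$ antisymmetric matrices can be represented
by an axial vector, in this case say by $\boldsymbol{\omega}$, such that $\epsilon_{ijk}\,\omega_j =$

Prove that $\dot{\mathbf{r}} = \Omega\mathbf{r} = \boldsymbol{\omega} \times \mathbf{r}$, where $\boldsymbol{\omega}$ is in general not the same as $d\boldsymbol{\theta}/dt$. It is
given by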



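3.3 Using the Gantmakher formula, derive the above relation between $\mathbf{r}$ and $\mathbf{r}_0$ connected
by a finite rotation $\boldsymbol{\theta} = \theta\hat{\mathbf{n}}$, that is, the relation

$$\mathbf{r} = \mathbf{r}_0 + \frac{\sin\theta}{\theta}(\boldsymbol{\theta} \times \mathbf{r}_0) + \frac{1 - \cos\theta}{\theta^2}(\boldsymbol{\theta} \times (\boldsymbol{\theta} \times \mathbf{r}_0))$$

Use the Gantmakher formula to write the solution for the motion of a charged particle in
a constant electric field $\mathbf{E}$ and a constant magnetic field $\mathbf{B}$. Find the motion for the
case with $\mathbf{B} = B_z\hat{\mathbf{k}}$, $\mathbf{E} = E_x\hat{\mathbf{i}} + E_y\hat{\mathbf{j}}$, $\mathbf{v}_0 = v_0\hat{\mathbf{k}}$, $\mathbf{r}_0 = 0$.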

3.4 Find the Lagrangian of a rigid symmetrical dumbbell (that is, two equal point masses
connected by a rigid and massless rod) rotating freely about its centre of mass while the
centre of mass is moving in a circular track, not necessarily horizontal.

3.5 Using the principle of conservation of angular momentum about the centre of the 
earth, show that the stone dropped from rest and from a height h above the ground 
will produce the same easterly deviation as would be given by a consideration of the 
Coriolis force. 


3.6 The Coriolis deflection of a falling stone might be significantly affected by the presence
of air drag. Assume a quadratic drag force $f = kv^2$ and show that in fact the
ratio of the modified displacement $\delta$ and the displacement without any air drag $\delta_0$
is given by



“ 1 + 20“ A 


for ah < 1 


where $\alpha = 2k/m$, $m$ = mass of the stone, $h$ = height above the ground from
which the stone is dropped. For a stone of radius 1 cm and density 2600 kg/m$^3$,
$\alpha \approx 0.02$ m$^{-1}$. Find the ratio $\delta/\delta_0$ for $h = 200$ m.


3.7 If a projectile is fired due east from a point on the surface of the earth at a geographical
latitude $\lambda$, with a velocity $v_0$ and at an angle of elevation above the horizontal of $\alpha$,
show that the lateral deflection of the projectile when the projectile strikes the earth
is



where $\omega$ is the angular frequency of the earth and $g$ is the acceleration due to gravity.
If the range of the projectile is $R_0$ for the case $\omega = 0$, show also that the change of
range due to the rotation of the earth is


$$R - R_0 = \sqrt{\frac{2R_0^3}{g}}\;\omega\cos\lambda\left(\cot^{1/2}\alpha - \frac{1}{3}\tan^{3/2}\alpha\right)$$




3.8 Find the velocity and acceleration of the tips of the horizontal and vertical blades
of an ordinary revolving table fan. Assume that the blades revolve with a constant
angular speed $\omega_1$, and that the horizontal shaft holding the motor rotates about a
vertical axis sinusoidally with an angular speed $\omega = \omega_0\cos\Omega t$.

3.9 Suppose we have an inertial system $S_0$ with respect to which another system $S_1$ is
rotating about their common z-axis with an angular speed $\dot{\phi}$ in their x-y plane. A
further system $S_2$ is rotating about the common x-axis of $S_1$ and $S_2$ with an angular
speed $\dot{\theta}$ in their common plane of y-z axes. Find the net angular velocity $\boldsymbol{\Omega}$ and
net angular acceleration $\boldsymbol{\alpha}$ in the basis sets of $S_0$, $S_1$, and $S_2$.

3.10 A perfectly elastic ping pong ball is colliding back and forth along a horizontally 
aligned diameter of a hollow sphere. Show that due to the Coriolis force acting on the 
ball at that place (due to the earth’s rotation), the path of the ping pong ball will be 
rotating about the centre of the sphere at a rate exactly twice the rate of rotation of 
the plane of oscillation of Foucault’s pendulum, over a place of the same geographical 
latitude. 





4 

Central Force 


4.0 INTRODUCTION 

Central force is one of the oldest and richest topics of classical mechanics. The first two 
correct laws of force, Hooke’s law of elasticity and Newton’s inverse square law of gravitation 
are central forces by nature. Coulomb's electrostatic force between two charges, the van der
Waals forces between neutral atoms and molecules in a gas, or even the Yukawa force between
the nucleons in the nucleus of atoms are but other examples of central forces. 

Details of the central forces are usually taught at lower levels, except for adequate
emphasis on phenomena like tides, dynamical manipulations of the orbits of spaceships in this
space age, the geometry of the orbits of planets, natural examples of virialised systems, etc.,
which are duly covered in this chapter. Of course, for the sake of completeness, the
planetary laws of motion, the closure properties of orbits under central forces and the dynamics
of collisions and scattering are included.

Practically all great minds have, at one time or another, explored the problems of central 
force. Thus we find Kepler in Kepler’s laws of motion and Kepler’s equation, Newton in 
prescribing the law of gravitation, Bernoulli, Laplace, Hamilton and Lenz in finding out 
all the constants of motion for Keplerian orbits, Bertrand to find the conditions of closure 
under the action of general central forces, Halley, Euler and Gauss to study the orbits of 
planets and comets, Laplace to prove the stability of the solar system, Lagrange to solve 
three body problems, D’Alembert to solve the precession of earth’s axis of rotation, Jacobi 
to formulate the inverse square law problems in parabolic coordinates, Darwin to tackle the 
problem of tides, Delaunay to calculate precise orbit of the moon, Rutherford to study the 
scattering under inverse square law of forces, Clausius to find virial properties of central 
force systems, Poisson to give the differential equation for gravitational potential, and so 
on. 


4.1 DEFINITION AND PROPERTIES OF THE CENTRAL FORCE 

In this chapter we deal with a class of force laws having a particular kind of dependence
on space variables $(r, \theta, \phi)$. We start with the definition of a field of force, or in short, a
force field.

By a force field, or for that matter any vector field, we mean a rule (equivalently, a vector
function $\mathbf{f} : R^3 \to R^3$) which assigns a unique force to every point in space, or in a
specific domain, if it is not ubiquitous.

If the force field is derivable from a scalar potential energy field, then we have an ordinary
potential energy function $V(\mathbf{r})$ defined in all real space, or if restricted, on the domain of
$\mathbf{f}(\mathbf{r})$, such that

$$\mathbf{f}(\mathbf{r}) = -\nabla V(\mathbf{r}) \tag{4.1}$$

We are interested in the case where the potential energy function is a function of the scalar
distance $r = |\mathbf{r}|$ from a fixed point in space. This fixed point is the source of the force field
and is called the centre of force. If we choose this fixed point to be the origin, the potential
energy function satisfies

$$V(\mathbf{r}) = V(|\mathbf{r}|) \tag{4.2}$$

where $\mathbf{r}$ is the position vector of any arbitrary point. When this happens we say that the
potential energy function is centrally symmetric and the corresponding force field is central.


Using polar coordinates to describe the central force field, Eq. (4.1), coupled with Eq.
(4.2) reduces to

$$\mathbf{f}(\mathbf{r}) = -\frac{dV(r)}{dr}\,\hat{\mathbf{r}} \tag{4.3}$$

that is, the force is always directed towards or away from the centre (origin). We denote the
magnitude of the central force at $r$ by $f(r)$. In general any given $f(r)$ could be uniquely
represented by a power series in $r$, that is,

$$f(r) = \sum_{n=-\infty}^{\infty} k_n r^n \tag{4.4}$$

where the $k_n$'s are either constants including zeros, or at most functions of time.


4.1.1 Properties of the Conservative Central Force 


If a central force does not depend on time explicitly, it is called a conservative central force.

1. Such central forces preserve homogeneity of time which implies the existence of an
energy integral, that is, the total energy of a system driven by the field of a central force is
a constant of motion. In order to see this more explicitly, we take a scalar product of the
equation of motion $m\dot{\mathbf{v}} = -\nabla V$, which follows from Eq. (4.1), and the velocity $\mathbf{v}$ of any
particle in the system to get

$$m\dot{\mathbf{v}} \cdot \mathbf{v} + \nabla V \cdot \mathbf{v} = 0$$

or

$$\frac{d}{dt}\left(\frac{1}{2}mv^2 + V\right) = 0$$

which implies that the sum of the kinetic energy

$$T = \frac{1}{2}mv^2$$

and the potential energy ($= V$) is a constant, say $E$, such that

$$T + V = E$$

2. Only radial ($r$) dependence and no angular ($\theta, \phi$) dependence in $f(r)$ implies that the
isotropy of space is preserved by $f(r)$ about its origin. This means that the total angular
momentum of the system about the origin is conserved. To see this explicitly we note that
the torque about the origin ($\boldsymbol{\tau} = \mathbf{r} \times \mathbf{f}(\mathbf{r})$) vanishes; hence the angular momentum
$\mathbf{h}\ (= m(\mathbf{r} \times \mathbf{v}))$ is a constant of motion.

3. Suppose the initial velocity of a particle moving under central force is not parallel
to the force direction. The force is always directed towards a fixed point in space, so it
has no component perpendicular to the plane defined by the initial velocity vector and the
direction of the force. Hence the particle continues to move in this plane only. Thus motion
under central force is planar. A simple proof of this would be as follows: since the angular
momentum is conserved, $\mathbf{h} = m\,\mathbf{r} \times \mathbf{v}$ is a constant vector, implying that both $\mathbf{r}$ and $\mathbf{v}$
must always lie in the plane perpendicular to the fixed vector $\mathbf{h}$.

4. Once the orbital plane is known, the complete trajectory of the particle moving
under the central force field is described by any two independent coordinates as parametric
functions of time such as $r(t)$ and $\theta(t)$. However one can eliminate $t$ from these two relations
to get $r(\theta)$, which is just the equation of the orbit. The equation of the orbit, of course,
cannot tell us where the particle is at any instant $t$.

5. The initial position and velocity $(\mathbf{r}_0, \mathbf{v}_0)$; or the total energy, the angular momentum
and the initial position of the particle in the plane of motion $(E, \mathbf{h}, r_0, \theta_0)$; or the six orbital
elements ($a, e, \omega, i, \Omega, T$; see article 4.8 for definition) fix the whole problem to a specific
state of motion. In other words, any six independent constants are required for a complete
description of any particle's motion under any central force.
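Properties 1 to 3 above can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it takes an attractive inverse square force $\mathbf{f} = -k\mathbf{r}/r^3$ with $m = k = 1$, an arbitrary initial state, and a velocity Verlet integrator (a choice made here, not something prescribed by the text). The function names are hypothetical.

```python
import math

# Integrate one particle in the central field f = -k r / |r|^3 (assumed
# example) and monitor the energy E, the angular momentum h = m r x v,
# and the distance of the particle from the initial orbital plane.

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def accel(r, k=1.0, m=1.0):
    d = math.sqrt(dot(r, r))
    return tuple(-k * x / (m * d ** 3) for x in r)

def energy(r, v, k=1.0, m=1.0):
    return 0.5 * m * dot(v, v) - k / math.sqrt(dot(r, r))

def integrate(r, v, dt, steps):
    """Velocity Verlet: half kick, drift, half kick."""
    a = accel(r)
    for _ in range(steps):
        v_half = tuple(vi + 0.5 * dt * ai for vi, ai in zip(v, a))
        r = tuple(ri + dt * vi for ri, vi in zip(r, v_half))
        a = accel(r)
        v = tuple(vi + 0.5 * dt * ai for vi, ai in zip(v_half, a))
    return r, v

r0, v0 = (1.0, 0.0, 0.0), (0.0, 0.8, 0.3)   # arbitrary bound initial state
E0, h0 = energy(r0, v0), cross(r0, v0)

r1, v1 = integrate(r0, v0, dt=1e-3, steps=20000)
E1, h1 = energy(r1, v1), cross(r1, v1)

print("energy drift:          ", abs(E1 - E0))
print("angular momentum drift:", max(abs(x - y) for x, y in zip(h0, h1)))
print("out-of-plane offset:   ", abs(dot(r1, h0)))
```

The angular momentum and the out-of-plane offset stay at rounding level because each kick is parallel to $\mathbf{r}$ and each drift is parallel to $\mathbf{v}$; the energy is conserved only up to the truncation error of the integrator.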


4.2 TWO-BODY CENTRAL FORCE PROBLEM 

Consider the motion of two particles, each of which is a source of a central force field
and the potential energy of the system is a function only of their separation, that is, $V =
V(|\mathbf{r}_1 - \mathbf{r}_2|)$, where $\mathbf{r}_1$ and $\mathbf{r}_2$ are the position vectors of particles number 1 and 2 having
masses $m_1$ and $m_2$ respectively (see Fig. 4.1).

The position vector of the centre of mass ($\mathbf{R}$) is defined through the equation

$$m_1\mathbf{r}_1 + m_2\mathbf{r}_2 = (m_1 + m_2)\mathbf{R} \tag{4.5}$$

and the relative position vector of the particle 2 with respect to particle 1 is

$$\mathbf{r} = \mathbf{r}_2 - \mathbf{r}_1 \tag{4.6}$$





Fig. 4.1 Motion of two point masses under the action of a central 
force between them 

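The Lagrangian of the system is given by

$$L = \frac{1}{2}\left(m_1|\dot{\mathbf{r}}_1|^2 + m_2|\dot{\mathbf{r}}_2|^2\right) - V(|\mathbf{r}_1 - \mathbf{r}_2|) \tag{4.7}$$

We can eliminate $\mathbf{r}_1$ and $\mathbf{r}_2$ from Eq. (4.7) and write it in terms of $\mathbf{r}$ and $\mathbf{R}$ using Eqs
(4.5) and (4.6) and get the Lagrangian in the form

$$L = \frac{1}{2}M\dot{\mathbf{R}}^2 + \frac{1}{2}\mu\dot{\mathbf{r}}^2 - V(r) \tag{4.8}$$

where $M = m_1 + m_2$ is the total mass of the system, and

$$\mu = \frac{m_1m_2}{m_1 + m_2} = \left(\frac{1}{m_1} + \frac{1}{m_2}\right)^{-1} \tag{4.9}$$

is called the reduced mass of the system.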
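The step from Eq. (4.7) to Eq. (4.8) rests on the kinetic energy splitting into a centre-of-mass part and a relative part. A small numerical sketch of that identity follows; the masses and velocities are arbitrary assumed values and the variable names are hypothetical.

```python
# Check that (1/2) m1 v1^2 + (1/2) m2 v2^2 equals
# (1/2) M Rdot^2 + (1/2) mu rdot^2 for arbitrary masses and velocities.

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

m1, m2 = 2.0, 5.0                       # assumed masses
v1 = (0.3, -1.0, 0.4)                   # velocity of particle 1
v2 = (-0.7, 0.2, 1.1)                   # velocity of particle 2

M = m1 + m2                             # total mass
mu = m1 * m2 / M                        # reduced mass, Eq. (4.9)

# Time derivatives of Eqs (4.5) and (4.6)
R_dot = tuple((m1 * a + m2 * b) / M for a, b in zip(v1, v2))
r_dot = tuple(b - a for a, b in zip(v1, v2))

T_lab = 0.5 * (m1 * dot(v1, v1) + m2 * dot(v2, v2))
T_split = 0.5 * M * dot(R_dot, R_dot) + 0.5 * mu * dot(r_dot, r_dot)

print(T_lab, T_split)
```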

Since $L$ is cyclic in $\mathbf{R}$, $\dot{\mathbf{R}}$ is a constant of motion and therefore the centre of mass can
act as the origin of an inertial system. Again, since $\dot{\mathbf{R}}$ is a constant of motion, $\frac{1}{2}M\dot{\mathbf{R}}^2$ is
a constant which can be dropped from Eq. (4.8). Since $\mathbf{r}$ is measured from particle 1, the
latter serves as the origin of a noninertial frame. This does not bother us, however, because
we have first constructed the Lagrangian in an inertial frame (Eq. (4.7)) and then expressed
it in terms of quantities defined with respect to a noninertial frame. Dropping the constant
term $\frac{1}{2}M\dot{\mathbf{R}}^2$ from Eq. (4.8) we can thus write,

$$L = \frac{1}{2}\mu\dot{\mathbf{r}}^2 - V(r) \tag{4.10}$$

where $\mathbf{r}$ is given by Eq. (4.6) and $r = |\mathbf{r}|$.




The above form of Lagrangian is such that it effectively corresponds to single particle 
motion with an effective mass equal to the reduced mass of the system, and the source 
of the central force seems to act effectively as an immovable source situated at the origin.
Actually this origin is moving with nonuniform velocity and acceleration. However, one may 
not be aware of this fact merely by looking at the explicit form of the Lagrangian given by 
Eq. (4.10). 

Because of the conservation of angular momentum (see Section 4.1.1), the orbit must lie in a
plane. Expressing Eq. (4.10) in the plane polar coordinates $(r, \theta)$ defined in the plane of
the orbit,

$$L = \frac{1}{2}\mu\left(\dot{r}^2 + r^2\dot{\theta}^2\right) - V(r) \tag{4.11}$$

Here $\theta$ is a cyclic coordinate, therefore the generalised momentum conjugate to $\theta$ is
conserved. We have,

$$p_\theta = \frac{\partial L}{\partial\dot{\theta}} = \mu r^2\dot{\theta} = \text{const.} = h \ \text{(say)} \tag{4.12}$$

As expected, $p_\theta$, the angular momentum of the system by definition, is conserved.
Euler-Lagrange's equation of motion in $r$ is

$$\frac{d}{dt}(\mu\dot{r}) - \mu r\dot{\theta}^2 + \frac{\partial V}{\partial r} = 0 \tag{4.13}$$

This is a second order differential equation in $r$, and hence needs to be integrated twice
in order to obtain the complete solution. However, it is always profitable to look for the
existence of the first integrals of motion, and if they exist, one can take, for example, the
energy integral, which is essentially a first order differential equation, and solve for the
motion. The first integrals require the specification of the values of the respective integrals,
so one set of initial conditions are in fact already utilised through these, namely $h$ and $E$.
The actual energy integral for the above problem is (see Eq. (2.60)) given by

$$E = \dot{r}\frac{\partial L}{\partial\dot{r}} + \dot{\theta}\frac{\partial L}{\partial\dot{\theta}} - L = \frac{1}{2}\mu\left(\dot{r}^2 + r^2\dot{\theta}^2\right) + V(r)$$

Since $\dot{\theta}$ and $r$ are related through Eq. (4.12) ($\dot{\theta} = h/\mu r^2$) we can reduce the energy
integral to one corresponding to a one-dimensional motion in $r$

$$E = \frac{1}{2}\mu\dot{r}^2 + V_{\text{eff}}(r) \tag{4.14}$$

where

$$V_{\text{eff}}(r) = V(r) + \frac{h^2}{2\mu r^2} \tag{4.15}$$

Equation (4.14) for energy suggests that the radial kinetic energy is $\frac{1}{2}\mu\dot{r}^2$ and the
effective potential energy for the radial motion is $V_{\text{eff}}(r)$. It consists of two parts namely,
$V(r)$, which is the actual potential energy and $\frac{1}{2}(h^2/\mu r^2) = \frac{1}{2}\mu r^2\dot{\theta}^2$ which is the
centrifugal potential energy for the radial motion (compare with Eq. (3.10)).

The centrifugal potential energy increases indefinitely as $1/r^2$ for $r \to 0$ (see Fig. 4.2).
For attractive forces $V(r)$ is negative for all values of $r$ and asymptotically vanishes as
$r \to \infty$.
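To make Eqs (4.14) and (4.15) concrete, the following sketch evaluates the effective potential for an attractive inverse-distance potential $V(r) = -k/r$. This particular $V(r)$, the values $\mu = k = h = 1$ and $E = -0.3$, and the function names are assumptions chosen for illustration. The turning points are the roots of $E = V_{\text{eff}}(r)$, which for this potential is the quadratic $Er^2 + kr - h^2/2\mu = 0$.

```python
import math

# Effective potential V_eff(r) = -k/r + h^2 / (2 mu r^2) for an assumed
# attractive potential V(r) = -k/r, with its minimum and turning points.

def v_eff(r, h, mu=1.0, k=1.0):
    return -k / r + h * h / (2.0 * mu * r * r)

def circular_radius(h, mu=1.0, k=1.0):
    """Radius at which dV_eff/dr = 0 (minimum of V_eff)."""
    return h * h / (mu * k)

def turning_points(E, h, mu=1.0, k=1.0):
    """Roots of E = V_eff(r); both exist only for V_eff minimum <= E < 0."""
    disc = k * k + 2.0 * E * h * h / mu
    if E >= 0.0 or disc < 0.0:
        raise ValueError("no bounded radial motion for these E and h")
    root = math.sqrt(disc)
    return (-k + root) / (2.0 * E), (-k - root) / (2.0 * E)

h, E = 1.0, -0.3
r_c = circular_radius(h)
r_min, r_max = turning_points(E, h)

print("minimum of V_eff at r =", r_c, "with value", v_eff(r_c, h))
print("radial motion confined to", r_min, "<= r <=", r_max)
```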



Fig. 4.2 The monotonic gravitational (dashed line) and centrifugal (solid line) potential
energies of a particle as functions of r, for its motion considered in the radial
coordinate only, even though the actual motion takes place in a 2-D plane. The
sum of the two can have a minimum with a finite negative value, thus allowing
a range of bounded orbits


A motion is called bounded in $r$ if $\dot{r}$ vanishes at the extreme values of $r$, say $r = r_{\min}$
and $r = r_{\max}$. Both of these bounds must exist for a bounded motion. Thus from equation
(4.14) $E = V_{\text{eff}}(r)$ for both $r = r_{\max}$ and $r = r_{\min}$. Next, note that

$$E - V_{\text{eff}}(r) = \frac{1}{2}\mu\dot{r}^2 \ge 0$$

for all $r$. Therefore for any physically possible radial motion we must have

$$V_{\text{eff}}(r) \le E$$

for every value of $r$ accessible to the system. In other words, some portion of the $V_{\text{eff}}(r)$ curve
must lie below the line $V = E$ in order to have an allowed radial motion.




4.3 STABILITY OF ORBITS 


By an orbit we mean a scheduled path of any object moving under a central force. An orbit 
is called stable if, when a slight perturbation is given to the initial position, the orbit is 
perturbed only slightly. The perturbation is usually given to the radial coordinate keeping 
either the energy or the angular momentum unchanged. 

The condition for stability in radial motion is given by the existence of a local minimum
in $V_{\text{eff}}(r)$, that is, we require

$$\frac{\partial^2 V_{\text{eff}}(r)}{\partial r^2} > 0 \ \text{ at the value of } r, \text{ say } r = r_0, \text{ given by } \ \frac{\partial V_{\text{eff}}(r)}{\partial r} = 0 \tag{4.16}$$

Suppose that for a central force the potential energy function is $V(r) = br^{n+1}$, $b$ being a constant,
and the centrifugal potential energy is $V_{\text{cf}}(r) = ar^{-2}$ ($a > 0$), where $a$ is again a constant,
and that $\partial V_{\text{eff}}/\partial r = 0$ at $r = r_0$, so that

$$(n + 1)\,b = 2a\,r_0^{-(n+3)}$$

Hence

$$\left[\frac{\partial^2 V_{\text{eff}}}{\partial r^2}\right]_{r = r_0} = 2a\,r_0^{-4}\,(3 + n)$$

Therefore, any circular orbit with $r = r_0$ under any central force can satisfy the stability
condition if

$$n > -3$$

This can also be proved from more elementary considerations given below without using 
the conditions given in Eq. (4.16). 

The second order differential equation for an orbit under any central force $f(r)$, obtained
from Eq. (4.13) is given by

$$\frac{d^2u}{d\theta^2} + u = -\frac{\mu F(u)}{h^2u^2} \tag{4.17}$$

where $u = 1/r$ and $F(u) = f(1/u)$.

Assume a circular orbit so that $r = r_0$ which means $u = u_0 = r_0^{-1}$. This gives, for
the total energy,

$$E = (V_{\text{eff}})_0 = \frac{h^2}{2\mu}u_0^2 + V_0$$

and from Eq. (4.17)

$$u_0 = -\frac{\mu F(u_0)}{h^2u_0^2} \tag{4.18}$$


Now we add a small perturbation to $u$, given by $u = u_0 + \xi$ ($\xi \ll u_0$).
The equation of the orbit becomes,

$$\frac{d^2\xi}{d\theta^2} + u_0 + \xi = -\frac{\mu F(u_0 + \xi)}{h^2(u_0 + \xi)^2}$$

Expanding $F(u_0 + \xi)$ in Taylor's series around $u_0$ we get

$$\frac{d^2\xi}{d\theta^2} + u_0 + \xi = -\frac{\mu\left[F(u_0) + \xi F'(u_0) + (\xi^2/2)F''(u_0) + \cdots\right]}{h^2\left(u_0^2 + 2\xi u_0 + \xi^2\right)}$$

$$= -\frac{\mu F(u_0)}{h^2u_0^2}\left[1 + \xi\left(\frac{F'(u_0)}{F(u_0)} - \frac{2}{u_0}\right) + \cdots\right]$$

Keeping the angular momentum constant and all the terms up to only the first order in $\xi$,
the above equation becomes, using Eq. (4.18),

$$\frac{d^2\xi}{d\theta^2} + A\xi = 0 \tag{4.19}$$

where

$$A = 1 + \frac{\mu F(u_0)}{h^2u_0^2}\left(\frac{F'(u_0)}{F(u_0)} - \frac{2}{u_0}\right) \tag{4.20}$$

or using Eq. (4.18) again,

$$A = 1 - u_0\left(\frac{F'(u_0)}{F(u_0)} - \frac{2}{u_0}\right) = 3 - u_0\,\frac{F'(u_0)}{F(u_0)} \tag{4.21}$$

The general solution of Eq. (4.19) is

$$\xi = \begin{cases}
C_1\cos(\sqrt{A}\,\theta) + C_2\sin(\sqrt{A}\,\theta) & \text{for } A > 0 \\
C_1\cosh(\sqrt{-A}\,\theta) + C_2\sinh(\sqrt{-A}\,\theta) & \text{for } A < 0 \\
C_1\theta + C_2 & \text{for } A = 0
\end{cases}$$

Of the above solutions only the first one remains finite while the others increase indefinitely
with $\theta$. Therefore, a circular orbit of radius $r_0 = 1/u_0$ is stable if and only if $A > 0$, that
is,

$$\left[\frac{d\ln F(u)}{du}\right]_{u = u_0} < \frac{3}{u_0} = 3r_0$$

Now, if $F(u) = Ku^{-n}$ then $uF'(u)/F(u) = -n$, where $K$ and $n$ are constants.

This means that for $f(r) = kr^n$, the circular orbits are stable, if and only if $n > -3$.
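The stability criterion of Eq. (4.21) is easy to evaluate for a given force law. The sketch below is a hedged illustration only: the function names, the central-difference derivative and the choice $u_0 = 0.7$ are assumptions, and the power law $F(u) = Ku^{-n}$ corresponds to $f(r) = Kr^n$.

```python
# Evaluate A = 3 - u0 F'(u0) / F(u0) of Eq. (4.21) with a numerical derivative.

def stability_parameter(F, u0, du=1e-6):
    """Return A for the force law F(u) = f(1/u) at the circular orbit u0."""
    dF = (F(u0 + du) - F(u0 - du)) / (2.0 * du)   # central difference
    return 3.0 - u0 * dF / F(u0)

def power_law(n, K=-1.0):
    """F(u) = K u^(-n), that is f(r) = K r^n (K < 0 means attraction)."""
    return lambda u: K * u ** (-n)

for n in (-4, -2, -1, 1):
    A = stability_parameter(power_law(n), u0=0.7)
    verdict = "stable" if A > 0 else "unstable"
    print("n =", n, " A =", round(A, 6), verdict)
```

For a pure power law the result is $A = 3 + n$ independently of $u_0$, so the loop is expected to report instability only for $n = -4$.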


4.4 CONDITIONS FOR CLOSURE 

An orbit is said to be closed if the particle eventually retraces its path (orbit). Or, in other
words, closure of an orbit requires the period of radial oscillation to match with that of the
$\theta$ oscillation. By the time $u$ or $r$ returns to its original value, $\theta$ must complete an integral
number of revolutions. This means that $\sqrt{A}$ in Eq. (4.21) must be a rational number, say
$p/q$, where $p$ and $q$ are integers. In that case after $q$ revolutions of the radius vector (that
is, the rotation in $\theta$ by $2\pi q$), the value of $u$ completes $p$ oscillations about its mean value
$u_0$. However, this condition is not sufficient for closure of a general noncircular orbit. To
obtain the relevant conditions one has to consider arbitrarily large deviations from a circular
orbit. These, in turn, impose further restrictions which the orbit must satisfy, in order to
be closed.

It was proved by Bertrand, in 1873, that stable as well as closed orbits are possible only
for $A = 1$ and $A = 4$. This fact is known as Bertrand's theorem. The proof can be
found in Appendix A of Goldstein's book. Referring to the end of the previous section we
note that $A = 1$ corresponds to $n = -3 + A = -2$, that is, $f(r) \propto 1/r^2$, which is
the familiar inverse square law. The case $A = 4$ corresponds to $n = -3 + A = 1$ or,
$f(r) \propto r$ which is Hooke's law or the law of harmonic forces.

Let us summarise the above in the following points: 

1. All bounded orbits are closed only for the inverse square law of force of the Coulombian 
or Newtonian type and for the linear laws of force of Hooke’s type. 

2. $A = 1$ implies inverse square law, for which one oscillation in $r$ is completed as
soon as $\theta$ changes by $2\pi$. Thus the radial and angular oscillations are degenerate. However
for Hooke's type of the laws of forces $\sqrt{A} = 2$, so that one complete rotation in $\theta$ by $2\pi$
implies two complete radial oscillations.

3. Both gravitation and electrostatic attraction provide situations where both these force 
laws are realised; an inverse square law outside any spherically symmetric homogeneous 
body (uniformly charged or neutral) and the Hooke’s law inside it. 

4. The condition for bounded motion is that there is a bounded domain of $r$ in which
$V_{\text{eff}}(r) \le E$, the energy. The condition for stability of circular orbits is $n > -3$, where
$f(r) \propto r^n$. The closed orbits exist only for $n = 1$ and $n = -2$.
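The link between $\sqrt{A}$ and closure summarised above can be probed numerically by integrating the orbit equation (4.17) from one apsis to the next. The following is a minimal sketch under stated assumptions: units with $\mu = h = 1$, an attractive force $f(r) = -r^n$ (so the circular orbit sits at $u_0 = 1$), a slightly perturbed start $u = 1.01$, and a fourth order Runge-Kutta stepper chosen here for convenience. The function name is hypothetical.

```python
import math

# Angle between successive apsides for f(r) = -r^n with mu = h = 1.
# Eq. (4.17) then reads u'' = -u + u^(-n-2), primes denoting d/dtheta.

def apsidal_angle(n, u_start=1.01, dtheta=1e-3, max_theta=20.0 * math.pi):
    if n <= -3:
        raise ValueError("circular orbits are unstable for n <= -3")

    def rhs(u, w):
        return w, -u + u ** (-n - 2)      # (du/dtheta, dw/dtheta)

    u, w, theta = u_start, 0.0, 0.0       # start at an apsis: du/dtheta = 0
    while theta < max_theta:
        k1u, k1w = rhs(u, w)
        k2u, k2w = rhs(u + 0.5 * dtheta * k1u, w + 0.5 * dtheta * k1w)
        k3u, k3w = rhs(u + 0.5 * dtheta * k2u, w + 0.5 * dtheta * k2w)
        k4u, k4w = rhs(u + dtheta * k3u, w + dtheta * k3w)
        u_new = u + dtheta * (k1u + 2 * k2u + 2 * k3u + k4u) / 6.0
        w_new = w + dtheta * (k1w + 2 * k2w + 2 * k3w + k4w) / 6.0
        if w < 0.0 and w_new >= 0.0:
            # du/dtheta changes sign: next apsis, located by linear interpolation
            return theta + dtheta * (-w) / (w_new - w)
        u, w, theta = u_new, w_new, theta + dtheta
    return None                           # no apsis found within max_theta

for n in (-2, 1, -1):
    print("n =", n, " apsidal angle / pi =", apsidal_angle(n) / math.pi)
```

For nearly circular orbits the apsidal angle should approach $\pi/\sqrt{A} = \pi/\sqrt{3 + n}$: a rational multiple of $\pi$ for $n = -2$ and $n = 1$, but $\pi/\sqrt{2}$ for $n = -1$, which is why such an orbit does not close.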


4.5 INTEGRABLE POWER LAWS OF THE CENTRAL FORCE 

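We now obtain a first order differential equation for the orbit and write its formal solution.
We start with the energy integral,

$$E = \frac{1}{2}\mu\left(\dot{r}^2 + r^2\dot{\theta}^2\right) + V(r)$$

We can write

$$\dot{r} = \frac{dr}{d\theta}\,\dot{\theta} = \frac{dr}{d\theta}\left(\frac{h}{\mu r^2}\right)$$

so that

$$E = \frac{1}{2}\left(\frac{h^2}{\mu r^4}\right)\left(\frac{dr}{d\theta}\right)^2 + \frac{1}{2}\frac{h^2}{\mu r^2} + V(r)$$

or

$$\frac{dr}{d\theta} = \sqrt{\frac{2\mu r^4}{h^2}\left[E - V(r)\right] - r^2}$$

Its formal solution can be written as

$$\theta = \theta_0 + \int_{r_0}^{r}\frac{dr'}{\sqrt{\dfrac{2\mu r'^4}{h^2}\left[E - V(r')\right] - r'^2}} = \theta_0 - \int_{u_0}^{u}\frac{du'}{\sqrt{\dfrac{2\mu}{h^2}\left[E - V(1/u')\right] - u'^2}}$$

where, as before, $u = 1/r$.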

Now the following points may be noted: 

1. For given $E$, $h$ and the form of $V(r)$, the orbit is fixed. $u_0$ and $\theta_0$ refer merely to
the starting point on the orbit.

2. If $V(r) \propto r^{n+1}$, the above integral can be directly integrated for $n = 1, -2$ and
$-3$.

3. For $n = 5, 3, 0, -4, -5$ and $-7$, the results can be expressed in terms of elliptic
integrals where, by definition, an elliptic integral is $\int R(x, w)\,dx$ with $R$ being any rational
function of $x$ and $w$, and $w$ is defined by

$$w = \sqrt{\alpha x^4 + \beta x^3 + \gamma x^2 + \delta x + \eta}$$

such that $\alpha$ and $\beta$ cannot be simultaneously zero, and $\gamma$, $\delta$ and $\eta$ are constants. We are
not giving any proof of the statements, for which one may consult Goldstein's book.

4. For other values of $n$, the equation of the orbit cannot be expressed in closed form.


4.6 DERIVATION OF FORCE LAWS FROM KINEMATICAL LAWS OF MOTION


Kepler’s laws of planetary motion do not provide any explanation for planetary motions; 
instead, they are merely the descriptions of the motion. Velocity, acceleration and areal 
velocity are kinematical quantities, the corresponding dynamical quantities being linear 
momentum, force and angular momentum respectively. The difference between these two 
sets of quantities is primarily the mass factor. We know that acceleration, a kinematical 
quantity, is defined as a = dv/dt. Its dynamical equivalent, Newton’s second law of motion, 
given by F = dp/dt is far more significant than its kinematical counterpart, although 
both are merely definitions. Similarly Kepler’s second law of constant areal velocity can be 
explained by a dynamically very significant conservation law, that is, the conservation of 
orbital angular momentum of the planets. However, historically it was a giant step forward 
when Newton derived a fundamental force law of nature from Kepler’s laws of planetary 
motion, using his laws of motion. Here we shall present two examples of derivation of force 
laws from given kinematical laws. 


4.6.1 Newton's Law of Gravitation From Kepler's Laws of Planetary Motion


Kepler's first law suggests that the equation of the orbit in plane polar coordinates about
the focus is (see Fig. 4.3) given by

$$r = \frac{a(1 - e^2)}{1 + e\cos\theta} \tag{4.23}$$

where $a$ is the semimajor axis and $e$ the eccentricity of the orbit. Kepler's second law
states that the areal velocity is constant, that is,

$$r^2\dot{\theta} = \text{const.} = H \ \text{(say)} \tag{4.24}$$



Fig. 4.3 Finding the general equation of a conic section. From the above construction
$LD = MM' + FM\cos\theta$, $M$ being any point on the conic, the extended
line $DD_0$ the directrix, $LL'$ the latus rectum, $P$ the pericenter, and $F$ the
primary focus


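The force on the planet at any instant (using Newton's second law of motion) is

$$\mathbf{F} = m\ddot{\mathbf{r}} = m\left(\ddot{r} - r\dot{\theta}^2\right)\hat{\mathbf{r}} + m\left(r\ddot{\theta} + 2\dot{r}\dot{\theta}\right)\hat{\boldsymbol{\theta}} \tag{4.25}$$

Differentiating Eqs (4.23) and (4.24) with respect to $t$ one can express the RHS of Eq.
(4.25) in terms of $r$ and $\theta$ variables to give,

$$\mathbf{F} = -\frac{mH^2}{a(1 - e^2)\,r^2}\,\hat{\mathbf{r}} \tag{4.26}$$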


Since for any planet, $m$, $a$, $H^2$ and $e$ are all positive constants and $e < 1$, the force is
not only central but also attractive in nature and follows an inverse square law of distance
from the sun.
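The passage from Eqs (4.23) and (4.24) to Eq. (4.26) can be checked numerically without doing the differentiation by hand. The sketch below is an illustration under assumptions: the values $a = 1$, $e = 0.3$, $H = 1$, the nested central differences and the function name are all choices made here. It uses $\dot{\theta} = H/r^2$ to convert $\theta$-derivatives into time derivatives.

```python
import math

# Radial acceleration r_ddot - r theta_dot^2 along the Kepler ellipse
# r = a(1 - e^2) / (1 + e cos(theta)) with constant r^2 theta_dot = H.

def radial_acceleration(theta, a, e, H, d=1e-4):
    l = a * (1.0 - e * e)                      # semi-latus rectum

    def r(th):
        return l / (1.0 + e * math.cos(th))

    def theta_dot(th):
        return H / r(th) ** 2                  # Kepler's second law, Eq. (4.24)

    def r_dot(th):
        # dr/dt = (dr/dtheta) * theta_dot, derivative by central difference
        return (r(th + d) - r(th - d)) / (2.0 * d) * theta_dot(th)

    r_ddot = (r_dot(theta + d) - r_dot(theta - d)) / (2.0 * d) * theta_dot(theta)
    return r_ddot - r(theta) * theta_dot(theta) ** 2, r(theta)

a, e, H = 1.0, 0.3, 1.0                        # assumed orbit parameters
for theta in (0.3, 1.7, 2.9, 4.4):
    a_r, r_now = radial_acceleration(theta, a, e, H)
    predicted = -H * H / (a * (1.0 - e * e) * r_now ** 2)   # Eq. (4.26) / m
    print(round(theta, 1), a_r, predicted)
```

The transverse component $r\ddot{\theta} + 2\dot{r}\dot{\theta} = (1/r)\,d(r^2\dot{\theta})/dt$ vanishes identically once $r^2\dot{\theta}$ is held constant, so only the radial part needs checking.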

Now one has to check whether the constant of proportionality between $F$ and $1/r^2$
would be the same for all planets because $m$, $a$, $e$, $H$ are all different for different planets.
Here comes in the use of the third law.

Kepler’s third law says that the orbital period $P \propto a^{3/2}$ or,

$$P^2 = K_0^2\,a^3 \tag{4.27}$$

$K_0$ being the same for all planets. Now from the definition of areal velocity and its
constancy,

$$r^2\dot\theta = H = 2\pi ab/P = \left(2\pi a^2\sqrt{1 - e^2}\right)/P \tag{4.28}$$

Combining Eqs (4.27) and (4.28) we get

$$H^2 = \frac{4\pi^2 a(1 - e^2)}{K_0^2}$$

so that

$$\mathbf F = -\frac{4\pi^2 m}{K_0^2\,r^2}\,\hat{\mathbf r}$$

where $K_0$ is the same constant for all planets. Therefore it turns out that apart from the
inverse square dependence on distance and a constant factor, the force of attraction between
any planet and the sun is also proportional to the mass of the planet.

Now one uses Newton’s third law of motion, that is, the planet must be attracting the sun
with the same but opposite force. This is possible only if the factor $1/K_0^2$ is proportional
to the mass of the sun $M_\odot$, leading finally to Newton’s law of gravitation

$$\mathbf F = -\frac{GM_\odot m}{r^2}\,\hat{\mathbf r}$$

where $G$ is a universal constant. However, Newton could become sure of the universality of
this law only after finding that it gave him the right value of the acceleration due to gravity
on the surface of the earth, and also quantitatively explained Galileo’s kinematical laws of
freely falling bodies, the observed relationship between the length and period of oscillation of
any simple pendulum, and the motion of the moon around the earth. Thus, during Newton’s
lifetime, the universality of Newton’s law of gravitation extended at least up to the scale
size of the solar system or, more precisely, up to the then known outermost planet, Saturn.


4.6.2 Force Law Corresponding To Ptolemy’s Epicyclic Model

Basically Ptolemy’s epicyclic model suggests that each planet, including the sun and the
moon, is moving in a circle called the epicycle, the centre of which is again moving in a circle
called the deferent. In the most primitive forms of the geocentric models the earth was assumed
to be at the centre of all deferents and the angular speeds on the deferent ($\omega_1$) and on the
epicycle ($\omega_2$) were assumed to be constants. (However, in the actual models of Ptolemy, of
Aryabhata I and of others, the earth was assumed to be slightly displaced from the respective
centres of the deferents, and the origin about which $\omega_1$ was assumed to be constant was
yet another point called the equant, as shown in Fig. 4.4.)

Fig. 4.4 Motion of a planet in Ptolemy’s geocentric model of a
rigid body with respect to an outside inertial frame at O
and a body frame at $B_0$

Essentially the orbit of a planet can be represented by

$$\mathbf r = \mathbf r_1 + \mathbf r_2$$

where

$$\mathbf r_1 = a[\cos(\omega_1 t)\,\mathbf i + \sin(\omega_1 t)\,\mathbf j] \quad\text{and}\quad \mathbf r_2 = b[\cos(\omega_2 t)\,\mathbf i + \sin(\omega_2 t)\,\mathbf j]$$

$a$ being the radius of the planet’s orbital deferent and $b$ the radius of the planet’s orbital
epicycle (see Fig. 4.4). Now, $\dot{\mathbf r}_1 = \boldsymbol\omega_1 \times \mathbf r_1$, where $\boldsymbol\omega_1 = \omega_1\mathbf k$, say, $\mathbf k$ pointing in a
direction perpendicular to the orbital plane of the planet, and similarly $\dot{\mathbf r}_2 = \boldsymbol\omega_2 \times \mathbf r_2$, with
$\boldsymbol\omega_2 = \omega_2\mathbf k$, giving

$$\dot{\mathbf r} = \boldsymbol\omega_1 \times \mathbf r_1 + \boldsymbol\omega_2 \times \mathbf r_2$$

$$\ddot{\mathbf r} = \boldsymbol\omega_1 \times (\boldsymbol\omega_1 \times \mathbf r_1) + \boldsymbol\omega_2 \times (\boldsymbol\omega_2 \times \mathbf r_2)$$

It is now easy to show that

$$(\boldsymbol\omega_1 + \boldsymbol\omega_2) \times \dot{\mathbf r} = \ddot{\mathbf r} - (\boldsymbol\omega_1\cdot\boldsymbol\omega_2)\,\mathbf r$$

or

$$\ddot{\mathbf r} = (\boldsymbol\omega_1\cdot\boldsymbol\omega_2)\,\mathbf r + (\boldsymbol\omega_1 + \boldsymbol\omega_2) \times \dot{\mathbf r}$$

Since $\boldsymbol\omega_1$ and $\boldsymbol\omega_2$ are taken to be constants and are either parallel or antiparallel, this
force law corresponds to an isotropic (charged) oscillator (represented by the first term)
placed in a uniform magnetic field (represented by the second term). Even though such
a force law is not totally unphysical, as it corresponds to an oscillating charged particle
inside a magnetron, it would be extremely difficult to justify such a physical scenario for the
motion of planets in the solar system. Hence Ptolemy’s epicyclic model was not considered
a viable physical model of the solar system, even if its kinematical descriptions were proved
to be correct.
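The identity relating the acceleration to position and velocity can be checked numerically. The following is a minimal sketch, not part of the original text: the function names and the sample values of $a$, $b$, $\omega_1$, $\omega_2$ are illustrative assumptions, and the motion is taken to lie in the $xy$-plane with $\mathbf k$ along $z$.

```python
import math

def epicycle_state(a, b, w1, w2, t):
    """Position, velocity and acceleration of r = r1 + r2 at time t (2D tuples)."""
    c1, s1 = math.cos(w1 * t), math.sin(w1 * t)
    c2, s2 = math.cos(w2 * t), math.sin(w2 * t)
    r = (a * c1 + b * c2, a * s1 + b * s2)
    v = (-a * w1 * s1 - b * w2 * s2, a * w1 * c1 + b * w2 * c2)
    acc = (-a * w1**2 * c1 - b * w2**2 * c2,
           -a * w1**2 * s1 - b * w2**2 * s2)
    return r, v, acc

def force_law_rhs(r, v, w1, w2):
    """(w1*w2) r + (w1 + w2) k x v, with k the unit normal to the orbital plane."""
    w = w1 + w2
    # For a vector in the plane, k x (vx, vy) = (-vy, vx).
    return (w1 * w2 * r[0] - w * v[1], w1 * w2 * r[1] + w * v[0])

# Illustrative values (assumed): deferent radius 1, epicycle radius 0.3,
# antiparallel angular velocities.
r, v, acc = epicycle_state(1.0, 0.3, 0.8, -2.1, t=0.37)
rhs = force_law_rhs(r, v, 0.8, -2.1)
print(acc, rhs)  # the two pairs should agree to round-off
```

The signed product `w1 * w2` plays the role of $\boldsymbol\omega_1\cdot\boldsymbol\omega_2$, so the same check covers both the parallel and the antiparallel case.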



4.7 KEPLER’S PROBLEM 


Kepler’s problem is the inverse of Newton’s problem; starting with Newton’s law of gravitation,
one now has to deduce Kepler’s laws of planetary motion. This is a central force
problem with the law of force given by the Newtonian inverse square law, namely

$$\mathbf f(\mathbf r) = -\frac{GM_\odot m}{r^2}\,\hat{\mathbf r} \quad\text{and}\quad V(r) = -\frac{GM_\odot m}{r} \tag{4.29}$$

where $\mathbf r$ is the radius vector of the planet measured from the centre of the sun, and $M_\odot$ and
$m$ are the masses of the sun and the planet respectively. It was not Newton but Jacob
Hermann, a student of Johannes Bernoulli, who obtained for the first time in 1710 the orbit
equation for Newton’s law of gravitation.

The equation of motion under Newton’s law of gravitation is given by

$$\mu\frac{d^2\mathbf r}{dt^2} = \mathbf f(\mathbf r) = -\frac{GM_\odot m}{r^2}\,\hat{\mathbf r}$$

$\mu = M_\odot m/(M_\odot + m)$ being the reduced mass, or

$$\frac{d^2\mathbf r}{dt^2} = -\frac{G(M_\odot + m)}{r^2}\,\hat{\mathbf r} = -\frac{K}{r^2}\,\hat{\mathbf r} \tag{4.30}$$

with

$$K = G(M_\odot + m) = GM_\odot\left(1 + \frac{m}{M_\odot}\right)$$


Taking vector product with $\mathbf r$ on both sides of Eq. (4.30), we get

$$\mathbf r \times \frac{d^2\mathbf r}{dt^2} = 0$$

which means that the vector

$$\mathbf H = \mathbf r \times \frac{d\mathbf r}{dt} = 2\mathbf A' \ \text{(say)} \tag{4.31}$$

is a constant of motion. One can easily identify $\mathbf A'$ as the areal velocity vector. Thus Eq.
(4.31) states that the radius vector of the planet sweeps equal areas in equal intervals of
time, proving Kepler’s second law.

Since $\mathbf r\cdot\mathbf H = 0$, $\mathbf r$ is always perpendicular to the $\mathbf H$ vector, that is, $\mathbf r$ is confined to
the plane perpendicular to the $\mathbf H$ vector. Now consider,

$$\frac{d^2\mathbf r}{dt^2} \times \mathbf H = -\frac{K}{r^2}\,\hat{\mathbf r} \times \mathbf H = -\frac{K}{r^3}\,\mathbf r \times \left(\mathbf r \times \frac{d\mathbf r}{dt}\right) = K\frac{d\hat{\mathbf r}}{dt}$$

Using the fact that $d\mathbf H/dt = 0$, we can transform the LHS to get after integration

$$\frac{d\mathbf r}{dt} \times \mathbf H - K\hat{\mathbf r} = \text{const.} = \mathbf A \tag{4.32}$$

where Eq. (4.32) defines the constant vector $\mathbf A$. Thus we get another constant of motion
$\mathbf A$ called the Runge-Lenz vector. It is also called the Laplace vector or even sometimes the
Laplace-Runge-Lenz vector. However, the actual credit should have gone to Jacob Hermann
who was the first to obtain the correct magnitude of this vector in 1710, and to Johannes
Bernoulli who found its direction in 1713.

Since $\mathbf A\cdot\mathbf H = 0$, we see that $\mathbf A$ is perpendicular to $\mathbf H$ or, $\mathbf A$ is a fixed vector lying in
the plane of the orbit.

Let us now proceed to obtain the equation of the orbit. We start with,

$$H^2 = \mathbf r\cdot\left(\frac{d\mathbf r}{dt} \times \mathbf H\right) = \mathbf r\cdot(\mathbf A + K\hat{\mathbf r}) = \mathbf r\cdot\mathbf A + Kr$$

Therefore, we get for $r$

$$r = \frac{H^2/K}{1 + (A/K)\cos\theta} \tag{4.33}$$

where $\theta$ is the angle between $\mathbf r$ and $\mathbf A$. Equation (4.33) is the equation of the orbit. It
has the form

$$r = \frac{p}{1 + e\cos\theta} \tag{4.34}$$

Equation (4.34) has the form of the equation for a general conic section (cf. Eq. 4.23).
Thus Kepler’s first law is proved. The planetary orbit is a conic section with $p = H^2/K$
as the semilatus rectum and $e = |\mathbf A|/K = A/K$ as the eccentricity. $\theta$, which is the angle
between $\mathbf r$ and $\mathbf A$, is called the true anomaly, that is, the angle between the perihelion (by
definition, the point on the orbit closest to the sun) and the radius vector. This identification
makes it clear that $\mathbf A$ lies in the direction of the perihelion (as $r$ is minimum for $\theta = 0$)
and aligns with the major axis of the conic section. Aphelion is defined to be the point on the
orbit farthest from the sun, that is, corresponding to $\theta = \pi$ in Eq. (4.34).

If the orbit is not referred to any specific central object, one usually refers to these two
points as pericenter and apocenter. If the orbit is around the earth instead of the sun one
uses the terms perigee and apogee instead of perihelion and aphelion. Similarly, for the
orbit of a star around another star, astronomers use the terms periastron and apoastron
respectively. The line of the major axis is also called the apsidal line or simply apsis, to
include the cases of unbound (parabolic and hyperbolic) orbits as well.

We further have,

$$|\mathbf A| = A = Ke = GM_\odot\,e\left(1 + \frac{m}{M_\odot}\right) \tag{4.35}$$

Because its magnitude is proportional to $e$, $\mathbf A$ is sometimes called the eccentricity vector.
Historically it was so named by Hamilton (1845).

In order to know which of the conic sections can possibly represent a planetary orbit, it is
necessary to obtain a relation between the orbital eccentricity and the specific energy ($E'$)
of the planet, that is, energy per unit reduced mass of the planet. We get from Eq. (4.32),

$$A^2 = \left(\frac{d\mathbf r}{dt}\right)^2 H^2 + K^2 - \frac{2K}{r}H^2$$

and then, differentiating with respect to $t$,

$$\frac12\frac{d}{dt}\left(\frac{d\mathbf r}{dt}\right)^2 = \frac{d\mathbf r}{dt}\cdot\frac{d^2\mathbf r}{dt^2} = \frac{d}{dt}\left(\frac{K}{r}\right)$$

giving

$$\frac{d}{dt}\left(\frac12 v^2 - \frac{K}{r}\right) = \frac{d}{dt}\left(\frac{E}{\mu}\right) = 0$$

However, the expression in the bracket is the specific energy, denoted by $E' = E/\mu$. Using
this expression for $E'$ we get

$$A^2 = 2E'H^2 + K^2 \tag{4.36}$$

Therefore, from Eq. (4.35)

$$e = \frac{A}{K} = \sqrt{1 + \frac{2E'H^2}{K^2}} \tag{4.37}$$

Thus the specific energy $E' < 0$ implies $e < 1$, $E' > 0$ implies $e > 1$, $E' = 0$
implies $e = 1$ and $E' = -K^2/2H^2$ gives $e = 0$. It is well known that for $0 < e < 1$
the conic section is an ellipse, for $e = 1$ it is a parabola, for $e > 1$ it is a hyperbola,
and for $e = 0$ it is a circle. But we know that, for the attractive inverse square law, the
requirement that the orbit be bounded corresponds to $-K^2/2H^2 \le E' < 0$, implying
$0 \le e < 1$. Thus the planetary orbits must be elliptical. (However, for a repulsive inverse
square law, $K < 0$, hence always $E' > 0$, implying $e > 1$ or no bounded orbits.)
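The classification by specific energy can be sketched in a few lines. This is an illustrative helper rather than anything given in the text: the function names and the tolerance `tol` are assumptions, and Eq. (4.37) is taken in the form $e = \sqrt{1 + 2E'H^2/K^2}$ for an attractive force ($K > 0$).

```python
import math

def eccentricity(E_spec, H, K):
    """Eq. (4.37): e = sqrt(1 + 2 E' H^2 / K^2)."""
    arg = 1.0 + 2.0 * E_spec * H**2 / K**2
    return math.sqrt(max(arg, 0.0))  # clamp tiny negative round-off

def classify(E_spec, H, K, tol=1e-12):
    """Name the conic section for given specific energy E' and areal constant H."""
    e = eccentricity(E_spec, H, K)
    if e < tol:
        return "circle"
    if abs(e - 1.0) < tol:
        return "parabola"
    return "ellipse" if e < 1.0 else "hyperbola"

# With K = H = 1 the circular-orbit energy is E' = -K^2 / (2 H^2) = -0.5.
for E_spec in (-0.5, -0.25, 0.0, 0.5):
    print(E_spec, classify(E_spec, 1.0, 1.0))
```

In floating-point work the parabolic and circular cases are boundary values, which is why a tolerance is used instead of an exact comparison.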


For elliptical orbits (see Fig. 4.3), the length of the semilatus rectum is

$$p = a(1 - e^2)$$

$a$ being the length of the semimajor axis. Hence, from Eqs (4.33), (4.34) and (4.37)

$$a = -\frac{K}{2E'} \tag{4.38}$$

Using the expression for $E'$ we get

$$v^2 = K\left(\frac{2}{r} - \frac{1}{a}\right) \tag{4.39}$$

This relation is pictorially illustrated in Fig. 4.5. The expression (4.39) can be rewritten as

$$\frac12 v^2 = \frac{K}{r} - \frac{K}{2a}$$

which is the gain in specific kinetic energy of any particle dropped from rest from a height
$2a$ from the force centre to a height $r$ from the force centre. If we draw a circle of radius
$2a$ about the force centre $O$, the actual elliptical orbit lies inside this circle. The actual
speed of the particle at any point on the orbit is the same as that gained by a free fall from
$r' = 2a$ to $r' = r$, where $r'$ is also measured from the force centre $O$.

Fig. 4.5 The variation of the Keplerian orbital speed along the
orbit of a planet can be viewed as that due to the differential
gain from free fall from rest from a circle of
radius equal to the length of the major axis of the orbit,
to the location on the orbit, shown by the arrowed paths
of the assumed free fall

Note that we have proved the first two of Kepler’s laws, namely the ellipticity of the
planetary orbits and the constancy of areal velocity. We shall now prove the third law, which
states that the square of the period of revolution is proportional to the cube of the semimajor
axis. Since the areal speed $H/2$ is constant, we can write

$$\frac12\int_0^P H\,dt = \text{total area of the ellipse} = \pi a^2\sqrt{1 - e^2}$$

where $P$ is the period of the orbital motion. This gives

$$P = \frac{2\pi a^2\sqrt{1 - e^2}}{H} \tag{4.40}$$

Using $H^2 = Kp = Ka(1 - e^2)$, we get

$$\frac{P^2}{a^3} = \frac{4\pi^2}{G(M_\odot + m)} \tag{4.41}$$

which is the required law, provided we neglect $m$ in comparison with $M_\odot$.

We conclude this section with the following remarks:

1. Planetary laws of motion can only evaluate $K$ and, less precisely, $m/M_\odot$ (by using
Kepler’s third law and Eq. (4.41)).

2. Using these laws, one cannot evaluate $G$ and $M_\odot$ separately; only the product $GM_\odot$
can be evaluated. Thus the mass of the sun, $M_\odot$, can be determined only if $G$ has been
determined by an independent method, say by any laboratory method. The value of $GM_\odot$
has been known quite precisely since Newton’s time, but even now the best accuracy of $G$ hardly
goes beyond four significant digits. Carl Gauss had adopted, in 1809, a value of $GM_\odot$
defined by $k^2 = GM_\odot$. Here $k$ is called Gauss’ constant of gravitation for the sun. Its
value is fixed by his choice

$$k = \sqrt{GM_\odot} = 0.01720209895\ \text{AU}^{3/2}\,\text{day}^{-1}\ \text{exactly}$$

so that the period of revolution of a test particle orbiting around the sun at a distance of
1 AU is $P = 2\pi/k$. This provides a definition of AU, the astronomical unit. So 1 AU
need not be exactly the average distance of the earth from the sun. Since $k$ has been fixed
for all time, any revision of the mass of the earth (actually of the ratio $m/M_\odot$) would call for a
revision in the actual length of AU in SI units. According to the best estimate to this date,
$M_\odot$ = 328900.55 times the mass of earth plus moon, giving the length of the semimajor
axis of the earth’s orbit $a$ = 1.000000034 AU, where 1 AU = 149597870.66 km, and
$GM_\odot = 1.32712438 \times 10^{20}\ \text{m}^3\,\text{s}^{-2}$.

3. We must remember $K = k^2(1 + m/M_\odot)$. It differs slightly from planet to planet.
So Kepler’s third law is not exact, since $K$ is not the same for all planets.
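The numbers quoted in remarks 2 and 3 can be tied together with a short calculation. The sketch below uses only the values given in the text ($k$, the mass ratio 328900.55 and $a$ = 1.000000034 AU); the function and variable names are illustrative assumptions.

```python
import math

K_GAUSS = 0.01720209895            # AU^(3/2) per day, value quoted in the text
SUN_OVER_EARTH_MOON = 328900.55    # M_sun / (M_earth + M_moon), from the text

def period_days(a_au, mass_ratio=0.0):
    """P = 2*pi*sqrt(a^3 / K), with K = k^2 (1 + m/M_sun) in AU^3 / day^2."""
    K = K_GAUSS**2 * (1.0 + mass_ratio)
    return 2.0 * math.pi * math.sqrt(a_au**3 / K)

# Massless test particle at exactly 1 AU: P = 2*pi/k.
gaussian_year = period_days(1.0)

# Earth plus moon, with the semimajor axis quoted in the text.
earth_year = period_days(1.000000034, 1.0 / SUN_OVER_EARTH_MOON)

print(gaussian_year, earth_year)
```

The second period comes out slightly shorter than the first because the factor $1 + m/M_\odot$ in $K$ outweighs the small excess of $a$ over 1 AU, which is the sense in which Kepler’s third law is not exact.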


4.8 ACTUAL GEOMETRY OF ORBITS AND ORBITAL ELEMENTS 

The plane of the earth’s revolution around the sun, called the ecliptic, is taken to be the
reference plane for the ecliptic (polar) coordinate system (see Fig. 4.6). Celestial longitude is
measured along the ecliptic in an anticlockwise sense and the celestial latitude is the angular
elevation or depression with respect to the ecliptic. The origin of the longitude is set by the
line of intersection of the ecliptic and the celestial equator, the latter being the extended
plane of the earth’s equator. This origin defines the direction of the Vernal equinox (♈) or
the First Point of Aries. The orbital plane of any other planet may make an angle $i$, called
the angle of inclination, with the ecliptic. The two orbital planes, namely, those of the
earth and of the planet concerned, intersect along a line called the nodal line or simply the
orbital node. There are obviously two nodes: through one the planet moves from the south
of the ecliptic to the north of it, called the ascending node, and through the other the planet
moves from the north to the south of the ecliptic, called the descending node. The location
of the ascending node on the ecliptic with respect to its standard origin (♈) is given by the
angular quantity $\Omega$, defining the longitude of the ascending node. Similarly, the location of
the perihelion of the planet’s orbit is expressed by the longitude of the perihelion ($\varpi$), and
the true distance of the perihelion from the sun by $q = a(1 - e)$. Specifying further the
orbital eccentricity $e$, and the length of the semimajor axis $a$ (or equivalently, the orbital
period $P$) determines the orbit precisely in space. The initialisation of the orbit is done by
supplying the epoch of the perihelion passage of the planet, which is usually denoted by $T_0$.

In order to fix the orbit of a planet in space we thus require six independent quantities
called the orbital elements. The six orbital elements are $a$ (or $P$), $e$, $\varpi$, $\Omega$, $i$ and $T_0$, all
of which have been defined in the preceding paragraph. We list them again for a quick
reference.

Fig. 4.6 Orbital elements of planetary orbits explained

$a$ = semimajor axis, or $P$ = orbital period = $2\pi\sqrt{a^3/K}$
$e$ = eccentricity
$\varpi$ = longitude of the perihelion
$\Omega$ = longitude of the node
$i$ = inclination of the orbit with respect to the ecliptic, and
$T_0$ = epoch of a perihelion passage.

We now define the concept of anomaly. Anomaly, by definition, is a measure of the angular
advance of the planet, centred at the sun, from its last perihelion passage. Usually, the
following kinds of anomalies are defined:

True anomaly ($\nu$) = actual angle at the focus = $\angle HFM$ in Fig. 4.7 = same as $\theta$ in Eq.
(4.33).

Mean anomaly ($g$) = $2\pi(t - T_0)/P$, where $t$ is any instant of time, and

Eccentric anomaly ($E$) = $\angle POM'$ in Fig. 4.7.


Fig. 4.7 The eccentric and true anomalies ($E$ and $\nu$) explained with reference
to the auxiliary circle

Figure 4.7 shows an elliptic orbit with its auxiliary circle. An ellipse is known to be an
affine transform of its auxiliary circle in the ratio $\sqrt{1 - e^2}$ to 1.

The three kinds of anomalies defined above are not independent. The relation between
$E$ and $\nu$ can be obtained as follows (refer to Fig. 4.7). We have

$$x = FH = r\cos\nu = a(\cos E - e)$$

$$y = HM = r\sin\nu = (a\sin E)\,(HM/HM') = a\sqrt{1 - e^2}\,\sin E \tag{4.42}$$

$$r = \sqrt{x^2 + y^2} = a(1 - e\cos E)$$

and from the 1st and the 3rd of Eq. (4.42)

$$\tan\frac{\nu}{2} = \sqrt{\frac{1 - \cos\nu}{1 + \cos\nu}} = \sqrt{\frac{1 + e}{1 - e}}\,\tan\frac{E}{2} \tag{4.43}$$

which is one of the required relations, namely the relation between the eccentric and true
anomalies.

4.9 KEPLER’S EQUATION

Kepler’s equation represents the relation between the eccentric and mean anomalies. This 
can be geometrically obtained as follows. 


Referring to Fig. 4.7 one can write

$$\text{sector area } PFM = \text{areal velocity} \times (t - T_0) = \frac12 H(t - T_0) = \frac12\,\frac{HPg}{2\pi}$$

Using Eq. (4.40) we get,

$$\text{sector area } PFM = \frac12 a^2\sqrt{1 - e^2}\;g$$

Again, since

$$\frac{\text{sector area } PFM}{\text{sector area } PFM'} = \sqrt{1 - e^2}$$

the sector area $PFM' = \frac12 a^2 g$.

It is easy to see, however, that

$$\text{sector area } PFM' = \text{sector area } POM' - \text{triangular area } FOM' = \frac12 a^2E - \frac12 a^2e\sin E$$

so that

$$E - e\sin E = g = \frac{2\pi}{P}(t - T_0) \tag{4.44}$$

This is the famous Kepler’s equation relating E to g, or equivalently E to t. 

In the above we have derived Kepler’s equation for an elliptic orbit. Equivalent forms for 
hyperbolic and parabolic orbits exist. We summarise them in Table 4.1 (p = H 2 /K , Eq. 
(4.34)). In the parabolic case, the expression can be found in closed form and the equivalent 
of Kepler’s equation is what is known as Barker’s equation. 


Table 4.1 Equivalent Forms of Kepler’s Equation for Parabolic, Elliptic and
Hyperbolic Cases

Quantity       Elliptic case                  Hyperbolic case                      Parabolic case ($s = \tan\nu/2$)

$r\cos\nu$     $a(\cos E - e)$                $a(\cosh F - e)$                     $p(1 - s^2)/2$

$r\sin\nu$     $a\sqrt{1 - e^2}\,\sin E$      $a\sqrt{e^2 - 1}\,\sinh F$           $ps$

$t - T_0$      $P(E - e\sin E)/2\pi$          $(-F + e\sinh F)\sqrt{|a|^3/K}$      $\sqrt{p^3/K}\;s(1 + s^2/3)/2$

The entries in the last row are the equivalents of Kepler’s equation for the above three
cases. Usually one wants to determine the values of $r$ and $\nu$ for a given value of $t$. Of
course, all the orbital elements are usually known. So the first step is to use Kepler’s Eq.
(4.44), or its appropriate equivalent for the hyperbolic and parabolic cases, for evaluating an
intermediate quantity $E$, $F$ or $s$. Since Kepler’s equation is transcendental in $E$ or $F$, it
has to be solved numerically (or using a series expansion for $E$ or $F$). Once $E$, $F$ or
$s$ is determined, the first two rows of the above table give the values of $r\cos\nu$ and $r\sin\nu$,
from which $r$ and $\nu$ can be easily evaluated, thus specifying the location of the object on
the orbit at any instant $t$.
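For the elliptic case, the two-step procedure just described can be sketched as follows. Newton’s method is used here as one common way of solving the transcendental equation; the text does not prescribe a particular numerical method, and the function names, starting guesses and tolerance are assumptions made for illustration.

```python
import math

def solve_kepler(g, e, tol=1e-12, max_iter=50):
    """Solve E - e sin E = g for E (elliptic case, 0 <= e < 1) by Newton's method."""
    E = g if e < 0.8 else math.pi          # conventional starting guesses
    for _ in range(max_iter):
        dE = (E - e * math.sin(E) - g) / (1.0 - e * math.cos(E))
        E -= dE
        if abs(dE) < tol:
            break
    return E

def position(t, a, e, P, T0=0.0):
    """Return (r, nu) at time t for an elliptic orbit with elements a, e, P, T0."""
    g = 2.0 * math.pi * (((t - T0) / P) % 1.0)     # mean anomaly in [0, 2*pi)
    E = solve_kepler(g, e)
    x = a * (math.cos(E) - e)                      # r cos(nu)
    y = a * math.sqrt(1.0 - e * e) * math.sin(E)   # r sin(nu)
    return math.hypot(x, y), math.atan2(y, x)

# Half a period after perihelion passage the body is at the apocenter.
r, nu = position(t=0.5, a=1.0, e=0.5, P=1.0)
print(r, nu)
```

Here `nu` is returned in the range $(-\pi, \pi]$; adding $2\pi$ to negative values gives the anomaly measured from the last perihelion passage.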


4.10 CONSTRUCTION OF AN ORBIT FROM GIVEN SET OF INITIAL CONDITIONS


Any two-body problem in celestial mechanics is basically a two-dimensional problem, which
means that only four independent initial conditions are to be specified. Let us suppose
that the given or specified quantities are $\mathbf r_0$ and $\mathbf v_0$. Assuming $m \ll M_\odot$, we can take
$K = GM_\odot$. The geometrical construction of the orbit in the plane of the drawing sheet
involves the following steps:

(i) We can write the conditions for the nature of the orbit (see section 4.7) in terms of
$v_0$ and $r_0$ and test which of the following conditions is satisfied.

$v_0^2 > 2GM_\odot/r_0$        hyperbola
$v_0^2 = 2GM_\odot/r_0$        parabola
$v_0^2 < 2GM_\odot/r_0$        ellipse
$v_0^2 = GM_\odot/r_0$         circle (with $\mathbf v_0$ perpendicular to $\mathbf r_0$)

The quantity $\sqrt{2GM_\odot/r_0}$ is called the escape velocity at $r_0$. Since $G$, $M_\odot$, $r_0$ and
$v_0\ (= |\mathbf v_0|)$ are known, we can thus determine the nature of the orbit.

(ii) If the orbit is elliptical, obtain the length of the semimajor axis from Eq. (4.39)
and then do the following.

(iia) Draw a normal $PN$ to the initial velocity direction $\mathbf v_0$ at $P$ and join the point $P$ to
the force centre $F$ (see Fig. 4.8). Now it is a property of the conic sections that the normal
$PN$ bisects the angle $\angle FPF'$, $F'$ being the other focus. Draw the line showing the direction
of the secondary focus $PF'$ from $P$. Now, use another property of the ellipse, namely $FP + F'P = 2a$.
Since we know $a$ and $FP$ we can calculate the length $F'P = 2a - r_0$.
This gives the location of the secondary focus $F'$ on the line $PF'$.

(iib) Join $FF'$ by a line and extend it. This will be the major axis.

(iic) Again from the relation $FF' = 2ae$, we can obtain the eccentricity $e$ of the orbit.
However we can complete the drawing of the ellipse using a piece of string with ends tied
at $F'$ and $F$, and running a pencil through $P$ with the cords always stretched to maximum.
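Steps (ii) to (iic) translate into a short computation. The sketch below is an illustration, not the authors’ procedure: it places the force centre $F$ at the origin of the orbital plane, mirrors the direction $PF$ about the normal at $P$ to find the direction $PF'$, and uses hypothetical names throughout. The result can be compared with $e = \sqrt{1 + 2E'H^2/K^2}$.

```python
import math

def second_focus(r0, v0, K):
    """Return (a, e, F') for an elliptic orbit about a force centre F at the origin.

    r0, v0 are 2D tuples in the orbital plane; K = G*M of the central body.
    """
    x, y = r0
    vx, vy = v0
    r = math.hypot(x, y)
    v2 = vx * vx + vy * vy
    if v2 == 0.0 or v2 >= 2.0 * K / r:
        raise ValueError("need 0 < v0^2 < 2K/r0 for this construction")
    a = K * r / (2.0 * K - r * v2)          # from the speed relation, Eq. (4.39)
    v = math.sqrt(v2)
    nx, ny = vy / v, -vx / v                # unit normal PN to the velocity
    dx, dy = -x / r, -y / r                 # unit vector from P towards F
    dn = dx * nx + dy * ny
    mx, my = 2.0 * dn * nx - dx, 2.0 * dn * ny - dy   # PF mirrored about PN
    d = 2.0 * a - r                         # F'P = 2a - r0
    fx, fy = x + d * mx, y + d * my         # location of the second focus F'
    e = math.hypot(fx, fy) / (2.0 * a)      # FF' = 2ae
    return a, e, (fx, fy)

# Illustrative initial conditions in units where K = 1.
a, e, F2 = second_focus((1.0, 0.0), (0.3, 1.0), K=1.0)
print(a, e, F2)
```

For these values the specific energy is $E' = -0.455$ and $H = 1$, so the energy formula gives $e = 0.3$, which the geometric construction should reproduce.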


Fig. 4.8 Construction of the unique Keplerian orbit from the given initial
position and velocity with respect to the force centre, whose
strength in terms of $GM$ is known.

For hyperbolic orbits rules (iia - c) are quite similar except that $FP - F'P = 2a$. For
drawing a hyperbola, take a zipper tape instead of a piece of string. Open it half way
and nail the two open arms at two points. These two points must be the foci. Push the
pencil through the zipper to draw the hyperbola.


4.11 KEPLER’S PROBLEM IN VELOCITY SPACE 


We now obtain expressions for the radial and transverse velocities of the planet.

(i) Radial Velocity

Taking Eq. (4.34) as the equation of a conic section, the radial component of velocity of
any planet is given by

$$v_r = \frac{dr}{dt} = \frac{He\sin\theta}{p} = \sqrt{\frac{K}{a(1 - e^2)}}\;e\sin\theta \tag{4.45}$$

Thus $v_r$ changes periodically as $\sin\theta$ does. The amplitude of variation is $e\sqrt{K/a(1 - e^2)}$.
Also, $v_r$ is maximum when the particle is on the latus rectum ($\theta = \pm\pi/2$) and $v_r = 0$
at the pericenter and the apocenter.

(ii) Transverse Velocity

We have, for the transverse velocity $v_\theta$,

$$v_\theta = r\dot\theta = \frac{H}{r} = \frac{\sqrt{pK}}{r} = \sqrt{\frac{K(1 - e^2)}{a}}\;\frac{a}{r} \tag{4.46}$$

The speed of the planet is given by Eq. (4.39). Therefore,

$$a = \frac{Kr}{2K - rv^2}$$


(iii) Representations of $a$ and $e$ in Velocity Space

Since the circular velocity for a given $r$ is $v_c = \sqrt{K/r}$, one can conveniently express all
the velocity components in units of $v_c$ and $a$ in units of $r$. Thus, defining

$$\tilde v_r = \frac{v_r}{v_c}, \qquad \tilde v_\theta = \frac{v_\theta}{v_c} \qquad\text{and}\qquad \tilde a = \frac{a}{r}$$

we finally have, with $\tilde v^2 = \tilde v_r^2 + \tilde v_\theta^2$,

$$\tilde a = \frac{1}{2 - \tilde v^2}, \qquad e^2 = 1 - \tilde v_\theta^2\,(2 - \tilde v^2) \tag{4.47}$$

Fig. 4.9(a) Contours of constant lengths of semimajor axes of Keplerian orbits,
drawn in the polar mapping of the velocity space

The range of $\tilde a$ is from 1/2 (for $\tilde v = 0$) to $\infty$ (for $\tilde v = \sqrt2$). The curves of constant
semimajor axes are therefore circles as shown in Fig. 4.9(a). All circular orbits have $\tilde v_r = 0$
and $\tilde v_\theta = 1$, so they are all crowded at $y = 1$ on the $y$-axis. Orbits with $\tilde v \to \sqrt2$ (from
below) have speeds very close to the escape speed. The curves of equal eccentricity
intersect the $y$-axis twice, once at pericenter (top) and the other at apocenter (bottom).
The pericenter has higher $\tilde v_\theta$ than the apocenter (see Fig. 4.9(b)). Since the pericenter is
close to the boundary of the bounded orbits, the speed at pericenter approaches the escape
speed; that is why the motion of a comet can be approximated by a parabola when it comes
close to the sun.

Fig. 4.9(b) Curves of equal eccentricity of Keplerian orbits in the velocity space


(iv) Equation of Orbit in terms of $v_\theta$

Since $H = r^2\dot\theta$ and $v_\theta = r\dot\theta$, one gets $v_\theta = H/r = Hu$, where $u = 1/r$ and $H$ is a constant (see also
Eq. (4.46)). So both $u$ and $v_\theta$ will satisfy similar differential equations for orbits under
any central force. We know that for attractive inverse square law, $u$ satisfies

$$\frac{d^2u}{d\theta^2} + u = \frac{K}{H^2}$$

Hence $v_\theta$ should satisfy

$$\frac{d^2v_\theta}{d\theta^2} + v_\theta = \frac{K}{H}$$

which has a solution

$$v_\theta = \frac{K}{H} + D\cos(\theta - \theta_0)$$

$D$ and $\theta_0$ being constants of integration. The value of $v_\theta$ simply oscillates about its mean
value $K/H$ with an amplitude $D$. As $v_\theta = H/r$, the above solution in $v_\theta$ corresponds to
the equation for conic section $r = r(\theta)$. On comparison it turns out that $D = Ke/H$, $e$
being the eccentricity of the actual orbit.
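The velocity-space picture of this section can be explored numerically. The sketch below rests on two relations that follow from Eqs (4.37) and (4.39) when velocities are measured in units of the local circular speed $\sqrt{K/r}$, namely $a/r = 1/(2 - \tilde v^2)$ and $e^2 = 1 - \tilde v_\theta^2(2 - \tilde v^2)$; the function name and the sample values are assumptions made for illustration.

```python
import math

def a_and_e(vr, vt):
    """Map normalised radial and transverse velocities to (a/r, e) for a bound orbit.

    vr, vt are in units of the local circular speed sqrt(K/r).
    """
    v2 = vr * vr + vt * vt
    if v2 >= 2.0:
        raise ValueError("speed at or above escape speed: orbit is not bound")
    a_over_r = 1.0 / (2.0 - v2)
    e = math.sqrt(max(0.0, 1.0 - vt * vt * (2.0 - v2)))
    return a_over_r, e

print(a_and_e(0.0, 1.0))                   # circular orbit: (1.0, 0.0)
print(a_and_e(0.0, math.sqrt(1.73)))       # pericenter of an e = 0.73 orbit
print(a_and_e(0.0, math.sqrt(0.27)))       # apocenter of the same orbit
```

On the $y$-axis ($\tilde v_r = 0$) the relation reduces to $\tilde v_\theta^2 = 1 \pm e$, which gives the two intersections of each equal-eccentricity curve mentioned above.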


4.12 ORBITS OF ARTIFICIAL SATELLITES 
(i) Geosynchronous Orbit 

The orbit of any satellite around the earth that has an orbital period the same as that of the
earth’s diurnal rotation ($P_\oplus$) is called a geosynchronous orbit. Any such orbit must satisfy
the condition,

$$P_\oplus = 23\ \text{hours}\ 56\ \text{minutes}\ 4.099\,(\pm 0.003)\ \text{seconds}$$

From Kepler’s third law applied to earth’s satellites, the semimajor axis of a geosynchronous
orbit is

$$a_0 = \left(\frac{GM_\oplus}{4\pi^2}\,P_\oplus^2\right)^{1/3} = 42{,}164.2\ \text{km}$$

where $M_\oplus$ is the mass of the earth.

Given that the orbit is geosynchronous, it can have any eccentricity and any orientation
with respect to the earth’s equator.

(ii) Geostationary Orbit

The geostationary orbit of the satellite around the earth is such that the satellite remains
stationary with respect to all points on the surface of the earth. This requires that the orbit
must

(a) be geosynchronous, (b) be circular, and (c) stay over the geographical equator of the
earth.

The height of such an orbit from the surface of the earth is therefore given by

$$a_0 - R_\oplus = 35{,}786\ \text{km}$$

$R_\oplus$ being the equatorial radius of the earth.
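The two figures above can be reproduced from Kepler’s third law. In the sketch below the sidereal day is the value quoted in the text, while the values of $GM_\oplus$ and of the equatorial radius are standard reference values assumed here; they are not given in the text.

```python
import math

GM_EARTH = 3.986004418e14                    # m^3 s^-2 (assumed reference value)
R_EQUATOR = 6378.137e3                       # m (assumed equatorial radius)
P_SIDEREAL = 23 * 3600 + 56 * 60 + 4.099     # s, rotation period quoted in the text

def semimajor_axis(period, gm):
    """Kepler's third law solved for a: a = (GM P^2 / 4 pi^2)^(1/3)."""
    return (gm * period**2 / (4.0 * math.pi**2)) ** (1.0 / 3.0)

a_sync = semimajor_axis(P_SIDEREAL, GM_EARTH)
height = a_sync - R_EQUATOR

print(a_sync / 1e3, "km")   # close to 42,164 km
print(height / 1e3, "km")   # close to 35,786 km
```

The mass of the satellite is neglected in comparison with that of the earth, so $K = GM_\oplus$ is used directly.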

(iii) How to Put a Geostationary Satellite into Orbit 
This is done in two steps:

(a) The satellite is directly launched into a low altitude orbit called the transfer orbit,
having perigee ~ 200 km above the earth’s surface and apogee touching the geostationary
orbit at a height of 35,786 km. This requires the eccentricity of the transfer orbit to be
about 0.73.

(b) Since the final orbit has to be circular with radius $r$ = 42,164 km, a suitable thrust
at the apogee of the transfer orbit is required. Referring to Fig. 4.9(b) and using Eq. (4.47),
for $e$ = 0.73 one obtains $AC \approx OC/2 \approx 1.5$ km/s. Thus for raising from A to C one needs
to raise the velocity from 1.5 km/s to 3 km/s. The European rocket Ariane V is launched
on the basis of the above principle.

The American Space Shuttle follows a different procedure. It starts orbiting the earth in
a nearly circular orbit at a height of about 200 km from the ground (see Fig. 4.9(c)). Next,
to put the satellite into an elongated transfer orbit would require an impulse to be given at
the perigee. With reference to Fig. 4.9(b), the required move is to push the satellite from C
to P, which requires imparting $v_0 = 0.3$, that is, 0.3 times the circular velocity corresponding
to the initial low orbit. Since $v_c$ = 7.9 km/s for all circular orbits close to the surface
of the earth, the required $v_0 \approx 2.4$ km/s. The second and final step, namely pushing from point A to
C in Fig. 4.9(b) in order to transfer the satellite from its highly elliptical transfer orbit to
the perfectly circular geostationary orbit, is identical for both the European
Ariane and American Space Shuttle programmes.
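The velocity figures quoted for the transfer orbit can be checked with the speed relation $v^2 = K(2/r - 1/a)$ of Eq. (4.39). The sketch below assumes a reference value of $GM_\oplus$ and an earth radius of 6378 km, neither of which is given in the text; the variable names are illustrative.

```python
import math

GM_EARTH = 398600.4418      # km^3 s^-2 (assumed reference value)
R_EARTH = 6378.0            # km (assumed)

def speed(r, a, gm=GM_EARTH):
    """Orbital speed at distance r on an orbit of semimajor axis a."""
    return math.sqrt(gm * (2.0 / r - 1.0 / a))

r_p = R_EARTH + 200.0       # perigee of the transfer orbit
r_a = 42164.0               # apogee at the geostationary radius

a_t = 0.5 * (r_p + r_a)                 # semimajor axis of the transfer orbit
e_t = (r_a - r_p) / (r_a + r_p)         # its eccentricity, about 0.73

v_apogee = speed(r_a, a_t)              # about 1.6 km/s on the transfer orbit
v_geo = speed(r_a, r_a)                 # about 3.1 km/s on the circular orbit
v_low = speed(r_p, r_p)                 # circular speed in the low orbit
v_perigee = speed(r_p, a_t)             # speed needed at perigee

print(e_t, v_apogee, v_geo)
print(v_perigee - v_low, v_perigee / v_low - 1.0)
```

The last line gives the perigee impulse in km/s and as a fraction of the local circular speed; the fraction equals $\sqrt{1 + e} - 1$, in line with the value of about 0.3 read off the velocity-space diagram.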



(iv) Why Should Rockets be Fired off from the Perigee rather than the Apogee?

The equation of motion of a rocket, having a variable mass, moving in free space under
its own rocket action, is given by Eq. (1.27),

$$\mathbf F = m\frac{d\mathbf v}{dt} = \mathbf u_0\frac{dm}{dt}$$

where $dm/dt$ is the rate of change of the total mass due to fuel consumption, $\mathbf u_0$ is the
velocity of the ejected gas relative to the rocket, $m$ is the mass of the rocket at any instant
and $\mathbf v$ is the velocity of the rocket. The power gained at any instant due to the rocket
action is given by

$$\mathbf F\cdot\mathbf v = (\mathbf v\cdot\mathbf u_0)\frac{dm}{dt}$$

For a constant rate of fuel consumption and a constant speed of the ejecta, the power gain is
proportional to $\mathbf v\cdot\mathbf u_0 = u_0 v\cos\theta$, where $\theta$ is the angle between $\mathbf v$ and $\mathbf u_0$. In order to
maximise the power gain, we need $\theta = 180^\circ$ (since $dm/dt < 0$) and $v$ as large as possible.
Since the speed of the satellite or of the rocket is highest at perigee, the rocket should be
fired when it passes through the perigee for a maximum gain in kinetic energy in the minimum
possible time, that is, with the minimum expenditure of fuel.
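The same conclusion can be seen from the kinetic energy gained per unit mass when a short burn adds a velocity increment $\Delta v$ along the direction of motion. The sketch below is an impulsive-burn illustration chosen here, not the power argument of the text; the speeds used are rough perigee and apogee values for a geostationary transfer orbit and are assumptions.

```python
def specific_energy_gain(v, dv):
    """Gain in kinetic energy per unit mass for an impulse dv along the velocity v."""
    return 0.5 * (v + dv) ** 2 - 0.5 * v ** 2

V_PERIGEE = 10.2    # km/s, approximate perigee speed (assumed)
V_APOGEE = 1.6      # km/s, approximate apogee speed (assumed)
DV = 0.1            # km/s, the same small impulse in both cases

gain_perigee = specific_energy_gain(V_PERIGEE, DV)
gain_apogee = specific_energy_gain(V_APOGEE, DV)
print(gain_perigee, gain_apogee, gain_perigee / gain_apogee)
```

Since the gain is $v\,\Delta v + \frac12\Delta v^2$, the same impulse is worth several times more orbital energy at perigee than at apogee.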


4.13 PRECESSION OF THE PERIHELIA OF PLANETARY ORBITS DUE TO 
SMALL PERTURBING NONINVERSE SQUARE LAW OF FORCE 

In this book, we have not dealt with the general theory of perturbation. We take this
opportunity to introduce the readers to perturbative analysis, by way of studying the effect
of a small perturbation in a central force field. The theory of perturbation is based on the
main premise that the perturbing component of force is negligibly small compared to the
force contributed by the main source of force. Since we have already analysed in great detail
the motion of particles under an inverse square law of central force, we take the main source
of force as due to an agent producing the field of force that follows a perfect inverse square
law of distance. The perturbing force is assumed to be much weaker in strength, and
itself a central force obeying a different power law of distance.

The effect of including a small non-inverse-square component in the central force is a
departure from the condition of orbital closure or, equivalently, the nonconservation of the
Runge-Lenz vector, which always points towards the instantaneous pericenter of the orbit.

We know that for any central force,

$$r^2\dot\theta = H = \text{const.}$$

and

$$\ddot r - r\dot\theta^2 = F(r) \qquad \left[= -\frac{K}{r^2}\ \text{for the inverse square law}\right]$$

Therefore,

$$\ddot r - \frac{H^2}{r^3} = F(r)$$

We take a circular orbit of radius $a$ so that $\dot r = \ddot r = 0$, which means $F(a) = -H^2/a^3$.
Now for a small perturbation $r = a + \xi$, $\xi \ll a$, we have, to the first order of $\xi/a$,

$$\ddot r = \ddot\xi, \qquad F(a + \xi) = F(a) + \xi F'(a)$$

and

$$r^{-3} = (a + \xi)^{-3} = a^{-3}\left(1 - \frac{3\xi}{a}\right)$$

leading to

$$\ddot\xi - \left[F'(a) + \frac{3}{a}F(a)\right]\xi = 0$$

Hence the period of radial oscillation $\tau_r$ is given by that of $\xi$, that is,

$$\tau_r = 2\pi\left[-F'(a) - \frac{3}{a}F(a)\right]^{-1/2}$$

We know that $r$ takes the extreme values on the line of apse. Let $\psi$ be the apsidal angle
swept by the radius vector $\mathbf r$ between the two consecutive passages through the line of apse
in the same direction. We must then have,

$$\psi = \frac12\,\tau_r\,\dot\theta$$

Since

$$F(a) = -\frac{H^2}{a^3} \qquad\text{and}\qquad \dot\theta = \frac{H}{a^2}$$

we can express $\psi$ as

$$\psi = \pi\left[3 + \frac{aF'(a)}{F(a)}\right]^{-1/2}$$

For an inverse square law, $F(r) = -K/r^2$ so that $aF'(a)/F(a) = -2$, giving,

$$\psi = \pi$$

This happens because the orbital period is equal to the period of radial oscillation.


Now let us allow some perturbation in the force term which follows a noninverse square
law, namely

$$F(r) = F_0(r) + F_1(r)$$

say, where $F_0(r) = -K/r^2$ and $F_1(r)$ is any small correction to $F_0(r)$ with any other
power law of $r$. Then upon substitution in the expression for $\psi$, and further simplification
to first order in $F_1/F_0$, we find

$$\psi = \pi\left[1 - \frac{F_1(a) + \frac12 aF_1'(a)}{F_0(a)}\right]$$

Thus the amount of precession per half period is $\psi - \pi$, and therefore, the angular velocity
of precession is given by,

$$\omega_{\rm prec} = -\frac{2\pi}{P}\,\frac{F_1(a) + \frac12 aF_1'(a)}{F_0(a)} \tag{4.48}$$

where $2\pi/P$ is the orbital angular velocity. The sign of precession depends on those
of the factors $(2 + aF_1'(a)/F_1(a))$, $F_1(a)$ and $F_0(a)$. Thus, for an attractive inverse square law of the main
force $F_0(a)$ and $F_1(a) > 0$,

$$\frac{aF_1'(a)}{F_1(a)} > -2$$

corresponds to the advance of perihelion, while

$$\frac{aF_1'(a)}{F_1(a)} < -2$$

gives retrograde motion of the perihelion; for $F_1(a) < 0$ the two cases are interchanged.
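The apsidal-angle formula and its first-order form can be compared for a sample perturbation. The sketch below takes $\psi = \pi[3 + aF'(a)/F(a)]^{-1/2}$ and the first-order rate $-(2\pi/P)[F_1(a) + \frac12 aF_1'(a)]/F_0(a)$ as its starting point; the function names, the choice $F_1 = -\beta/r^4$ and the numerical values are assumptions made for illustration.

```python
import math

def apsidal_angle(F, dF, a):
    """psi = pi * [3 + a F'(a)/F(a)]^(-1/2) for a nearly circular orbit of radius a."""
    return math.pi / math.sqrt(3.0 + a * dF(a) / F(a))

def precession_rate_first_order(F0, F1, dF1, a, P):
    """First-order precession rate: -(2 pi / P) [F1(a) + a F1'(a)/2] / F0(a)."""
    return -(2.0 * math.pi / P) * (F1(a) + 0.5 * a * dF1(a)) / F0(a)

K, beta, a = 1.0, 0.01, 1.0                  # illustrative units and strength
F0 = lambda r: -K / r**2                     # main inverse square force
dF0 = lambda r: 2.0 * K / r**3
F1 = lambda r: -beta / r**4                  # small attractive perturbation
dF1 = lambda r: 4.0 * beta / r**5
F = lambda r: F0(r) + F1(r)
dF = lambda r: dF0(r) + dF1(r)

P = 2.0 * math.pi * math.sqrt(a**3 / K)      # unperturbed orbital period
rate_from_psi = (apsidal_angle(F, dF, a) - math.pi) / (0.5 * P)
rate_linear = precession_rate_first_order(F0, F1, dF1, a, P)
print(apsidal_angle(F0, dF0, a) / math.pi)   # 1.0 for the pure inverse square law
print(rate_from_psi, rate_linear)            # both positive: the perihelion advances
```

For this attractive $1/r^4$ perturbation $aF_1'/F_1 = -4$ and the precession is an advance, the two estimates differing only at second order in $\beta/Ka^2$.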


Equation (4.48) is valid for nearly circular orbits for which eccentricity e can be neglected. 
We need a more general treatment for moderately eccentric orbits. This can be done using 
the Runge-Lenz vector in the following way. 

Let us define the specific Runge-Lenz vector by, 


A - v x H - Kr (4.49) 

where K = G(Af© 4- m), H = specific angular momentum = r x p//x, fj, = 
reduced mass, f = s cos 6 + j sin 0 and d = - * sin 0 + j cos 6 , 9 being the true 
anomaly measured from the perihelion and * the unit vector along the major axis pointing 


Copyrighted material 




Central Force 147 


from focus to the perihelion. On differentiation, Eq. (4.49) gives, 


dA 

dt 


dv 

dt 


x H - K 


(4.50) 


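since $\mathbf{H}$ is conserved for all types of central forces. Now, using the vector equation of
motion, $d\mathbf{v}/dt = -\nabla V(r)$, where the total gravitational potential

$$V(r) = -\frac{K}{r} + V_1(r) \tag{4.51}$$

with $V_1(r)$ = the potential for the small perturbing noninverse square law of force, Eq.
(4.50) can be reduced, with $\mathbf{H} = r^2\dot{\theta}\,\hat{\mathbf{k}}$ and $d\hat{\mathbf{r}}/dt = \dot{\theta}\,\hat{\boldsymbol{\theta}}$, to

$$\frac{d\mathbf{A}}{dt} = r^2\dot{\theta}\,\frac{dV_1}{dr}\,\hat{\boldsymbol{\theta}} \tag{4.52}$$

$$\frac{d\mathbf{A}}{d\theta} = \frac{d\mathbf{A}}{dt}\bigg/\frac{d\theta}{dt} = r^2\frac{dV_1}{dr}\,\hat{\boldsymbol{\theta}}$$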

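Therefore, the change in the direction of the perihelion, say $\Delta\phi$, in the time in which $\theta$
changes from 0 to $2\pi$ (that is, during one orbital revolution) is given by the magnitude of
the change in $\mathbf{A}$:

$$|d\mathbf{A}|\,\hat{\mathbf{k}} = |\mathbf{A}|\,\Delta\phi\,\hat{\mathbf{k}} = Ke\,\Delta\phi\,\hat{\mathbf{k}} = \int_0^{2\pi} \hat{\mathbf{i}} \times \frac{d\mathbf{A}}{d\theta}\,d\theta = \int_0^{2\pi} r^2\frac{dV_1}{dr}\cos\theta\,d\theta\;\hat{\mathbf{k}}$$

noting that $|d\mathbf{A}|\,\hat{\mathbf{k}} = \hat{\mathbf{i}} \times d\mathbf{A}$. So the angular velocity of precession of perihelion is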

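$$\Omega_p = \frac{\Delta\phi}{P} = -\frac{1}{KeP}\int_0^{2\pi} r^2 F_1(r)\cos\theta\,d\theta \tag{4.53}$$

where

$$F_1(r) = -\frac{\partial V_1(r)}{\partial r}, \qquad r = \frac{a(1 - e^2)}{1 + e\cos\theta}$$

and

$$P = 2\pi\sqrt{\frac{a^3}{K}}$$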


Equation (4.53) is the most general expression for studying the effect on the motion of
the perihelion of orbits in the presence of any small perturbations of a perfect inverse square
law of the main driving force.


4.13.1 Applications 


(i) Precession of Perihelion of Equatorial Orbits of Earth’s Satellites due to the Flattening 
of the Earth 

The gravitational potential due to the flattened earth can be written as

$$V(r) = -\frac{\alpha}{r} - \frac{\beta}{r^3}$$

where

$$\alpha = GM_\oplus$$

and

$$\beta = \frac{1}{2}G(C - A)(1 - 3\cos^2\phi)$$

$\phi = 90^\circ - \lambda$, $\lambda$ being the latitude, $A$ and $C$ are the two principal moments of
inertia of the earth (for its derivation, see section 12.25), $(C - A)/A$ being the measure of
the flattening. $\beta$ is positive for orbits lying close to the equator.

The second term in $V(r)$ is our $V_1(r)$, which corresponds to a central force component
having an inverse fourth power dependence on $r$, that is,

$$F_1(r) = -\frac{3\beta}{r^4}$$

This leads to the angular velocity of precession of the perihelion of the orbit of any geocentric
object, be it an artificial geosatellite or the moon, to be given by

$$\Omega_p = \frac{3\beta}{\alpha^{1/2}a^{7/2}(1 - e^2)^2} \tag{4.54a}$$


Actually, the moon’s orbital perigee precesses due to the oblateness of the earth at a rate 
of about one complete revolution in every 8.8 years. The orbits of artificial satellites, being 
much closer to the earth and having appreciable eccentricity, can precess at a very fast rate, 
sometimes a few degrees of arc a day. Thus, if you spot a satellite moving across the sky 
in the twilight hours, it may be seen next day (or after a few days depending on the exact 
period of its revolution) but with a changed orientation in the sky. 
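
The closed form (4.54a) can be cross-checked by direct quadrature. The following is only a sketch: it assumes Eq. (4.53) in the form $\Omega_p = -(KeP)^{-1}\int_0^{2\pi} r^2F_1(r)\cos\theta\,d\theta$ with $r = a(1-e^2)/(1+e\cos\theta)$ and $P = 2\pi\sqrt{a^3/K}$, takes $F_1 = -3\beta/r^4$, and writes $K$ for $\alpha = GM_\oplus$. The function names and the numerical values of $K$, $\beta$, $a$ and $e$ are arbitrary illustrative choices, not data from the text.

```python
import math


def precession_rate(F1, K, a, e, n=20000):
    """Evaluate the perihelion precession rate of Eq. (4.53) by the midpoint rule.

    F1 is the perturbing force per unit mass as a function of r.
    """
    P = 2.0 * math.pi * math.sqrt(a**3 / K)      # Keplerian orbital period
    dtheta = 2.0 * math.pi / n
    integral = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        r = a * (1.0 - e * e) / (1.0 + e * math.cos(theta))
        integral += r * r * F1(r) * math.cos(theta) * dtheta
    return -integral / (K * e * P)


def oblateness_closed_form(beta, K, a, e):
    """Closed form of Eq. (4.54a) for F1 = -3*beta/r**4."""
    return 3.0 * beta / (math.sqrt(K) * a**3.5 * (1.0 - e * e) ** 2)


# Illustrative (hypothetical) values in arbitrary consistent units.
K, beta, a, e = 1.0, 1.0e-3, 1.0, 0.3

numeric = precession_rate(lambda r: -3.0 * beta / r**4, K, a, e)
closed = oblateness_closed_form(beta, K, a, e)
print(numeric, closed)
```

The same `precession_rate` function accepts any other small perturbing force law, which makes it convenient for comparing Eq. (4.53) with the nearly circular limit (4.48).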

Not only the orbits of satellites around the earth, but also the orbits of planets around 
the sun would show this effect, since the sun must have an oblate spheroid configuration, 
however small, due to its axial rotation. Dicke had once proposed that the solar
oblateness will measurably contribute to the precession of the perihelion of Mercury’s orbit
in space. When Dicke proposed it, the sun’s oblateness was reported to be a few parts in $10^{-5}$,
but the current estimates of the solar oblateness suggest an oblateness possibly no more
than a few parts in $10^{-7}$, in which case, the effect is quantitatively negligible compared to
other factors that lead to the precession of the perihelion of Mercury’s orbit. 


(ii) The General Relativistic Correction to the Newtonian Force

The general theory of relativity, as proposed by Albert Einstein in 1916, suggested that 
Newton’s law of gravitation is only approximately correct. When interpreted in terms of 
the force that acts on any test particle moving under the influence of the sun, the general 
relativistic equations of motion are found to contain some terms which are non-Newtonian, 
one of which is given by 


$$F_1(r) = -\frac{4G^2M_\odot^2}{r^3c^2} = -\frac{\beta}{r^3} \quad \text{(say)}$$


Hopefully, you can also derive this term without knowing much of the general theory of
relativity, if you attempt to work out problem no. 6.9. The Keplerian orbits in general
relativity are still planar in nature, but because of the above force term, there will be a
precession of the line of apsides, the angular velocity of precession amounting to

$$\Omega_p = \frac{\beta}{2(Ka^5)^{1/2}(1 - e^2)} \tag{4.54b}$$

The sun is so massive that this kind of general relativistic effect is discernible in the 
eccentric motions of all the inner planets. Mercury’s orbit being closest to the sun and being 
highly eccentric compared to other planets’, the magnitude of this effect is quite appreciable.
Even though the concept of general relativity was not there in the nineteenth century, this
effect in the form of a discrepancy was first observationally estimated by Leverrier in 1860.
By the turn of the nineteenth century, people had devised at least half a dozen explanations
for this discrepancy, all of which turned out to be mere guesswork and wrong, once Einstein
came up with a prediction from his general theory of relativity in 1916, that exactly matched
this discrepancy, amounting to only 43 arc seconds per century.
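
The figure of 43 arc seconds per century can be checked with a few lines of arithmetic. The sketch below does not use Eq. (4.54b); it assumes the standard general relativistic result that the perihelion advances by $6\pi GM_\odot/[c^2a(1-e^2)]$ radians per orbit, and the orbital elements of Mercury and the value of $GM_\odot$ are assumed reference values rather than numbers taken from the text.

```python
import math

# Assumed reference values (not from the text).
GM_SUN = 1.32712440018e20      # m^3 s^-2
C_LIGHT = 299792458.0          # m s^-1
A_MERCURY = 5.7909e10          # semi-major axis, m
E_MERCURY = 0.2056             # eccentricity
P_MERCURY_DAYS = 87.969        # orbital period, days


def gr_precession_arcsec_per_century(gm, a, e, period_days):
    """Perihelion advance from the standard per-orbit formula 6*pi*GM / (c^2 a (1 - e^2))."""
    per_orbit_rad = 6.0 * math.pi * gm / (C_LIGHT**2 * a * (1.0 - e * e))
    orbits_per_century = 36525.0 / period_days
    return math.degrees(per_orbit_rad * orbits_per_century) * 3600.0


mercury_gr = gr_precession_arcsec_per_century(
    GM_SUN, A_MERCURY, E_MERCURY, P_MERCURY_DAYS
)
print(f"GR perihelion advance of Mercury: {mercury_gr:.1f} arcsec/century")
```

Changing the orbital elements to those of Venus or the earth shows how quickly the effect falls off with distance from the sun and with decreasing eccentricity.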


(iii) Perturbation due to Other Planets on a Given Planet

In the presence of a third object a Keplerian pair experiences a perturbative force due to 
the third object. Since there are nine planets in the solar system the motion of any planet 
is disturbed by the eight other planets. The long term effect on the perihelion motion of
any planet will be due to these extra perturbations, suitably smoothed over a sufficiently
long period of time.

We know that a lighted joss stick appears as a continuous streak of light when the lighted
end is moved fast. Following this analogy, a planet of mass $m$ orbiting in a circular orbit
of radius $R$ can be viewed as a ring of radius $R$ with mass per unit length $\lambda = m/(2\pi R)$
to an observer who perceives centuries as short as blinks of an eye. The force field due to
such a planetary ring can be shown to have the form

$$F_1(r) = \frac{\pi G\lambda r}{R^2 - r^2} \tag{4.55}$$

where $R$ is the orbital radius of the perturbing planet (or equivalently, of the ring), $r$ is the
radial distance to any arbitrary point from the centre of the ring, and $\lambda$ is the linear mass
density of the perturbing planet along the ring (orbit). The value of the total perturbing
force at the location of the perturbed planet (having an orbital radius $a$) then becomes,

$$F_1(a) = \sum_i \frac{\pi G\lambda_i a}{R_i^2 - a^2}$$

giving

$$F_1'(a) = \sum_i \pi G\lambda_i\,\frac{R_i^2 + a^2}{(R_i^2 - a^2)^2}$$

where the sum extends over all the perturbing planets: the $i$th one having mass $m_i$, orbital
radius $R_i$ and $\lambda_i = m_i/2\pi R_i$. The inverse square law of force due to the sun on the
concerned planet is, of course,

$$F_0(a) = -\frac{GM_\odot}{a^2}$$


We now apply this formulation to the study of the perihelion precession of the orbit of
Mercury around the sun. When we put in all the relevant numbers, taken from any elementary
book on astronomy, the bracketed quantity in Eq. (4.48) becomes for Mercury’s orbit,

$$\frac{F_1(a)}{F_0} + \frac{aF_1'(a)}{2F_0} = -9.84 \times 10^{-7}$$

due to all other planets, which results in $\Omega_p$ = 529 arcsec/century. The general relativistic
correction as given by Eq. (4.54b) gives another 43 arcsec/century. The secular motion
of the ecliptic adds further to this 2.3 arcsec/century. Precession of the origin, namely the
vernal equinox (for definition see section 4.8, and for calculation see section 12.24), adds
-5029 arcsec/century. So the observed rate with respect to the moving equinox is

$$529 + 43 + 2.3 - (-5029) = 5603.3 \text{ arcsec/century}$$


However, the true advance of perihelion of Mercury in space (in an inertial frame) is only 
529 + 43 = 572 arcsec/century 


Thus, the general relativistic contribution to the rate of advance of the perihelion of 
Mercury is quite insignificant compared to the total amount that one observes while sitting 
on the earth. Even so, this effect is considered to be one of the five major observable effects 
that the general theory of relativity has proposed so far and the first one to be tested 
straightway at the time of the proposition of the theory in 1916. 
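
The planetary contribution quoted above can be reproduced approximately from the ring model. The sketch below assumes the ring field $F_1(r) = \pi G\lambda r/(R^2 - r^2)$ of Eq. (4.55), the solar force $F_0(a) = -GM_\odot/a^2$ and the nearly circular formula (4.48) in the form $\Omega_p = -(2\pi/P)[F_1 + \tfrac{1}{2}aF_1']/F_0$. The planetary masses (in solar masses) and orbital radii (in AU) are assumed reference values, Pluto is neglected, and $G$ cancels in the ratio, so it is set to unity.

```python
import math

# Assumed reference data: (mass / solar mass, orbital radius / AU).
PERTURBERS = {
    "Venus": (2.448e-6, 0.7233),
    "Earth+Moon": (3.040e-6, 1.0000),
    "Mars": (3.227e-7, 1.5237),
    "Jupiter": (9.548e-4, 5.2028),
    "Saturn": (2.859e-4, 9.5388),
    "Uranus": (4.366e-5, 19.191),
    "Neptune": (5.151e-5, 30.069),
}
A_MERCURY = 0.3871        # AU
P_MERCURY_YEARS = 0.24085


def ring_bracket(a, perturbers, m_sun=1.0):
    """Return [F1(a) + a*F1'(a)/2] / F0(a) for the ring model (units with G = 1)."""
    f1 = 0.0
    f1_prime = 0.0
    for mass, R in perturbers.values():
        pi_g_lambda = mass / (2.0 * R)          # pi * G * lambda_i with lambda_i = m_i/(2 pi R_i)
        f1 += pi_g_lambda * a / (R * R - a * a)
        f1_prime += pi_g_lambda * (R * R + a * a) / (R * R - a * a) ** 2
    f0 = -m_sun / (a * a)
    return (f1 + 0.5 * a * f1_prime) / f0


bracket = ring_bracket(A_MERCURY, PERTURBERS)
orbits_per_century = 100.0 / P_MERCURY_YEARS
omega_p = -bracket * orbits_per_century * 360.0 * 3600.0   # arcsec per century
print(f"bracket = {bracket:.3e}, precession = {omega_p:.0f} arcsec/century")
```

With these inputs the result comes out within about one per cent of the quoted 529 arcsec/century, with Venus, Jupiter and the earth supplying most of the effect; the small difference reflects the choice of input data.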


4.14 THE BASIC PHYSICS OF TIDES 

The shore line of water over any sea beach is seen to advance and recede twice a day. 
This phenomenon is known as ocean tides. We aim to formulate the laws of ocean tides 
in the present section. Newton was the first person to give the correct reasoning for this 
phenomenon. However, his treatment lacked rigour, and was satisfactorily improved by Sir
G. H. Darwin in 1883. The treatment is far from simple, even though its essence can be
brought out in terms of simple laws of physics. In some text books, a discussion of tides
is available, but it is either too simplified, or erroneously presented. More modern and
sophisticated treatments are provided by A. T. Doodson (1921) and G. Godin (1972). In
fact, all the quantitative details of oceanic tides have begun to be understood only very 
recently. The long term effects of tides on the motion of the satellites is one of the current 
interests of planetary scientists, as it is connected to the origin and internal structures of 
the planets and satellites. Tidal effects are also important in the studies of accretion discs 
around massive black holes, neutron stars, etc. So it is desirable that we have some proper 
introduction to the basic physics of tides. 

A central force cannot be totally uniform inside any finite volume of space, be it far or 
near, large or small. Hence over the extent of any physical body, the applied gravitational 
force cannot be totally uniform. A rigid body can withstand this nonuniformity of force 
but a fluid body cannot. A fluid will gradually shift to a place where the applied force is 
relatively stronger. This is the basic cause of tides produced in any non-rigid body of finite 
size. 




Fig. 4.10 Origin of tidal bulges explained 


(i) Tidal Bulge 

Let us now see the effect of the inverse square law of gravitation. Let there be a freely falling
small spherical observatory in space under the action of the gravitational force of the earth
(see Fig. 4.10). The field outside the earth is effectively due to a point mass placed at the
centre of the earth. Let $H_0$ be the distance from the centre of the earth to the centre of
the observatory $O$ and $r_0$ be the radius of the freely falling observatory. The downward
acceleration due to gravity at $O$ is given by

$$g_0 = \frac{GM_\oplus}{H_0^2} \qquad [M_\oplus: \text{mass of the earth}]$$

The downward acceleration due to gravity at the topmost point $T$ of the observatory is

$$g_t = \frac{GM_\oplus}{(H_0 + r_0)^2} \simeq g_0\left(1 - \frac{2r_0}{H_0} + \cdots\right) < g_0 \tag{4.56}$$

The downward acceleration due to gravity at the bottommost point $B$ of the observatory is

$$g_b = \frac{GM_\oplus}{(H_0 - r_0)^2} \simeq g_0\left(1 + \frac{2r_0}{H_0} + \cdots\right) > g_0 \tag{4.57}$$

Thus if a particle is released at $O$ it will fall uniformly with the observatory with the
downward acceleration $g_0$ and always remain at the centre of the freely falling observatory.
But if the particle is released at the top it will have less downward acceleration than at $O$
and therefore it will start lagging behind $O$, with the result that its distance from $O$ will be
increasing with time with a relative acceleration $2r_0g_0/H_0$. On the other hand, if the particle
had been released at $B$, it would have accelerated at a relatively higher rate than the one at
$O$, the difference in the acceleration being again $2r_0g_0/H_0$. So the distance between $T$ and
$B$ would gradually increase with time, while the centre and the equatorial plane remain
essentially unchanged. In other words, had the wall of the freely falling observatory been
made up of fluid material or loose particles, the sphericity of the observatory would have been
destroyed and it would have assumed the shape of a prolate spheroid. This effect is called
the tidal bulging of a large enough fluid sphere. The axis of the bulge is always aligned along
the line of action of the externally acting perturbing central force, which again corresponds
to the direction of the source of the tide raising perturbation.
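
The first-order estimates (4.56) and (4.57) are easy to test numerically. The sketch below compares the exact inverse square accelerations at the top and bottom of the observatory with the common first-order difference $2r_0g_0/H_0$; the value of $GM_\oplus$ and the chosen $H_0$ and $r_0$ are assumed, purely illustrative numbers.

```python
GM_EARTH = 3.986004e14   # m^3 s^-2, assumed reference value


def gravity(distance):
    """Inverse square acceleration at a given distance from the earth's centre."""
    return GM_EARTH / distance**2


H0 = 7.0e6    # m, distance of the observatory centre from the earth's centre (illustrative)
r0 = 10.0     # m, radius of the observatory (illustrative)

g0 = gravity(H0)
g_top = gravity(H0 + r0)
g_bottom = gravity(H0 - r0)
first_order = 2.0 * r0 * g0 / H0   # relative acceleration predicted by Eqs. (4.56)-(4.57)

print(g0 - g_top, g_bottom - g0, first_order)
```

For an observatory this small the three printed numbers agree to about one part in a million; the agreement degrades as $r_0/H_0$ grows, which is where the higher-order terms of the expansion start to matter.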

(ii) Tidal Forces on the Earth due to the Moon 

The earth is about 81 times more massive than the moon, their centre to centre separation 
is about 60 times the radius of the earth and their diameters are roughly in the ratio 11:3.
Since we are considering the tidal forces on earth due to moon, the surface of the earth now 
behaves like a freely falling observatory in the gravitational field of the moon. 



$OPN$: zenith direction at $P$
$\angle NPO'$: zenith distance of the moon
$\alpha = \angle OO'P$: angle subtended at the moon by the radius vector $OP$


Fig. 4.11 The gravitational force of the moon’s attraction on a particle on the surface of
the earth, depending on the zenithal angle and distance of the moon from the
location of the particle


At any arbitrary point $P$ on the surface of the earth (see Fig. 4.11) the magnitude of the
gravitational acceleration due to the moon is

$$f = \frac{Gm}{R^2}$$

where $m$ is the mass of the moon and

$R$ = distance $O'P = \sqrt{H_0^2 + r_0^2 - 2H_0r_0\cos\theta}$,
$H_0 = OO'$, $r_0 = OP$ (earth's radius) and $\theta = \angle O'OP$.

Now the zenithal (or the radial) component of the acceleration at $P$ is (see Fig. 4.11)

$$f_r = \frac{Gm}{R^2}\cos(\theta + \alpha)$$

and the horizontal (or the tangential) component is

$$f_\theta = -\frac{Gm}{R^2}\sin(\theta + \alpha)$$

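which is algebraically positive in the direction of increasing $\theta$. Using the following Taylor
series expansion

$$\frac{1}{R^2} = \frac{1}{H_0^2}\left[1 - \frac{2r_0\cos\theta}{H_0} + \frac{r_0^2}{H_0^2}\right]^{-1} \simeq \frac{1}{H_0^2}\left[1 + \frac{2r_0\cos\theta}{H_0} + \cdots\right]$$

to the first order of approximation, and using

$$\sin\alpha = \frac{r_0\sin\theta}{R} \simeq \frac{r_0\sin\theta}{H_0} \quad\text{and}\quad \cos\alpha = \sqrt{1 - \sin^2\alpha} \simeq 1$$

and

$$\cos(\theta + \alpha) = \cos\theta\cos\alpha - \sin\theta\sin\alpha \simeq \cos\theta - \frac{r_0\sin^2\theta}{H_0}$$

$$\sin(\theta + \alpha) = \sin\theta\cos\alpha + \cos\theta\sin\alpha \simeq \sin\theta + \frac{r_0\sin\theta\cos\theta}{H_0}$$

we get, for $f_r$ and $f_\theta$,

$$f_r = \frac{Gm}{H_0^2}\left[\cos\theta - \frac{r_0}{H_0}(1 - 3\cos^2\theta) + \cdots\right]$$

$$f_\theta = \frac{Gm}{H_0^2}\left[-\sin\theta - \frac{r_0}{H_0}(3\sin\theta\cos\theta) + \cdots\right]$$

The magnitude of the total acceleration at $P$ due to the moon is given by,

$$f = \frac{Gm}{R^2} = \frac{Gm}{H_0^2}\left[1 + \frac{2r_0\cos\theta}{H_0} + \cdots\right] = f_0\left[1 + \frac{2r_0\cos\theta}{H_0} + \cdots\right] \tag{4.58}$$

where $f_0 = Gm/H_0^2$ is the acceleration at $O$ due to the moon at $O'$. We can write $f_r$ and
$f_\theta$ in terms of $f_0$:

$$f_r \simeq f_0\cos\theta - f_0\left(\frac{r_0}{H_0}\right)(1 - 3\cos^2\theta) \tag{4.59}$$

$$f_\theta \simeq -f_0\sin\theta - f_0\left(\frac{r_0}{H_0}\right)(3\sin\theta\cos\theta) \tag{4.60}$$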


These equations specify all the extra force components due to the presence of the moon
in its orbit around the earth, acting at any point on the surface of the earth, apart from the
earth’s own gravitational and centrifugal forces that act at the same point. Obviously, the
magnitudes of $f_r$ and $f_\theta$ are extremely tiny compared to $g_{\rm eff}$, being only of the order of 2
to 4 $\times 10^{-6}\,g_{\rm eff}$.
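
The quality of the first-order expressions (4.59) and (4.60) can be judged by comparing them with the exact components obtained from the geometry of Fig. 4.11, where $\cos(\theta+\alpha) = (H_0\cos\theta - r_0)/R$ and $\sin(\theta+\alpha) = H_0\sin\theta/R$. The sketch below does this for a few values of $\theta$; the lunar mass, the earth-moon distance and the earth's radius are assumed reference values.

```python
import math

# Assumed reference values (not from the text).
G = 6.674e-11        # m^3 kg^-1 s^-2
M_MOON = 7.342e22    # kg
H0 = 3.844e8         # m, earth-moon centre-to-centre distance
R0 = 6.371e6         # m, radius of the earth
F0 = G * M_MOON / H0**2


def exact_components(theta):
    """Exact zenithal and tangential lunar accelerations at P."""
    R = math.sqrt(H0**2 + R0**2 - 2.0 * H0 * R0 * math.cos(theta))
    f = G * M_MOON / R**2
    cos_sum = (H0 * math.cos(theta) - R0) / R    # cos(theta + alpha)
    sin_sum = H0 * math.sin(theta) / R           # sin(theta + alpha)
    return f * cos_sum, -f * sin_sum


def first_order_components(theta):
    """First-order expressions of Eqs. (4.59) and (4.60)."""
    eps = R0 / H0
    f_r = F0 * (math.cos(theta) - eps * (1.0 - 3.0 * math.cos(theta) ** 2))
    f_theta = F0 * (-math.sin(theta) - eps * 3.0 * math.sin(theta) * math.cos(theta))
    return f_r, f_theta


max_relative_error = 0.0
for degrees in range(0, 181, 15):
    theta = math.radians(degrees)
    exact_r, exact_t = exact_components(theta)
    approx_r, approx_t = first_order_components(theta)
    err = max(abs(exact_r - approx_r), abs(exact_t - approx_t)) / F0
    max_relative_error = max(max_relative_error, err)

print(f"largest error in units of f0: {max_relative_error:.2e}")
```

The residual is of second order in $r_0/H_0$, that is, a few parts in $10^4$ of $f_0$, so the first-order tidal terms capture the nonuniformity of the lunar field very well.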

(iii) The Equilibrium Tidal Heights due to the Moon

The first terms in $f_r$ and $f_\theta$ are merely the two components of $f_0$ and they correspond
to the same free fall acceleration of $P$ as of $O$. They are like the first term $g_0$ appearing in
the expressions for $g_t$ and $g_b$ in the example of the freely falling observatory. We have seen
that this term does not correspond to the tidal force. However if the observatory were held
fixed in space with respect to the force centre, then $O$ would no longer have been accelerated
and therefore $g_t$ and $g_b$ would have been fully manifested. Since neither the moon nor the
earth is fixed in space, they are all truly freely falling observatories (remember that all free
orbits including the circular ones are freely falling trajectories, falling due to the action of
the centripetal force) and the first terms in the expressions for $f_r$ and $f_\theta$ correspond to
the commonly shared acceleration of all points of the body, that is, of the earth as a whole.
But by definition, tidal forces are the nonuniform part of the total force, so any constant
force can be subtracted or added to the total force without affecting the tidal behaviour
in the slightest. So the truly differential accelerations are given by the second term in the
expressions for $f_r$ and $f_\theta$ and are called the primary terms of tidal acceleration.

Again, by definition, $f_r$ acts radially at $P$ and merely adds (insignificantly though!) to
the local gravity of the earth so that the weight of all bodies resting at $P$ increases by the
amount $[-f_0(r_0/H_0)(1 - 3\cos^2\theta)]$ per unit mass of the bodies and hence this component
cannot produce any visible change in the value of $g_{\rm eff}$, the change occurring only at the
eighth significant digit. However, the other tidal component $f_\theta$ is not counteracted by any
reaction force in that direction. This is therefore bound to produce a free motion of any fluid
at $P$ along the tangential direction. Hence the primary component of the tide producing
acceleration is given by the second term in the expression for $f_\theta$ in Eq. (4.60),

$$f_{\text{tidal}} = -\frac{3}{2}f_0\left(\frac{r_0}{H_0}\right)\sin 2\theta \tag{4.61}$$

where $\theta$ is roughly the zenith angle of the moon as seen from the point $P$ on the earth. The
distribution of this force is shown in Fig. 4.12. The magnitude of the tidal force becomes
maximum at $\theta = 45^\circ$ and $135^\circ$ and it vanishes at $\theta = 0^\circ$, $90^\circ$ and $180^\circ$. Basically
the fluid is pulled in two opposite directions in the two hemispheres divided by the plane
passing through the centre of the earth and perpendicular to the earth moon synodic line.
Fluid will tend to accumulate in both the sublunar and antilunar regions.


The work done by the tidal force per unit mass of the fluid between $\theta = \pi/2$ and
$\theta = z$ = the zenithal angle of the moon when it reaches the highest point in the sky, is

$$W = \int_{\pi/2}^{z} f_{\text{tidal}}\,r_0\,d\theta = \frac{3}{2}f_0\left(\frac{r_0^2}{H_0}\right)\cos^2 z \tag{4.62}$$

This work must be equal to $gh_{\text{tidal}}$, where $h_{\text{tidal}}$ is the height of tide measured from its
lowest level:

$$h_{\text{tidal}} = \frac{3}{2}\left(\frac{m}{M_\oplus}\right)\left(\frac{r_0}{H_0}\right)^3 r_0\cos^2 z \tag{4.63}$$

$\simeq 0.54$ metres for $z = 0$


Fig. 4.12 The angular distribution of the $\theta$-component of the lunar tidal
acceleration over the surface of the earth in a meridional plane of
the moon

Thus as obtained in Eq. (4.63), about 0.5 m would have been the maximum tidal height
(that is, the height difference between the low and the high tides) if the earth were not rotating
(which is the condition for equilibrium tide) and were covered uniformly with water. Because
of the fast rotation of the earth, the equilibrium tidal configuration is never achieved; there
always remains a lag as a huge mass of oceanic water has to move physically from one region
to another in order to keep the tidal bulge aligned as close to the direction of the moon as
possible. The tidal lag is represented by an angle $\delta$, which amounts to about 2.16 degrees
of arc for the oceanic tides on the earth.
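
The equilibrium height follows from a one-line evaluation of $h = \tfrac{3}{2}(m/M_\oplus)(r_0/H_0)^3 r_0\cos^2 z$. The sketch below uses the round ratios quoted earlier in this section (earth about 81 times as massive as the moon, separation about 60 earth radii); the earth's radius of 6371 km is an assumed value, and the function name is hypothetical.

```python
import math


def equilibrium_tide_height(mass_ratio, distance_ratio, r0, z):
    """Equilibrium tidal height, h = (3/2) (m/M) (r0/H0)^3 r0 cos^2 z.

    mass_ratio     -- m / M, tide raising body over the tidally distorted body
    distance_ratio -- r0 / H0
    r0             -- radius of the distorted body, in metres
    z              -- zenith angle of the tide raising body at culmination, in radians
    """
    return 1.5 * mass_ratio * distance_ratio**3 * r0 * math.cos(z) ** 2


R_EARTH = 6.371e6   # m, assumed reference value

h_overhead = equilibrium_tide_height(1.0 / 81.0, 1.0 / 60.0, R_EARTH, 0.0)
h_at_45 = equilibrium_tide_height(1.0 / 81.0, 1.0 / 60.0, R_EARTH, math.radians(45.0))
print(f"z = 0:  {h_overhead:.2f} m")
print(f"z = 45 degrees: {h_at_45:.2f} m")
```

The $\cos^2 z$ factor halves the equilibrium height when the moon culminates 45° from the zenith, which is one reason the tidal range depends on latitude and on the moon's declination.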

There is another big effect due to the nonequilibrium nature of the oceanic tides. While 
the huge mass of water physically transports itself from one place to another, it interacts 
dynamically with the bottom surface of the sea as well as the topography of the shore line. 
During the process of filling and emptying various cavity like regions, the phenomenon of 
cavity resonance on large scale becomes an important factor in locally enhancing the height 
of tides over and above the height of the equilibrium tide. The enhancement due to the 
peculiarities of the land profile, resonance due to shallow water trapped near fjords, gravity 


waves, etc. can be as large as a factor of 10 - 50 compared to the height of the equilibrium 
tide. The calculations of all these effects are not at all easy to perform, and therefore, some 
semiempirical fits for the amplitudes of different modes of tides are obtained for each place 
separately, which are used to prepare the charts for the tidal prediction. 

(iv) Why are there Two Tides a Day? 

The earth rotates about its spin axis passing through the geographical north pole once a 
day, while the moon orbits the earth only once every 27.32 days. Therefore every point on 
the earth’s surface, particularly the equatorial and midlatitude regions must pass through 
one sublunar and one antilunar bulge every day, thereby experiencing two tides a day. Since 
the moon advances by about 13° eastward every day, the sublunar point also advances in 
space by the same amount and the earth has to rotate this extra 13° (which takes about
52 minutes) to reach the sublunar point the next day. Hence two consecutive high tides are 
on an average separated by 12 hrs and 26 mins and two consecutive sublunar high tides by 
24 hrs and 52 mins. The sublunar tides are known as primary tides and the antilunar tides 
as the secondary tides. 
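
The timing argument above amounts to a short piece of arithmetic, sketched below. It assumes, as the text does, that the earth turns through 1° in about 4 minutes and that the moon completes one orbit in 27.32 days; the variable names are illustrative.

```python
SIDEREAL_MONTH_DAYS = 27.32     # orbital period of the moon, as quoted in the text
MINUTES_PER_DEGREE = 4.0        # the earth rotates about 1 degree every 4 minutes

moon_advance_deg_per_day = 360.0 / SIDEREAL_MONTH_DAYS
extra_minutes = moon_advance_deg_per_day * MINUTES_PER_DEGREE

lunar_day_minutes = 24.0 * 60.0 + extra_minutes      # sublunar tide to sublunar tide
tide_interval_minutes = lunar_day_minutes / 2.0      # one high tide to the next

hours, minutes = divmod(tide_interval_minutes, 60.0)
print(f"moon advances {moon_advance_deg_per_day:.2f} degrees per day")
print(f"high tides are separated by {int(hours)} h {minutes:.0f} min")
```

This simple estimate ignores the small difference between the solar and the sidereal day, so it should be read as an order-of-magnitude check rather than an ephemeris calculation.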

When the tides over a given place achieve their maximum heights during any time of 
the day or night depending on the time of the upper and lower meridional passages of the 
moon over that place, they are called high tides and when the water recedes to the lowest 
levels such tides are called low tides. Usually the tidal heights are measured from the zero 
level set by the lowest possible of all low tides, observed through years. The amplitudes of 
the high and low tides are not the same every day. They follow a monthly as well as a yearly
cycle. The monthly variation of the tidal amplitude of the high and low tides is due to 
the interference of the solar contribution of tide to the lunar one. Since the sun and the 
moon do not come over the meridian at the same time everyday, their tides are not added in 
phase; so sometimes the effects are cancelling and at other times enhancing the lunar tide. 
The seasonal and yearly variations are due to the change in the relative orientation of the 
diurnal track of the moon over any place, and the variation of its distance from the earth. 

The height of the highest of all high tides on the east coast of the Arabian sea is about 
5.5 m during the quiet conditions of the climate. Inside the fjords, or between a big island 
and its nearest mainland, or during storms, the tidal heights can reach anywhere between 
10 m and 50 m. 

(v) Relative Tidal Heights due to the Moon and the Sun 

The relative tidal forces are determined essentially by the factor $mr_0/H_0^3$, where $m$ is the
mass of the tide raising body, $r_0$ is the radius of the object on which tide is raised and $H_0$
is the distance between them. Therefore,

$$\frac{\text{Tidal forces due to moon}}{\text{Tidal forces due to sun}} = \frac{[m/H_0^3]_{\text{moon}}}{[m/H_0^3]_{\text{sun}}} = \frac{\rho_m}{\rho_s}\left(\frac{\theta_m}{\theta_s}\right)^3 \tag{4.64}$$

where $\theta_m$ and $\theta_s$ are the angular radii of the moon and the sun respectively (as seen from
the earth) and $\rho_m$, $\rho_s$ their average mass densities. The sun and the moon have
almost equal angular sizes as seen from the earth, that is,

$$\theta_m \simeq \theta_s$$

and therefore,

$$\frac{\text{Tidal forces due to moon on earth}}{\text{Tidal forces due to sun on earth}} \simeq \frac{\rho_m}{\rho_s} = \frac{3.34 \times 10^3\ \text{kg/m}^3}{1.41 \times 10^3\ \text{kg/m}^3} \simeq 2.4$$

During the new moons and the full moons the sun, the moon and the earth are nearly 
aligned and the two tidal forces add to each other. But during the quarters, the sun and 
the moon are at right angles with respect to the earth and as a result produce an opposing 
tidal effect. The high tides during any new moon and full moon are called spring tides and 
are supposed to be (2.5 + 1.0)/(2.5 - 1.0) = 7/3 times higher than the neap tides that 
occur during the quarter moons. There are also seasonal and other long term variations in 
the tidal heights due to the variation in the maximum altitude and distance of the sun and 
the moon from any given place on the surface of the earth. 
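
The density argument of Eq. (4.64) can be compared with a direct evaluation of $m/H_0^3$ for the two bodies. In the sketch below the densities are those quoted in the text, while the masses and distances of the moon and the sun are assumed reference values; the spring-to-neap ratio is formed in the same way as in the text, as (moon + sun)/(moon - sun).

```python
RHO_MOON = 3.34e3    # kg/m^3, as quoted in the text
RHO_SUN = 1.41e3     # kg/m^3, as quoted in the text

# Assumed reference values (not from the text).
M_MOON, D_MOON = 7.342e22, 3.844e8     # kg, m
M_SUN, D_SUN = 1.989e30, 1.496e11      # kg, m


def tidal_ratio_from_densities(rho_a, rho_b, theta_a=1.0, theta_b=1.0):
    """Ratio of tidal forces from Eq. (4.64): (rho_a / rho_b) * (theta_a / theta_b)**3."""
    return (rho_a / rho_b) * (theta_a / theta_b) ** 3


def tidal_ratio_from_masses(m_a, d_a, m_b, d_b):
    """Ratio of tidal forces from the factor m / H0**3."""
    return (m_a / d_a**3) / (m_b / d_b**3)


ratio_density = tidal_ratio_from_densities(RHO_MOON, RHO_SUN)
ratio_direct = tidal_ratio_from_masses(M_MOON, D_MOON, M_SUN, D_SUN)
spring_to_neap = (ratio_direct + 1.0) / (ratio_direct - 1.0)

print(f"from densities: {ratio_density:.2f}")
print(f"from m/H0^3:    {ratio_direct:.2f}")
print(f"spring/neap:    {spring_to_neap:.2f}")
```

The two estimates differ by roughly ten per cent because the angular radii of the sun and the moon are only approximately equal, and the cube in Eq. (4.64) amplifies that small difference.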

(vi) Solid Tides 

Since no object is perfectly rigid, there will always be some tidal distortions in the solid 
earth too, due to the moon (and also the sun). But the distortions due to solid tides amount
to hardly a few centimetres and occur inside the body of the earth. However, for the
satellites of Jupiter or of Saturn, these distortions (caused by the respective planets) can
be quite substantial, so much so that no satellite can even survive inside a certain distance
from the parent planet. More distant ones, such as Jupiter’s Io, can show volcanic
effects purely due to tidal heating of their mantle. It is believed that the rings of Saturn,
Jupiter, Uranus and Neptune have formed due to tidal disruption in the past of some of 
their regular satellites. The limiting distance from the planet within which a liquid satellite 
disrupts completely due to tidal effects is called the Roche limit which was first deduced by 
Roche and was found by him to be about 2.46 times the radius of the parent planet. In fact, 
all the major rings of Saturn lie within this critical Roche limit for Saturn. A simplified 
idea of how this limit can be derived has been outlined in the problem no 4.25. 

(vii) Tidal Torque, Tidal Dissipation and Lengthening of Day

Since the earth is rotating, the tidal bulge produced by the moon has to shift continually 
on the surface of the earth in order to keep itself up in phase with the moon. The axis of 
the bulge, which should always point toward the moon, has to shift with time at a rate of 
about 1° every 4 minutes carrying the required amount of oceanic water with it. In the 
process the motion of water experiences the viscous forces of drag within itself and at its 
contact surface with the sea bed. This has got two effects. (a) The tidal bulge lags behind
the arrival of the moon on the meridian (the meridian is the vertical plane passing through the
north south points). (b) This angular lag ($\delta$) is the cause of the tidal torque that acts on
the earth and is of course produced by the moon. The moon experiences the reaction in the
form of an equal and opposite tidal torque. The magnitude of the tidal torque is given by

$$\Gamma \propto k_2\sin 2\delta \tag{4.65}$$

The constant of proportionality, called the Love number (usually denoted by $k_2$), differs
from object to object; for the earth it is 0.9, for the moon it is about 0.06. The value of
$\delta$ for the earth due to lunar tide is about 2.16°. This torque results in a secular (meaning
longterm) increase of the earth’s period of rotation by about 2.4 millisecond/century or
about an hour every 160 million years. So about 160 million years ago, the length of the 
day was about 23 hours! However over smaller time intervals, say over a few months, years, 
or even decades, earth’s rotation period fluctuates with an amplitude approximately 100 
times larger than the above secular one. Such short-term fluctuations are possibly due to 
atmospheric motions, oceanic currents, tectonic movements inside the earth, moon’s orbital 
nutation, etc. Consequently we have to add or subtract 1 leap second every few months or
even every few years. However the secular slowing down of the earth’s angular speed of rotation
will continue until a steady state is reached for which $\delta = 0$ (see Eq. (4.65); the tidal
torque vanishes, that is, the tidal bulge is always pointing toward the moon). This will be
identically satisfied only when the length of the day on earth becomes equal to the length 
of the month, which is the orbital period of revolution of the moon around the earth. The 
moon has already attained this state because of the tidal torques of far greater amplitude 
produced by the earth on the moon during its early phases, and now from the earth we 
always see the same face of the moon. The moon is tidally locked to the earth. Similarly, 
there will also come a day when the moon will see the same face of the earth. For this to 
happen, we may have to wait for another 4 billion years or so. 


4.15 SCATTERING IN A CONSERVATIVE CENTRAL FORCE FIELD 
(i) Scattering Cross-section 

Scattering is a process in which an entity changes its direction of motion due to its close 
approach to or encounter with an agent which interacts via its own force fields. Usually 
classical particles as well as quantum objects like photons, electrons, etc. are subject to 
scattering by suitable agents called scatterers. In any physical encounter, two things may 
happen; the particle is either scattered or absorbed by the agent. If there is any stream 
(or flux or beam) of particles, the total loss from the incident stream must be the sum of 
the losses due to scattering and absorption. This sum effect is called the extinction, and by 
definition, 

extinction = scattering + absorption 
Now we define a few quantities. 

Flux density: This is also called the intensity. It is the number of particles that are emitted
in a direction implied by their velocity $\mathbf{v}$, through unit area normal to $\mathbf{v}$ and per unit solid
angle around the direction of $\mathbf{v}$.

If the incident beam is sent in a particular direction with some definite velocity $\mathbf{v}$ of each
particle and the number density at any instant of time inside the beam is, say, $n_0$, then the
incident intensity $I$ is defined to be

$$I(\mathbf{v}) = n_0|\mathbf{v}| = \text{total number of particles passing through a unit area normal to } \mathbf{v} \text{ per unit time.} \tag{4.66}$$

However, if the incident beam has got a spread in the solid angle then the incident intensity
has to be defined as

$$I(\Omega, \mathbf{v}) = n(\Omega)\,|\mathbf{v}| \tag{4.67}$$

For the unidirectional beams, after the interaction with the scattering agent some of the 
particles are deflected, some might even be absorbed by the agent itself and the rest will 
still follow the incident direction. The intensity of the last component called the emergent 
intensity is always less than or at most equal to the incident intensity, assuming that the 
speed of the particles remains the same at large distances from the force centre of scattering. 

Total scattering cross-section: The actual time rate of loss of particles due to scattering
in different directions may be characterised by the effective loss of the area normal to the
direction of the beam that originally contained these scattered particles. This loss of total
effective area from the incident beam cross-section is called the cross-section of scattering.
In other words, if the total cross-section of scattering is said to be $\sigma_T$, it simply means that
the number of particles that were incident (per unit time) through the cross-sectional area
$\sigma_T$ are now missing due to the fact that they are continually scattered away in directions
other than the incident beam direction. So by definition

$$\sigma_T = \frac{\text{Total number of particles scattered per unit time}}{\text{Total number of particles incident per unit normal area per unit time}} = \frac{\text{Total number of particles scattered per unit time}}{\text{Incident intensity}} \tag{4.68}$$


Differential scattering cross-section: One may also define a differential scattering cross-section
$\sigma(\Omega)$ if the scattering in a particular direction is to be estimated. By definition,

$$\sigma(\Omega)\,d\Omega = \frac{\text{Total number of particles scattered per unit time into the solid angle } d\Omega \text{ around the direction defined by } \Omega}{\text{Incident intensity}} \tag{4.69}$$

Basically $\sigma(\Omega)\,d\Omega$ corresponds to the effective loss of normal area from the incident beam,
from which particles are continually scattered into the solid angle $d\Omega$ around the direction
implied by $\Omega$. It is easy to see that

$$\sigma_T = \int \sigma(\Omega)\,d\Omega \tag{4.70}$$

It is to be noted that in the spherical polar coordinates with its polar axis coinciding with
the incident beam direction, $d\Omega = \sin\theta\,d\theta\,d\phi$. If there is $\phi$-symmetry in the problem, $d\phi$
can be integrated from 0 to $2\pi$ and $d\Omega$ can be expressed as a function of $\theta$ alone, and
it will no longer represent a particular direction but a particular conical section between
$\theta$ and $\theta + d\theta$. In this case $d\Omega$ becomes

$$d\Omega = 2\pi\sin\theta\,d\theta \tag{4.71}$$
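
The relation between the differential and the total cross-section can be illustrated numerically. The sketch below integrates an axially symmetric $\sigma(\theta)$ over the conical solid angle element $d\Omega = 2\pi\sin\theta\,d\theta$. As a test case it assumes the well-known hard-sphere result $\sigma(\Omega) = R^2/4$, which is not derived in this section and is used here only because its total cross-section, $\pi R^2$, is known in closed form.

```python
import math


def total_cross_section(sigma, n=20000):
    """Integrate sigma(theta) * 2*pi*sin(theta) over 0 <= theta <= pi (midpoint rule)."""
    dtheta = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        total += sigma(theta) * 2.0 * math.pi * math.sin(theta) * dtheta
    return total


R = 2.0   # radius of the hypothetical hard sphere, arbitrary units


def hard_sphere(theta):
    """Isotropic differential cross-section of a hard sphere (assumed test case)."""
    return R * R / 4.0


sigma_T = total_cross_section(hard_sphere)
print(sigma_T, math.pi * R * R)
```

Any other axially symmetric $\sigma(\theta)$ can be passed in the same way, provided the integral converges; for a pure inverse square force it does not, because of the forward-scattering divergence.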


(ii) Scattering in a Conservative Central Force Field 

The scattering of individual particles is a two body problem, one partner of which is the 
force centre and the other partner is the particle itself. We know that a two body problem 


can always be reduced to a one body problem with the net result that the scattering agent 
(force centre) can be assumed to be fixed in space and the scattered particle has to assume 
the reduced mass of the system. Using this one particle formulation we represent F as the 
force centre in Fig. 4.13, which remains fixed and the scattered particles fly past F following 
different trajectories. Since the particles are coming from infinity with some nonzero velocity
$\mathbf{v}_0$, the original specific energy of individual particles $E' = v_0^2/2 > 0$. If a particle has to
avoid hitting the force centre directly it must not travel in the direction passing through the
force centre when it is at infinity. Vectorially speaking, this would mean that $\mathbf{r} \times \mathbf{v}_0$ should
not be equal to zero for that particle ($\mathbf{r}$ measured from the force centre). But $\mathbf{r} \times \mathbf{v}_0 = \mathbf{H}$,
which is the specific angular momentum of the system and is a constant of motion for any
kind of the conservative central force. This enables us to define a parameter $s$ such that

$$H = |\mathbf{H}| = v_0 s$$

$s$ is called the impact parameter of the particle and is simply the minimum distance of
separation between $F$ and the trajectory of the particle, had there been no force field operative
at $F$. We thus have

$$H = v_0 s = s\sqrt{2E'} \tag{4.72}$$

E', which is the specific energy (energy per unit reduced mass), is also a constant of motion, 
provided, of course, that the force field is conservative which we have assumed it to be. 



Fig. 4.13 Angle of scattering and impact parameter explained


The angle of scattering, $\psi$, is defined to be the angle between the incident and the
scattered directions. Now, if $\theta$ is the angle between the radius vector of the particle at any
instant and its initial radius vector while at infinity or at a very large distance, then $\theta$ varies
from 0 to $\pi - \psi$. From Eq. (4.22) we know that

$$\pi - \psi = 2\int_{r_{\min}}^{\infty} \frac{dr}{r^2\sqrt{2H^{-2}\{E' - V(r)\} - r^{-2}}}$$

where $V(r)$ is the potential at $r$ due to the central force acting at F, and $r_{\min}$ = minimum
distance of approach of the particle with respect to F (see Fig. 4.13). Obviously $r_{\min} \neq s$,
but in general $r_{\min} = r_{\min}(s)$. Therefore, we get for the angle of scattering

$$\psi = \pi - 2\int_{r_{\min}}^{\infty} \frac{dr}{r^2\sqrt{2E'H^{-2} - 2V(r)H^{-2} - r^{-2}}} \qquad (4.73)$$


For a given incident intensity, $v_0$ and hence $E'$ are the same for all particles, but the
specific angular momentum $H$ will differ from particle to particle as the impact parameter
$s$ varies. From Eq. (4.73) it is now apparent that, as $r_{\min} = r_{\min}(s)$,

$$\psi = \psi(s)$$

that is, the angle of scattering is different for different impact parameters and the same
for particles with the same impact parameter, provided $E'$ (or $v_0$) remains the same for
all the incident particles. The geometry of this situation has an azimuthal symmetry
about the direction of $\mathbf{v}_0$ passing through the force centre F. Now, from the definition of
the differential scattering cross-section we see that, due to azimuthal symmetry, the area
lost from the incident beam due to scattering into the interval between the scattering angles $\psi$ and
$\psi + d\psi$ is the same as the area of the annular ring corresponding to the range of impact
parameters $s$ and $s - ds$, so that

$$\sigma(\psi)\,2\pi \sin\psi\,|d\psi| = 2\pi s\,|ds|$$

or

$$\sigma(\psi) = \frac{s}{\sin\psi}\left|\frac{ds}{d\psi}\right| \qquad (4.74)$$

where $\psi = \psi(s)$ is given by Eq. (4.73).


(iii) Scattering by Inverse Square Law of Force

We know that $E' > 0$ corresponds to a hyperbolic orbit for a force field given by
$F(r) = -K/r^2$, where $K$ can be negative or positive according to the repulsive or
attractive nature of the force field. In Fig. 4.14, the left hand track represents the path of
the particle moving in an attractive inverse square law force field operative
at F. For the same set of parameters, the right hand track would correspond to the path
followed by the particle in a repulsive force field operative at the same location
F. These two curves are symmetric about their common directrix, except that the signs of
the curvature are opposite. The equations for such a pair of orbits are given by (cf. Eq.
(4.34) for the generality of the solutions)

$$r = \frac{H^2/|K|}{e\cos\nu \pm 1} \qquad (4.75)$$

where the + sign corresponds to the attractive field and the − sign to the repulsive field.


The asymptotes $r \to \infty$ correspond to $\nu \to \nu_0$ given by $\cos\nu_0 = \pm e^{-1}$, where the +
sign is for the repulsive field and the − sign for the attractive field. For the attractive force
field we have $\psi = 2\nu_0 - \pi$, but for the repulsive field $\psi = \pi - 2\nu_0$. Substituting the
respective expressions for $\nu_0$, we find for both the cases

$$\psi = 2\sin^{-1}\left(\frac{1}{e}\right) \qquad (4.76)$$


Thus for both the repulsive and the attractive cases, the deflection is the same (which is
obvious from Fig. 4.14), and it depends only on the eccentricity of the hyperbolic track of
the particle under consideration. Using Eq. (4.37) in Eq. (4.76), we can eliminate $e$ and
express $\psi$ as a function of $E'$ and $H$:

$$\cot^2\left(\frac{\psi}{2}\right) = e^2 - 1 = \frac{2E'H^2}{K^2}$$

Since $H = v_0 s = s\sqrt{2E'}$, we can write for the impact parameter

$$s = \frac{H}{\sqrt{2E'}} = \frac{K\cot(\psi/2)}{2E'} \qquad (4.77)$$

Equation (4.77) represents our long sought explicit functional form of $\psi(s)$, a result
obtained without actually evaluating the integral in Eq. (4.73).
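
As a cross-check, the integral in Eq. (4.73) can also be evaluated numerically and compared with Eq. (4.77). The following is only a minimal sketch under stated assumptions: it takes the specific potential $V(r) = -K/r$ with $K < 0$ (repulsive field), substitutes $u = 1/r$, and removes the square-root singularity at the turning point by the further substitution $u = u_{\max}(1 - t^2)$. The function name and the sample values are illustrative and not from the text.

```python
import math

def scattering_angle(s, E, K, n=2000):
    """Evaluate Eq. (4.73) for V(r) = -K/r by a midpoint rule.

    With u = 1/r the radicand becomes a + b*u - u**2, where
    a = 2E'/H^2 = 1/s^2 and b = K/(E' s^2), using H = s*sqrt(2E').
    """
    a = 1.0 / s**2
    b = K / (E * s**2)
    u_max = 0.5 * (b + math.sqrt(b * b + 4.0 * a))  # u_max = 1/r_min
    h = 1.0 / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * h              # midpoints avoid the endpoint t = 0
        u = u_max * (1.0 - t * t)      # u runs from u_max (t=0) down to 0 (t=1)
        radicand = a + b * u - u * u
        total += 2.0 * u_max * t / math.sqrt(radicand) * h
    return math.pi - 2.0 * total

# Illustrative values: repulsive field (K < 0), E' = 0.5, so v_0 = 1.
E, K = 0.5, -1.0
for s in (0.3, 1.0, 2.0):
    numeric = scattering_angle(s, E, K)
    closed_form = 2.0 * math.atan(abs(K) / (2.0 * E * s))  # from Eq. (4.77)
    print(s, numeric, closed_form)
```

For an attractive field ($K > 0$) the same integral returns an angle of equal magnitude and opposite sign, that is, a deflection towards the force centre.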


Using the expression given in Eq. (4.77) for $s$, the RHS of Eq. (4.74) is evaluated to give

$$\sigma(\psi) = \frac{1}{4}\left(\frac{K}{2E'}\right)^2 \csc^4(\psi/2) \qquad (4.78)$$

Equation (4.78) is the Rutherford formula for the differential scattering of particles in the
force field of an inverse square law of force (derived by Ernest Rutherford in 1911).
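
The step from Eq. (4.77) to Eq. (4.78) can be checked numerically. The sketch below, with illustrative function names and sample values that are not from the text, differentiates $s(\psi)$ by a central difference, inserts it into Eq. (4.74), and compares the result with the closed form.

```python
import math

def impact_parameter(psi, K, E):
    """s(psi) from Eq. (4.77); |K| is used so that s is positive."""
    return abs(K) / (2.0 * E) / math.tan(psi / 2.0)

def sigma_from_definition(psi, K, E, h=1e-6):
    """Eq. (4.74): sigma = (s / sin psi) |ds/dpsi|, ds/dpsi by central difference."""
    ds = (impact_parameter(psi + h, K, E) - impact_parameter(psi - h, K, E)) / (2.0 * h)
    return impact_parameter(psi, K, E) / math.sin(psi) * abs(ds)

def sigma_rutherford(psi, K, E):
    """Closed form of Eq. (4.78)."""
    return 0.25 * (K / (2.0 * E)) ** 2 / math.sin(psi / 2.0) ** 4

for psi in (0.2, 1.0, 2.5):
    print(psi, sigma_from_definition(psi, 1.0, 0.5), sigma_rutherford(psi, 1.0, 0.5))
```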

Obviously, $\sigma(\psi)$ given by Eq. (4.78) blows up as $\psi \to 0$, the reason being that all
particles are scattered, however small their angle of scattering may be. Note that $s \to \infty$
as $\psi \to 0$, implying that particles that are incident through an infinite cross-section are
bound to be scattered through a vanishingly small angle ($\psi \approx 0$). Thus we have

$$\sigma_T = \int_{\psi_0}^{\pi} \sigma(\psi)\,2\pi \sin\psi\,d\psi \to \infty, \quad \text{as } \psi_0 \to 0$$

This divergence exists for any central force with $f(r) \propto r^{-n}$, $n$ being any positive
exponent.

For further study of small angle scattering, one is referred to two articles in the American 
Journal of Physics: volume 43, page 328, and volume 45, page 1122. 

(iv) Enhancement of Geometrical Scattering Cross-section due to Inverse Square Law of 
Attraction and the Cross-section for Capture 

We describe it by way of a worked-out example. The idea is that, if a swarm of
particles is intercepted by a rigid sphere, it is the geometrical cross-section of the sphere
that causes the interception. But if the sphere is massive enough to have its escape velocity
large compared to the velocity of the projectile particles, then the effective cross-section of interception
is enhanced. In fact, as will be shown below, the massive body can simply swallow all
particles that have an impact parameter much larger than the geometrical radius of the
body. The cross-section for swallowing is called the capture cross-section.

Fig. 4.14 Symmetry of the paths of scattering in a repulsive (right track) and an attractive
(left track) force field of identical strength situated at F, for particles incident
with identical impact parameter and identical initial speed


Let the earth be streaming through a swarm of meteorites with a relative speed $v_0$.
Because of the conservation of specific angular momentum we get

$$v_0 s = v_m r_0$$

$v_m$ being the speed at closest approach (see Fig. 4.15), $s$ the impact parameter, $r_0 = r_{\min}$
($= r_\oplus$ here) and

$$v_m^2 = v_0^2 + \frac{2GM_\oplus}{r_0} = v_0^2 + v_e^2$$

by the law of conservation of energy. Here $M_\oplus$ is the mass of the earth and $v_e$ is the earth's
escape speed at $r_0$. Hence,

$$\pi s^2 = \pi r_0^2\left(1 + \frac{v_e^2}{v_0^2}\right) \qquad (4.79)$$

So the capture cross-section is enhanced compared to its geometrical value by a factor > 1.
Since the sun is speeding through the local interstellar medium with a speed of about 23
km/s (towards the constellation Hercules) compared to its own escape speed, which is about
617.6 km/s, it will gobble up everything that comes within a radius $s \approx 26.9\,R_\odot$, that is,
within about 27 times its geometrical radius. For the earth, however, $v_e$ = 11.2 km/s, and hence
this enhancement will hardly be 10% of its own radius.

Fig. 4.15 Critical impact parameter for gravitational capture
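
The numbers quoted above follow directly from Eq. (4.79). The short sketch below reproduces them; the function name is illustrative, and the earth is assumed to meet the swarm at the same relative speed of 23 km/s, which is what the quoted figure of roughly 10% implies.

```python
import math

def capture_radius_factor(v_escape, v_rel):
    """Ratio s / r_0 from Eq. (4.79): sqrt(1 + v_e^2 / v_0^2)."""
    return math.sqrt(1.0 + (v_escape / v_rel) ** 2)

# Speeds in km/s, taken from the text.
sun_factor = capture_radius_factor(617.6, 23.0)
earth_factor = capture_radius_factor(11.2, 23.0)  # assumes the same relative speed
print(sun_factor, earth_factor)
```

The capture cross-section itself scales as the square of this factor.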

(v) Conserved Quantities in a Two Body Collision driven by a Central Force 

As we have seen in section 4.2, for the two body central force problem the velocity of the
centre of mass (CM) is a constant of motion. Let $\mathbf{r}_s$ and $\mathbf{r}_t$ denote the position vectors
of the scatterer and the particle respectively, and $\mathbf{R}$ be the position vector of the centre of
mass at any given instant of time, as measured in some fixed inertial frame. Then the value
of the expression

$$M\dot{\mathbf{R}} = (m_s + m_t)\dot{\mathbf{R}} = m_s\dot{\mathbf{r}}_s + m_t\dot{\mathbf{r}}_t \qquad (4.80)$$

remains constant for all time. Here, $m_s$ and $m_t$ are the masses of the scatterer and the
particle respectively. Now let $\mathbf{p}_s$ and $\mathbf{p}'_s$ be the initial and final momenta of the scatterer,
that is, the asymptotic values of $m_s\dot{\mathbf{r}}_s$ before and after collision. Similarly, let $\mathbf{p}_t$ and $\mathbf{p}'_t$ be
the initial and final momenta of the particle. By Eq. (4.80) we get

$$\mathbf{p}_s + \mathbf{p}_t = M\dot{\mathbf{R}} = \mathbf{p}'_s + \mathbf{p}'_t \qquad (4.81)$$

Equation (4.81) simply states the law of conservation of linear momentum for the process
of two body collision. Note the additivity of linear momenta in Eq. (4.81). The principle
of momentum conservation is the most general conservation principle for collisions as it
is independent of the forces involved. If we assume that the forces operative between the
scatterer and the particle are conservative in nature, the total energy of this system of two
particles must also be conserved.

In order to obtain this conservation law we consider the magnitude of the momentum of one
of the particles in the CM frame, $\mu|\dot{\mathbf{r}}|$, where $\mu$ is the reduced mass of the system and
$\mathbf{r} = \mathbf{r}_s - \mathbf{r}_t$ is the relative displacement between the scatterer and the particle. (In the
CM frame the centre of mass is stationary, so that its momentum is zero, while the value of
$\mathbf{r}$ and hence $\dot{\mathbf{r}}$ remains the same in all frames.) It is easy to see that

$$\mu\dot{\mathbf{r}} = m_s(\dot{\mathbf{r}}_s - \dot{\mathbf{R}}) = -m_t(\dot{\mathbf{r}}_t - \dot{\mathbf{R}}) \qquad (4.82)$$

If $\mathbf{p}_{cm}$ and $\mathbf{p}'_{cm}$ are the initial and final values of $\mu\dot{\mathbf{r}}$ then

$$\mathbf{p}_{cm} = \mathbf{p}_s - m_s\dot{\mathbf{R}} = -(\mathbf{p}_t - m_t\dot{\mathbf{R}})$$
$$\text{and} \quad \mathbf{p}'_{cm} = \mathbf{p}'_s - m_s\dot{\mathbf{R}} = -(\mathbf{p}'_t - m_t\dot{\mathbf{R}}) \qquad (4.83)$$

Now if the forces are conservative the energy of the two body system must be conserved
and, in particular, the asymptotic values of the energy must be equal. Thus, we have

$$p_{cm}^2 = p'^{\,2}_{cm} \qquad (4.84)$$


Equation (4.84) expresses energy conservation for the system. It can be expressed in terms
of the individual particles' momenta by using the relation

$$\frac{1}{2}m_s\dot{\mathbf{r}}_s^2 + \frac{1}{2}m_t\dot{\mathbf{r}}_t^2 = \frac{1}{2}M\dot{\mathbf{R}}^2 + \frac{1}{2}\mu\dot{\mathbf{r}}^2 \qquad (4.85)$$

Equation (4.85) follows from Eq. (4.82) and holds irrespective of whether energy is conserved
or not. Evaluating Eq. (4.85) in the asymptotic region and using Eq. (4.84) we get the
energy conservation in the form

$$\frac{p_s^2}{2m_s} + \frac{p_t^2}{2m_t} = \frac{p'^{\,2}_s}{2m_s} + \frac{p'^{\,2}_t}{2m_t} \qquad (4.86)$$

A collision is said to be elastic if it obeys Eq. (4.86), that is, if it conserves kinetic energy
and the masses involved. Equations (4.84) and (4.86) apply to any binary elastic collision. It
is obvious from Eq. (4.84) that an elastic collision simply rotates the initial CM momentum
$\mathbf{p}_{cm}$ through some angle $\psi$ into its final value $\mathbf{p}'_{cm}$. Thus, we can write

$$\mathbf{p}'_{cm} = \mathbf{p}_{cm}\cos\psi + (\mathbf{p}_{cm} \times \hat{\mathbf{i}})\sin\psi \qquad (4.87)$$

where $\hat{\mathbf{i}}$ is the unit vector in the triad $(\hat{\mathbf{i}}, \hat{\mathbf{j}}, \hat{\mathbf{k}})$, $\hat{\mathbf{j}}$ and $\hat{\mathbf{k}}$ defining the plane of scattering.
The angle $\psi$ is called the CM scattering angle. The velocities of the particles with respect
to the CM are $\dot{\mathbf{r}}_s - \dot{\mathbf{R}}$ and $\dot{\mathbf{r}}_t - \dot{\mathbf{R}}$ respectively, which are always oppositely directed
according to Eq. (4.82). Thus relations between the initial and final states can be shown as
in Fig. 4.15.

(vi) Momentum and Energy Transfer in Scattering 

The momentum transferred between the scatterer and the particle, say $\Delta\mathbf{p}$, can be obtained
from Eq. (4.83). We have

$$\Delta\mathbf{p} = \mathbf{p}'_{cm} - \mathbf{p}_{cm} = \mathbf{p}'_s - \mathbf{p}_s = -(\mathbf{p}'_t - \mathbf{p}_t) \qquad (4.88)$$

Thus $\Delta\mathbf{p}$ is independent of the CM velocity $\dot{\mathbf{R}}$. However, the energy transfer $\Delta E$ defined by

$$\Delta E = \frac{1}{2m_t}\left(p'^{\,2}_t - p_t^2\right) = -\frac{1}{2m_s}\left(p'^{\,2}_s - p_s^2\right) \qquad (4.89)$$

depends on the CM velocity $\dot{\mathbf{R}}$, because, again with the help of Eq. (4.83), we can express
$\Delta E$ in the form

$$\Delta E = -\dot{\mathbf{R}} \cdot \Delta\mathbf{p} \qquad (4.90)$$

Equation (4.89) tells us that $\Delta E$ is positive if the scatterer loses energy in the collision,
and negative if the scatterer gains energy.
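
The relations between lab and CM momenta can be verified on a concrete planar collision. The sketch below is illustrative only: it assumes the convention that $\Delta E$ is the kinetic energy gained by the particle (mass $m_t$) and $\Delta\mathbf{p} = \mathbf{p}'_s - \mathbf{p}_s$, builds the CM momentum as in Eq. (4.83), rotates it through an arbitrary angle $\psi$, and then checks momentum conservation, kinetic energy conservation and $\Delta E = -\dot{\mathbf{R}}\cdot\Delta\mathbf{p}$. The masses, velocities and function names are hypothetical.

```python
import math

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1]

def collide(m_s, m_t, v_s, v_t, psi):
    """Planar elastic collision: rotate the CM momentum through psi."""
    M = m_s + m_t
    R_dot = ((m_s * v_s[0] + m_t * v_t[0]) / M, (m_s * v_s[1] + m_t * v_t[1]) / M)
    p_s = (m_s * v_s[0], m_s * v_s[1])
    p_t = (m_t * v_t[0], m_t * v_t[1])
    # Eq. (4.83): p_cm = p_s - m_s * R_dot
    p_cm = (p_s[0] - m_s * R_dot[0], p_s[1] - m_s * R_dot[1])
    c, s = math.cos(psi), math.sin(psi)
    p_cm_new = (c * p_cm[0] - s * p_cm[1], s * p_cm[0] + c * p_cm[1])
    # Invert Eq. (4.83) for the final lab momenta.
    p_s_new = (p_cm_new[0] + m_s * R_dot[0], p_cm_new[1] + m_s * R_dot[1])
    p_t_new = (-p_cm_new[0] + m_t * R_dot[0], -p_cm_new[1] + m_t * R_dot[1])
    return R_dot, p_s, p_t, p_s_new, p_t_new

m_s, m_t = 3.0, 1.0  # hypothetical masses
R_dot, p_s, p_t, p_s_new, p_t_new = collide(m_s, m_t, (0.5, 0.0), (2.0, 1.0), 0.7)

delta_p = (p_s_new[0] - p_s[0], p_s_new[1] - p_s[1])
delta_E = (dot(p_t_new, p_t_new) - dot(p_t, p_t)) / (2.0 * m_t)  # gain of the particle
ke_before = dot(p_s, p_s) / (2 * m_s) + dot(p_t, p_t) / (2 * m_t)
ke_after = dot(p_s_new, p_s_new) / (2 * m_s) + dot(p_t_new, p_t_new) / (2 * m_t)
print(delta_E, -dot(R_dot, delta_p), ke_before, ke_after)
```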

Equation (4.90) has important applications in astromechanics as well as atomic physics.
A spacecraft travelling from the earth to any of the outer planets, say Saturn, Uranus,
Neptune or Pluto, can be given a large boost in velocity by scattering it off Jupiter. For
the Jupiter-spacecraft system we can place the origin of the 'Lab frame' on the sun so that
$\dot{\mathbf{R}}$ is nothing else but the velocity of Jupiter relative to the sun. Since the mass of Jupiter
is very large compared to that of the spacecraft, the CM practically coincides with Jupiter. In order to
maximise the boost available to the spacecraft we must maximise $-\dot{\mathbf{R}}\cdot\Delta\mathbf{p} = \dot{\mathbf{R}}\cdot(\mathbf{p}'_{cm} - \mathbf{p}_{cm})$,
where $\mathbf{p}_{cm}$ and $\mathbf{p}'_{cm}$ are here taken to be the initial and final momenta of the spacecraft in the CM frame.
The initial momentum of the spacecraft, $\mathbf{p}_t$, as it is launched from the earth, also
fixes $\mathbf{p}_{cm}$ through the equation

$$\mathbf{p}_{cm} = \mathbf{p}_t - m_t\dot{\mathbf{R}}$$

Since $\mathbf{p}_{cm}$ is fixed, the maximum boost is achieved by adjusting the impact parameter for the
collision so that $\mathbf{p}'_{cm}$ is parallel to $\dot{\mathbf{R}}$, giving $\dot{\mathbf{R}}\cdot\mathbf{p}'_{cm}$ its maximum value. This is achieved
by appropriate timing of the launch and manoeuvring of the spacecraft. Note that this is an
instance of scattering by an attractive inverse square law of force due to Jupiter's gravity.
Since the force is conservative and the scattering is elastic we can use Eq. (4.90). The
amount of energy gained by the spacecraft equals the amount of energy lost by Jupiter.
However, this energy is too small to perturb Jupiter's orbit around the sun in any way. In
other words, Jupiter and the spacecraft can be considered to form a closed system, as far
as the scattering process is concerned.

A gravity-assisted trajectory from the earth to Uranus is shown in Fig. 4.16. The transit 
time from the earth to Uranus is about 5 years on the assisted orbit as compared to 16 years 
on an unassisted orbit with the same initial conditions. 

(vii) Relation Between Lab Scattering Variables and CM Scattering Variables

Until now we were primarily concerned with the effectively one body problem of scattering
of a particle by a fixed centre of force. In an actual laboratory experiment scattering
involves two bodies, and the scatterer or the target particle is not stationary but recoils as
a result of scattering. The scattering angle actually measured in the laboratory, say $\psi'$
(see below), is the angle between the final and incident directions of the scattered particle.
The scattering angle $\psi$ defined in the one body problem is the angle between the final and
initial directions of the relative vector $\mathbf{r} = \mathbf{r}_s - \mathbf{r}_t$ of the two particles. These two angles
would be the same only if the second particle remained stationary throughout the scattering
process. Thus it is imperative to establish quantitative relationships between the scattering
variables in the CM frame (which corresponds to the equivalent one particle problem) and
the scattering variables in the lab frame.

In a typical scattering experiment one particle with initial momentum $\mathbf{p}_t$ is shot at a
target particle at rest in the laboratory with initial momentum $\mathbf{p}_s = 0$. Both $\mathbf{p}_s$ and $\mathbf{p}_t$
are measured with respect to a frame fixed in the laboratory in which the experiment is
carried out. Fig. 4.17 shows the scattering variables which can be measured in a laboratory.
Angle $\psi'$ is called the lab scattering angle and $\phi$ is called the recoil angle. With the lab
condition $\mathbf{p}_s = 0$, the first of Eq. (4.83) gives the relation between lab and CM momenta
before collision:

$$\mathbf{p}_t = -\frac{M}{m_s}\,\mathbf{p}_{cm} = M\dot{\mathbf{R}} \qquad (4.91)$$

Using this to eliminate $\dot{\mathbf{R}}$ from the second of Eq. (4.83) and solving for the final lab
momenta in terms of the CM momenta we get

$$\mathbf{p}'_t = -\frac{m_t}{m_s}\,\mathbf{p}_{cm} - \mathbf{p}'_{cm} \qquad (4.92)$$

$$\mathbf{p}'_s = \mathbf{p}'_{cm} - \mathbf{p}_{cm} = \Delta\mathbf{p} \qquad (4.93)$$

Equations (4.91-93) describe all the relations between lab and CM variables. These
relations, along with the conservation laws, are depicted in Fig. 4.17.

Fig. 4.16 A schematic orbital configuration for supplying gravitational boost to a spacecraft while it passes by Jupiter

Fig. 4.17 Kinematics of binary collision in the lab and CM frames

Let us now express the lab energy transfer $\Delta E$ and the lab scattering angle $\psi'$ in terms of
the CM variables. The total energy $E_0$ of the two particle system equals the initial kinetic
energy of the projectile. This, in turn, can be expressed in terms of the CM momentum
using Eq. (4.91). Thus,

$$E_0 = \frac{p_t^2}{2m_t} = \frac{M p_{cm}^2}{2\mu m_s} \qquad (4.94)$$

Since the total energy is conserved during the collision, $E_0$ (see Eq. (4.94)) will be
redistributed among the particles. Since the scatterer is at rest initially, the energy transferred
$\Delta E$ is equal, in magnitude, to its kinetic energy after collision. Using Eqs (4.89) and (4.93), this fact can
be expressed as

$$\Delta E = \frac{p'^{\,2}_s}{2m_s} = \frac{(\mathbf{p}'_{cm} - \mathbf{p}_{cm})^2}{2m_s}$$

The fractional energy transfer is, therefore,

$$\frac{\Delta E}{E_0} = \frac{\mu(\mathbf{p}'_{cm} - \mathbf{p}_{cm})^2}{M p_{cm}^2} \qquad (4.95)$$

Now $(\mathbf{p}'_{cm} - \mathbf{p}_{cm})^2$ can be written as

$$(\mathbf{p}'_{cm} - \mathbf{p}_{cm})^2 = p'^{\,2}_{cm} + p_{cm}^2 - 2\,p'_{cm}\,p_{cm}\cos\psi$$

By Eq. (4.84) this becomes

$$(\mathbf{p}'_{cm} - \mathbf{p}_{cm})^2 = 2p_{cm}^2(1 - \cos\psi) = 4p_{cm}^2\sin^2(\psi/2)$$

which finally gives

$$\frac{\Delta E}{E_0} = \frac{\mu(\mathbf{p}'_{cm} - \mathbf{p}_{cm})^2}{M p_{cm}^2} = \frac{4m_t m_s}{(m_t + m_s)^2}\,\sin^2(\psi/2) \qquad (4.96)$$

Since Eq. (4.96) is independent of the CM velocity $\dot{\mathbf{R}}$, it must apply to moving targets as well
as to the stationary targets that we have considered here.


Note that the energy transfer is maximum for $\psi = \pi$, giving

$$\left(\frac{\Delta E}{E_0}\right)_{\max} = \frac{4m_s m_t}{(m_s + m_t)^2} \le 1 \qquad (4.97)$$

This means that all of the energy can be transferred to the target (scatterer) only if $m_t = m_s$.
Thus, for example, hydrogen rich materials are more effective in slowing down neutrons
than heavy materials like lead, because the mass of a hydrogen atom is nearly the same as
the mass of a neutron. Electrons passing through a material lose most of their energy to
other electrons in the material, rather than to atomic nuclei. Since the proton to electron mass
ratio is about 1836, the maximum energy transferred by an electron to a nucleus with $n$
nucleons corresponds to

$$\left(\frac{\Delta E}{E_0}\right)_{\max} \approx \frac{0.002}{n}$$
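
The examples above can be made quantitative with Eq. (4.96). The following sketch uses approximate masses in atomic mass units and in electron masses. These values are rounded, the lead mass is the natural-abundance average, and the function name is illustrative.

```python
import math

def fractional_energy_transfer(m_t, m_s, psi):
    """Eq. (4.96): Delta E / E_0 for projectile mass m_t, target mass m_s."""
    return 4.0 * m_t * m_s / (m_t + m_s) ** 2 * math.sin(psi / 2.0) ** 2

# Head-on collisions (psi = pi) give the maximum transfer of Eq. (4.97).
neutron, hydrogen, lead = 1.0087, 1.0078, 207.2  # approximate masses in u
on_hydrogen = fractional_energy_transfer(neutron, hydrogen, math.pi)
on_lead = fractional_energy_transfer(neutron, lead, math.pi)

# Electron on a single proton, masses in units of the electron mass.
electron_on_proton = fractional_energy_transfer(1.0, 1836.0, math.pi)
print(on_hydrogen, on_lead, electron_on_proton)
```

A neutron can thus lose essentially all of its energy in one head-on collision with hydrogen, but only about 2% in one with lead.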


In order to relate the lab scattering angle to the CM scattering angle we treat the plane
containing all the momentum vectors as the complex plane. Further, let a hat on a
vector denote a unit vector in the direction of that vector. Then the lab and the CM
scattering angles can be defined through the relations

$$\hat{\mathbf{p}}'_{cm} = \hat{\mathbf{p}}_{cm}\exp(i\psi)$$
$$\hat{\mathbf{p}}'_t = \hat{\mathbf{p}}_t\exp(i\psi'), \quad \text{where } i = \sqrt{-1} \qquad (4.98)$$

From Eq. (4.91) we see that

$$\hat{\mathbf{p}}_t = -\hat{\mathbf{p}}_{cm}$$

so if we write Eq. (4.92) in this complex representation and introduce the scattering angles
defined through Eq. (4.98) we get

$$p'_t\exp(i\psi') = p_{cm}\left(\frac{m_t}{m_s} + \exp(i\psi)\right) \qquad (4.99)$$

To eliminate $p'_t$ and $p_{cm}$ from the above relation consider

$$\left(\frac{p'_t}{p_{cm}}\right)^2 = \left(\frac{m_t}{m_s} + \exp(i\psi)\right)\left(\frac{m_t}{m_s} + \exp(-i\psi)\right) = 1 + \left(\frac{m_t}{m_s}\right)^2 + \frac{2m_t}{m_s}\cos\psi$$

which, on substituting back into Eq. (4.99), gives

$$\exp(i\psi') = \frac{(m_t/m_s) + \exp(i\psi)}{\left[1 + (m_t/m_s)^2 + 2(m_t/m_s)\cos\psi\right]^{1/2}} \qquad (4.100)$$

Equating the real and imaginary parts of Eq. (4.100) and taking their ratio, we finally get

$$\tan\psi' = \frac{\sin\psi}{m_t/m_s + \cos\psi} \qquad (4.101)$$

To interpret this formula for $m_t < m_s$ refer to Fig. 4.17. For fixed initial CM momentum
$\mathbf{p}_{cm}$ the final CM momentum $\mathbf{p}'_{cm}$ must lie on a sphere of radius $p_{cm}$. It is clear from Fig.
4.17 that there is a unique value of $\psi'$ for every value of $\psi$ in the range $0 \le \psi \le \pi$
covering all possibilities. For the limiting case of the stationary target ($m_t \ll m_s$) Eq.
(4.101) reduces to $\tan\psi' = \tan\psi$, whence $\psi' = \psi$. For the case of equal masses $m_s = m_t$
the origin O in Fig. 4.17 lies on the circle and Eq. (4.101) reduces to $\tan\psi' = \tan(\psi/2)$,
whence $\psi' = \psi/2$.

For a light target, $m_t > m_s$, the origin O lies outside the circle as shown in Fig. 4.17.
In this case there are two values $\psi_1$ and $\psi_2$ of the CM scattering angle for each value
of the lab scattering angle. The two values $\psi_1$ and $\psi_2$ can be distinguished in the lab
by measuring the kinetic energy of the scattered particle. The lab scattering angle has a
maximum value $\psi'_{\max}$ given by

$$\sin(\psi'_{\max}) = \frac{m_s}{m_t} \qquad (4.102)$$

which can be read off from the figure. Eq. (4.102) tells us, for example, that a proton cannot
be scattered through more than 0.03° by an electron. Therefore, any significant deflection of
protons or heavier atomic nuclei passing through matter is due to collisions with nuclei rather
than electrons. This is how Lord Rutherford could infer the existence as well as
the size of a heavy nucleus in an atom from the experiments on the scattering of
α-particles by the atoms of gold.
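
Equations (4.101) and (4.102) are easy to evaluate numerically. The sketch below uses `atan2` so that the lab angle falls in the correct quadrant when $m_t/m_s + \cos\psi$ is negative. The function names are illustrative and `ratio` stands for $m_t/m_s$.

```python
import math

def lab_angle(psi, ratio):
    """Lab scattering angle psi' from Eq. (4.101); ratio = m_t / m_s."""
    return math.atan2(math.sin(psi), ratio + math.cos(psi))

def max_lab_angle(ratio):
    """Largest possible psi'; Eq. (4.102) applies when m_t >= m_s."""
    if ratio >= 1.0:
        return math.asin(1.0 / ratio)
    return math.pi  # heavy target: every lab angle up to pi is reachable

# Equal masses: psi' = psi / 2.
print(lab_angle(1.2, 1.0))
# Very heavy target: psi' is practically equal to psi.
print(lab_angle(1.2, 1e-9))
# Proton scattered by an electron (mass ratio about 1836), in degrees.
print(math.degrees(max_lab_angle(1836.0)))
```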

(viii) Lab and CM Cross-sections

We have already evaluated the differential scattering cross-section in terms of the CM
variables (see Eq. (4.74)). In order to obtain a relation between the CM cross-section and the
lab cross-section consider the real part of Eq. (4.100), giving the relation between the scattering
angles:

$$\cos\psi' = \frac{(m_t/m_s) + \cos\psi}{\left[1 + (m_t/m_s)^2 + 2(m_t/m_s)\cos\psi\right]^{1/2}} \qquad (4.103)$$

Lab scattering through an angle $\psi'$ into $d\omega = \sin\psi'\,d\psi'\,d\phi$ corresponds to CM scattering
through an angle $\psi$ into $d\Omega = \sin\psi\,d\psi\,d\phi$. The relation between the two functional forms
of the differential cross-section, say $\sigma(\psi)$ and $\sigma'(\psi')$, is obtained by observing that in a
particular experiment the number of particles scattered into a given solid angle must be the
same whether we express the event in terms of $\psi'$ or $\psi$. Therefore, with $I$ the incident intensity,

$$2\pi I\,\sigma(\psi)\sin\psi\,|d\psi| = 2\pi I\,\sigma'(\psi')\sin\psi'\,|d\psi'|$$

or

$$\sigma'(\psi') = \sigma(\psi)\,\frac{\sin\psi}{\sin\psi'}\left|\frac{d\psi}{d\psi'}\right| = \sigma(\psi)\left|\frac{d(\cos\psi)}{d(\cos\psi')}\right| \qquad (4.104)$$


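From Eq. (4.103) we obtain

$$\frac{d\omega}{d\Omega} = \frac{d(\cos\psi')}{d(\cos\psi)} = \frac{1 + (m_t/m_s)\cos\psi}{\left[1 + (m_t/m_s)^2 + 2(m_t/m_s)\cos\psi\right]^{3/2}} = \frac{\left(1 - (m_t/m_s)^2\sin^2\psi'\right)^{1/2}}{\left[(m_t/m_s)\cos\psi' + \left(1 - (m_t/m_s)^2\sin^2\psi'\right)^{1/2}\right]^2} \qquad (4.105)$$

For $m_s = m_t$ this reduces to

$$\frac{d\omega}{d\Omega} = \frac{1}{4\cos(\psi/2)} = \frac{1}{4\cos\psi'} \qquad (4.106)$$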


For a heavy target, $m_s \gg m_t$, Eq. (4.105) reduces to unity, and hence the lab and
CM cross-sections are nearly equal. For $m_s = m_t$,

$$\sigma'(\psi') = 4\cos\psi'\,\sigma(\psi)$$

Even when scattering is isotropic in terms of $\psi$ (that is, $\sigma(\psi)$ is a constant independent
of $\psi$) the cross-section in terms of $\psi'$ varies as the cosine of the angle! Note, again, that
$\psi' \le \pi/2$ when $m_t = m_s$.
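
The Jacobian of Eq. (4.105) can be checked against a finite-difference derivative of Eq. (4.103). This is a minimal sketch with illustrative function names, where `rho` stands for $m_t/m_s$.

```python
import math

def cos_lab(psi, rho):
    """cos(psi') from Eq. (4.103); rho = m_t / m_s."""
    return (rho + math.cos(psi)) / math.sqrt(1.0 + rho * rho + 2.0 * rho * math.cos(psi))

def jacobian(psi, rho):
    """d(omega)/d(Omega) = d(cos psi')/d(cos psi), first form of Eq. (4.105)."""
    c = math.cos(psi)
    return (1.0 + rho * c) / (1.0 + rho * rho + 2.0 * rho * c) ** 1.5

def jacobian_numeric(psi, rho, h=1e-5):
    """Ratio of central differences of cos(psi') and cos(psi)."""
    d_lab = cos_lab(psi + h, rho) - cos_lab(psi - h, rho)
    d_cm = math.cos(psi + h) - math.cos(psi - h)
    return d_lab / d_cm

def sigma_lab(sigma_cm, psi, rho):
    """Eq. (4.104): lab cross-section from the CM cross-section at the same event."""
    return sigma_cm / abs(jacobian(psi, rho))

print(jacobian(1.0, 0.4), jacobian_numeric(1.0, 0.4))
# Equal masses: Eq. (4.106) gives 1 / (4 cos(psi/2)).
print(jacobian(1.0, 1.0), 1.0 / (4.0 * math.cos(0.5)))
```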

Lastly we point out that $\sigma(\psi)$ is not the cross-section that an observer in the CM
frame would measure. Both $\sigma(\psi)$ and $\sigma'(\psi')$ are cross-sections measured in the lab frame
and are merely the expressions for the cross-section in terms of different coordinates. An
observer sitting on the centre of mass would see a different incident intensity from that
measured in the lab frame. Therefore, the corresponding transformation has to be included
to relate the cross-sections as measured in the CM and the lab frames.


4.16 VIRIAL THEOREM 

The virial theorem expresses a conservation law for a system of interacting particles which
has already achieved a dynamical equilibrium. The virial theorem was first given by Rudolf
Clausius in 1870. The interaction is assumed to be of a central force type in general.

Consider a system of $N$ interacting particles that has achieved dynamical equilibrium
through some kind of central force interactions. Let $T$ be the total translational kinetic
energy of such a system and $W$ be the virial of the system defined as

$$W = \sum_{i=1}^{N} \mathbf{F}_i \cdot \mathbf{r}_i \qquad (4.107)$$

where $\mathbf{F}_i$ is the net force on the $i$th particle and $\mathbf{r}_i$ is the position vector of the same particle.
The virial theorem states that under dynamical equilibrium,

$$2T + W = 0 \qquad (4.108)$$


Proof: The equation of motion of the $i$th particle is

$$m_i\frac{d^2\mathbf{r}_i}{dt^2} = \mathbf{F}_i$$

Now consider the virial of the system

$$W = \sum_{i=1}^{N}\mathbf{F}_i\cdot\mathbf{r}_i = \sum_{i=1}^{N} m_i\frac{d^2\mathbf{r}_i}{dt^2}\cdot\mathbf{r}_i = \sum_{i=1}^{N}\frac{d}{dt}\left(m_i\mathbf{r}_i\cdot\frac{d\mathbf{r}_i}{dt}\right) - \sum_{i=1}^{N} m_i\frac{d\mathbf{r}_i}{dt}\cdot\frac{d\mathbf{r}_i}{dt} = \frac{1}{2}\frac{d^2 I}{dt^2} - 2T$$

where $I = \sum_i m_i r_i^2$ is the moment of inertia of the whole system about the origin, and $T$ = total
translational kinetic energy. If the system has achieved dynamical equilibrium its moment
of inertia should not change with time, so that $dI/dt = 0 = d^2I/dt^2$, giving

$$W + 2T = 0$$


which proves the theorem. The term $d^2I/dt^2$ does not vanish as long as the process of
virialisation continues. The time scale over which this process of virialisation takes place is
called the virial time scale of the system. The virial $W$ is connected with the total potential
energy of the system. Let the central force law between any two particles $i$ and $j$ be of the
type

$$\mathbf{F}_{ij} = \text{force on the } i\text{th particle due to the } j\text{th particle} = f_{ij}\,(\mathbf{r}_i - \mathbf{r}_j), \quad i \neq j$$

where

$$f_{ij} = \frac{k_{ij}}{|\mathbf{r}_i - \mathbf{r}_j|^{n+1}}, \quad i \neq j$$

and where $k_{ij}$ may depend on $m_i$, $m_j$ etc., but not on the mutual separation of the
particles or on time. We can write the virial due to these forces as

$$W = \sum_i\sum_{j\neq i}\mathbf{F}_{ij}\cdot\mathbf{r}_i = \sum_i\sum_{j\neq i} f_{ij}\,(\mathbf{r}_i - \mathbf{r}_j)\cdot\left[\tfrac{1}{2}(\mathbf{r}_i - \mathbf{r}_j) + \tfrac{1}{2}(\mathbf{r}_i + \mathbf{r}_j)\right]$$
$$= \frac{1}{2}\sum_i\sum_{j\neq i} f_{ij}\,|\mathbf{r}_i - \mathbf{r}_j|^2 + \frac{1}{2}\sum_i\sum_{j\neq i} f_{ij}\,(\mathbf{r}_i - \mathbf{r}_j)\cdot(\mathbf{r}_i + \mathbf{r}_j)$$

The second term vanishes because the summand is the product of antisymmetric and
symmetric expressions with respect to $(i, j)$. Therefore,

$$W = \frac{1}{2}\sum_i\sum_{j\neq i} f_{ij}\,|\mathbf{r}_i - \mathbf{r}_j|^2 = \frac{1}{2}\sum_i\sum_{j\neq i}\frac{k_{ij}}{|\mathbf{r}_i - \mathbf{r}_j|^{n-1}} = (n - 1)V \qquad (4.109)$$

where

$$V = \frac{1}{2}\sum_i\sum_{j\neq i}\frac{k_{ij}}{(n-1)\,|\mathbf{r}_i - \mathbf{r}_j|^{n-1}}$$

is the total potential energy of the system. So, in the absence of any external forces the
virial theorem for a system of $N$ mutually interacting particles in dynamical equilibrium is
given by

$$2T + (n - 1)V = 0 \qquad (4.110)$$

provided the force of interaction between any pair of particles follows a law
$|\mathbf{F}_{ij}| \propto |\mathbf{r}_i - \mathbf{r}_j|^{-n}$. If the forces of interaction obey the attractive inverse square law, from Eq. (4.109)
we get $W = V$ and hence

$$2T + V = 0 \qquad (4.111)$$

This is a special case of the virial theorem (Eq. (4.110)), which, in turn, is a special case of
the most general virial theorem (Eq. (4.108)).
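
For a single bound particle in an attractive inverse square field, Eq. (4.111) holds for the time averages taken over one orbital period. The sketch below illustrates this numerically and is not part of the original derivation: it integrates an elliptic Kepler orbit with the velocity Verlet scheme, working with energies per unit mass, and averages $T$ and $V$ over one period. The initial conditions, step count and function name are arbitrary choices.

```python
import math

def time_averaged_energies(r0=1.0, v0=0.8, gm=1.0, steps=20000):
    """Average specific kinetic and potential energy over one Kepler period."""
    energy = 0.5 * v0 * v0 - gm / r0          # negative for a bound orbit
    a = -gm / (2.0 * energy)                  # semi-major axis
    period = 2.0 * math.pi * math.sqrt(a ** 3 / gm)
    dt = period / steps

    def acceleration(x, y):
        r3 = (x * x + y * y) ** 1.5
        return -gm * x / r3, -gm * y / r3

    x, y, vx, vy = r0, 0.0, 0.0, v0           # start at an apsis
    ax, ay = acceleration(x, y)
    sum_t = sum_v = 0.0
    for _ in range(steps):
        sum_t += 0.5 * (vx * vx + vy * vy)
        sum_v += -gm / math.hypot(x, y)
        # velocity Verlet step
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
        x += dt * vx
        y += dt * vy
        ax, ay = acceleration(x, y)
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
    return sum_t / steps, sum_v / steps

T_avg, V_avg = time_averaged_energies()
print(T_avg, V_avg, 2.0 * T_avg + V_avg)  # the last number should be close to zero
```

The instantaneous values of $2T + V$ oscillate along the ellipse; only the average over a full period vanishes.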


Examples 


(i) A virialised homogeneous and spherical cluster of galaxies (or of stars) of cluster
radius $R$ and cluster mass $M$ has a gravitational potential energy $V$ and total kinetic
energy $T$ given by

$$V = -\frac{3}{5}\frac{GM^2}{R}, \qquad T = \frac{1}{2}M\overline{v^2}$$

where $\overline{v^2}$ is the mean square speed of the individual galaxies (or stars). Thus from the
virial theorem (Eq. (4.111)) we get

$$M = \frac{5R\,\overline{v^2}}{3G} \qquad (4.112)$$

If we can measure the mean square speed from the observed Doppler spread of the spectral
lines, and the radius from the known distance and measured angular size of the virialised
cluster, we can infer the mass of the cluster from the above formula.
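
As an illustration of Eq. (4.112), the sketch below estimates the mass of a hypothetical cluster. The radius of 1 Mpc and the r.m.s. speed of 1000 km/s are made-up round numbers, not data from the text, and the physical constants are rounded.

```python
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
MPC = 3.0857e22      # one megaparsec in metres
M_SUN = 1.989e30     # solar mass in kg

def virial_mass(radius_m, v_rms):
    """Eq. (4.112): M = 5 R <v^2> / (3 G), all quantities in SI units."""
    return 5.0 * radius_m * v_rms ** 2 / (3.0 * G)

# Hypothetical cluster: R = 1 Mpc, root-mean-square speed 1000 km/s.
radius = 1.0 * MPC
v_rms = 1.0e6
mass = virial_mass(radius, v_rms)
print(mass / M_SUN)  # of order 10^14 solar masses

# Consistency check: with this mass, 2T + V vanishes.
T = 0.5 * mass * v_rms ** 2
V = -0.6 * G * mass ** 2 / radius
print(2.0 * T + V)
```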


(ii) Consider an ideal gas. It does not have any forces acting between its particles. The
only force on the particles is the reaction force due to the impact on the walls of the container.
For this force we can write

$$d\mathbf{F} = -p'\,d\mathbf{A}'$$

where $p'$ is the pressure of the gas and $d\mathbf{A}'$ an element of area on the surface of the
container (the direction of $d\mathbf{A}'$ being chosen +ve for the outward normal). Thus we get,
for the virial,

$$W = -p'\oint \mathbf{r}\cdot d\mathbf{A}' = -p'\int(\nabla\cdot\mathbf{r})\,dV' = -3p'V'$$

where $V'$ is the volume of the container. From the most general virial theorem, we have

$$2T + W = 0$$

so that

$$T = -\frac{1}{2}W = \frac{3}{2}p'V'$$

Now it is well known that the total translational kinetic energy of an ideal gas comprising
$N$ particles is $(3/2)NkT'$, where $T'$ is the temperature of the gas and $k$ is Boltzmann's
constant. Substituting this in the above equation we get the well-known result, namely the
equation of state

$$p'V' = NkT'$$

(iii) Let us apply the results of example (ii) to a star. A star is supposed to be made
up of ideal gas. Let a small mass $dm$ of a perfect gas consisting of $dN$ particles have
translational kinetic energy associated with it, given by

$$dT = \frac{3}{2}p'\,dV' = \frac{3}{2}kT'\,dN = \frac{3}{2}R'T'\,\frac{dm}{\mu}$$

where $\mu$ is the mean molecular weight and $R'$ is the gas constant. (Note: $dN = N_A\,dm/\mu$,
$N_A$ = Avogadro's number.) Now since

$$C_p - C_v = R' = kN_A$$

and $C_vT'/\mu$ = internal energy per unit mass = $dU/dm$, we can write $dT$ as

$$dT = \frac{3}{2}(C_p - C_v)\,T'\,\frac{dm}{\mu} = \frac{3}{2}\left(\frac{C_p}{C_v} - 1\right)\frac{C_vT'}{\mu}\,dm = \frac{3}{2}(\gamma - 1)\,dU$$

Therefore, for the whole star,

$$T = \frac{3}{2}(\gamma - 1)\,U \qquad (4.113)$$

$\gamma$ being the ratio of the two specific heats, $C_p$ and $C_v$.

For an ideal monatomic gas $\gamma = 5/3$, giving $T = U$. This result is obvious from the
fact that for a monatomic ideal gas all the internal energy is translational kinetic energy of
the atoms. For diatomic gases, $\gamma = 7/5$, giving $T = (3/5)U$, and so on.

But in general, for any gravitating system, such as a star, the virial theorem (Eq. (4.111))
gives

$$U = \frac{2T}{3(\gamma - 1)} = \frac{-V}{3(\gamma - 1)} \qquad (4.114)$$

Therefore, the total energy

$$E = U + V = \frac{\gamma - 4/3}{\gamma - 1}\,V \qquad (4.115)$$

For $\gamma > 4/3$, $E < 0$ (since $V < 0$) and therefore the star is stable. A star is unstable
if $\gamma < 4/3$. Hence a gaseous planet entirely made up of triatomic gases such as CO$_2$, H$_2$O
or NO$_2$ with $\gamma = 9/7$ will become virially unstable.
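
The stability criterion of Eq. (4.115) can be tabulated for the three values of $\gamma$ mentioned in the text. The sketch below uses exact rational arithmetic; the function name is illustrative. Since $V < 0$, a positive ratio $E/V$ means $E < 0$, that is, a bound and stable configuration.

```python
from fractions import Fraction

def energy_over_potential(gamma):
    """E / V from Eq. (4.115): (gamma - 4/3) / (gamma - 1)."""
    return (gamma - Fraction(4, 3)) / (gamma - 1)

for label, gamma in (("monatomic", Fraction(5, 3)),
                     ("diatomic", Fraction(7, 5)),
                     ("triatomic", Fraction(9, 7))):
    ratio = energy_over_potential(gamma)
    verdict = "stable" if ratio > 0 else "unstable"
    print(label, gamma, ratio, verdict)
```

The ratios come out as 1/2, 1/6 and −1/6 respectively, so only the case $\gamma = 9/7$ gives $E > 0$.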

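(iv) The virial for a system of charged particles moving in a magnetic field is given by

$$\text{the magnetic virial } W_m = \sum_{i=1}^{N}\mathbf{F}_i\cdot\mathbf{r}_i$$

where $\mathbf{F}_i$ is the magnetic force on the $i$th particle having charge $e_i$, given by
$\mathbf{F}_i = e_i(\mathbf{v}_i \times \mathbf{B})$, $\mathbf{B}$ being the magnetic induction. Therefore,

$$W_m = \sum_{i=1}^{N} e_i\,\mathbf{r}_i\cdot(\mathbf{v}_i\times\mathbf{B}) = \sum_{i=1}^{N}\frac{e_i}{m_i}\,(m_i\mathbf{r}_i\times\mathbf{v}_i)\cdot\mathbf{B} = \frac{e}{m}\,\mathbf{L}\cdot\mathbf{B} \qquad (4.116)$$

provided all charged particles have the same $e/m$ ratio. $\mathbf{L}$ is the total angular momentum
of the system. This virial should be added to the gravitational virial, if any.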


4.17 SUMMARY 

Central force is a class of prescribed laws of forces that defines a point acting as a central
origin of force, the magnitude of the force being a function of only the distance from this 
origin and the direction being either parallel (repulsive) or antiparallel (attractive) to the 
radius vector. For the conservative central forces, the energy and the vector angular mo¬ 
mentum are the first integrals of motion. The two body problem can always be reduced to a 
single body problem but at the cost of the conservation of linear momentum and the inertial 
nature of the frame. A third integral of motion belonging to the class of the first integrals, 
called the Runge-Lenz vector, exists only for two special types of central forces: one being 
the inverse square law (both attractive and repulsive) and the other Hooke’s linear law. 
By Bertrand’s theorem, all stable and bounded orbits become also closed only under these 
two types of central forces. Any deviation from these two types of the force laws makes 
all noncircular orbits precess about their apsidal lines and fill the space densely like the 
space-filling curves. 

Newton derived the law of gravitation from Kepler’s laws of planetary motions combined 
with his own laws of motion. Later on, Kepler’s laws were also derived back from Newton’s 
laws of motion and of gravitation. In the theory of planetary motions, the product of the 
gravitational constant and the mass of the central body ( GM) comes always as a unit, 
and therefore, neither the gravitational constant nor the mass of the central body can be 
determined separately. Planets which do not have any satellites could not have their masses 
determined except from the weak perturbations that they produce on other nearby planets. 

It is easy to fix the orbit of a planet from the knowledge of the five time independent
constants of motion, but not so easy to calculate the exact position in the orbit at a given
time. This requires the solution of Kepler's equation which is not integrable in closed form
except for the parabolic orbits.

The transfer orbits between two orbits are most economic if the firing from the inner 
orbit takes place at the perigee rather than at the apogee. The geostationary orbit is a 
unique orbit around the earth and is a very special case of the infinitely many possible 
geosynchronous orbits. 

Tides are produced on any extended body basically due to the nonuniformity of the 
(central) force of the tide-raising source over the physical extension of the body, the rear 
parts being less attracted than the central part and the central part being less attracted 
than the front parts. Only fluids and plastic solids can respond to tidal forces. It is the tiny 
tangential component of the tidal force that acts on the fluid elements, makes it work against 
the local gravity and gain height. The tidal bulge actually fills the deformed equipotential 
surface, which assumes the shape of a prolate spheroid in presence of an orbiting satellite. 

Scattering in the field of a central force is a process that makes an incident beam of 
particles lose the sense of their original directions. The actual area of cross-section of the 
incident beam that is lost once the beam is well past the scatterer, is called the total cross- 
section of scattering. The differential cross-section is the loss of area from the incident 
beam per unit solid angle of the angle of scattering. Rutherford scattering in a field of force 
obeying an inverse square law of distance suggests an extremely rapid increase in the angle 
of deflection at close enough encounters, which enabled Rutherford to measure the smallness 
of the atomic nuclei. In astronomical situations with attractive force fields, the sun-like stars 
gobble up, instead of deflecting away, practically anything that is originally targeted within 
a radius of about 25 times the radius of the star. The scattering of any small object such 
as a spaceship by Jupiter can be so arranged that Jupiter’s orbital momentum is imparted 
to the spaceship profitably enough to send the latter out of the solar system. 

The virialisation of a dynamical system can be viewed as a dynamical process of relaxing 
in which an equilibrium ratio between the kinetic and the potential energy is achieved. 
This equilibrium ratio depends on the exponent of the power law of central forces. For 
the gravitational fields, the equilibrium share of the kinetic energy is only one half of its 
potential energy, no matter how one prepares the original system, subject however to the 
condition that the system does not disperse away before it is virialised. 


PROBLEMS 

4.1 A radially stretched and self supported vertically hanging rope is in circular orbit 
around the Earth’s equator. Find the length, maximum tension and total energy of 
the rope, if its lower end remains hanging just over the ground and never touches. 

4.2 Consider a Keplerian two body system having unequal masses. Set up the Lagrangian 
of the system with respect to the centre of mass frame. Show that both the particles 
describe respective conic sections but with identical eccentricity. Find the share of 
the angular momentum and energy for each particle. 


4.3 (a) Find the conditions for stability of circular orbits in a screened Coulomb potential
given by

$$V(r) = -\frac{K}{r}\exp\left(-\frac{r}{a}\right), \qquad K = Ze^2 > 0, \quad a > 0$$

Illustrate graphically the radial variation of the effective potential. 

(b) How are the stable circular orbits on the surface of a cone, if the cone is put upside 
down with its vertex pointing downward? 

4.4 Suppose an astronomer finds a light object orbiting a motionless heavy one in a circle 
that has the heavy one located on its circumference, instead of at the centre. Deduce 
the force law and the system’s energy. Assume H a = const. 

4.5 Show that if the equation of the orbit is $\theta = f(r)$ and the force is central, that is,
$r^2\dot{\theta} = \text{constant} = H$, the radial acceleration is given by

$$\ddot{r} - r\dot{\theta}^2 = -\frac{H^2}{r^3}\left\{\frac{2}{r^2[f'(r)]^2} + \frac{f''(r)}{r[f'(r)]^3} + 1\right\}$$

Find the exponent of the power law nature of the central force if the orbit is

(i) spiral, that is, $\theta \propto r^{-1}$ and

(ii) $r = a\tanh(\theta/\sqrt{2})$


4.6 (i) If jerk $\mathbf{j}$ is defined by $\mathbf{j} = \dot{\mathbf{a}}(t)$, find an expression for jerk for Keplerian orbits.

(ii) We know that the speed, acceleration and angular speed of a planet are all maximum
at the perihelion and minimum at the aphelion. Find the locations where
angular accelerations and decelerations are maximum.

(iii) Show that the synodic period (return to the same configurational positions relative
to the earth and sun) $S$ for both superior (outside the earth’s orbit) and inferior
(inside the earth’s orbit) planets is given by

$$S^2 = \frac{a^3}{(1 - a^{1.5})^2}$$

where $a$ is the semimajor axis of the planet in AU and $S$ is measured in yr.

4.7 Show that the equivalent of Kepler’s equation for the parabolic orbits is analytically
integrable. Use the result to calculate how long a comet approaching the sun in a
parabolic orbit would spend inside the earth’s orbit (radius = 1 AU = $1.496 \times 10^{11}$ m),
assuming the perihelion distance of the comet from the sun to be 0.587 AU. Take
$M_\odot = 1.987 \times 10^{30}$ kg.

4.8 (a) Derive Kepler’s equation analytically with the definition of E given by Eqs (4.42). 
(b) Expand it in terms of g and find a series expansion for v in terms of g and 
eccentricity e. 

4.9 Would an electron exposed to solar radiation pressure and gravity be expelled from 
the solar system? What is the critical luminosity for doing this job ? 


4.10 Calculate the minimum velocity a spacecraft needs in order to escape from the solar 
gravitational field, starting from the surface of the earth. 


4.11 As the moon is revolving round the earth and the earth round the sun, one would 
expect the moon’s path to be convex towards the sun while the moon is in her crescent 
phases. Show that this can actually never happen, even under the circumstances of 
the new moon. 

4.12 Find the time-average of $r$, the $\theta$-average of $r$ and the arc-length-average of $r$ for a
Keplerian elliptical orbit having semimajor axis $a$ and eccentricity $e$.

4.13 Science fiction writers such as John Norman writing ‘Chronicles of Counter Earth’, 
have described a sister planet which shares the same orbit with the earth. When the 
earth is at perihelion, the sister planet is at aphelion. How could earthlings launch 
a spacecraft to explore this counter earth? (The problem is to find a satisfactory 
transfer orbit). When the earth is at one end of the semilatus rectum (given a = 1.00 
AU and e = 0.01674), find the angle between the sun and the counter earth. 

4.14 A space vehicle moving in a circular orbit of radius $r_1$ transfers to a circular orbit of
radius $r_2$ by means of an elliptical transfer orbit, called the Hohmann transfer orbit.
With what speed should the spacecraft be launched from the earth’s surface in order
to send it to Mars in least possible time? How long is the in-flight time?

4.15 It is planned to launch a satellite (of mass m ) in a circular orbit in the central inverse 
square gravitational force field of the earth. The satellite is to be ‘rocketed’ to a point 
P at altitude h and given the proper value of the kinetic energy with direction of 
motion perpendicular to the line joining P and the centre of the earth. Everything 
goes as planned, except the direction of motion is at the angle $\theta \neq 90°$. Show that
(a) the orbit is an ellipse with its semimajor axis length equal to the radius $R_\oplus + h$
of the intended circular orbit ($R_\oplus$ = radius of the earth), (b) the point P is at one
end of the minor axis of the ellipse, and (c) the eccentricity of the ellipse is $\cos\theta$.

4.16 How large is the classical deflection of a light ray that grazes past a neutron star
(mass = $3 \times 10^{30}$ kg, radius = 10 km)?

4.17 Show that elastic scattering by a hard paraboloidal surface follows the same law of 
angular distribution as that of the Rutherford scattering in a repulsive inverse square 
law of force field. Find the effective focal length of the latter. 

4.18 The supernova 1987A is situated about 170,000 light years away from the earth and is
expected to have released $10^{48}$ J of energy in the form of neutrinos. If the average
energy of neutrinos is about 10 MeV and the Čerenkov detector containing about 2000
metric tons of water at Kamiokande in Japan catches about 10 neutrinos coming from
the supernova through reactions of the type $\bar{\nu}_e + p \to n + e^+$, find the cross-section
for this neutrino (actually antineutrino) reaction.

4.19 Show that the difference between the scattering angles in the lab frame and the CM
frame is $\sin^{-1}(x\sin\theta_{\text{lab}})$, where $x$ = the mass ratio of the scattered particle and
the force centre.

4.20 Find the angular frequency of the precession of the pericenter of the orbit of any 
planet if it experiences a small perturbation due to the following noninverse square
law forces:

(i) Yukawa type of short range potential $V(r) = -\frac{K}{r}e^{-r/d}$, where $r \gg d$, and $K$ and
$d$ are constants.

(ii) Hall type of potential $V(r) = -K/r^{1+\delta}$ $(\delta \ll 1)$, $\delta$ being a small constant.

(iii) $V(r) = -K/r - K'/r^2$.

4.21 Given the Runge-Lenz vector $\mathbf{A} = (-2\mu E)^{-1/2}(K\mathbf{r}/r + \mathbf{L}\times\mathbf{p})$, where $\mathbf{L} =
\mathbf{r}\times\mathbf{p}$, $K = GMm$, $\mu = mM/(m + M)$, and energy $E = mv^2/2 + V(r)$. Now
if the force field is a combination of inverse square and inverse cubic attractive forces,
say $\mathbf{F}(\mathbf{r}) = -K\mathbf{r}/r^3 - K'\mathbf{r}/r^4$, we know that the perihelion will be precessing at
a steady rate. This means that it is possible to eliminate precession and the effect of
the cubic law of force if we move to a suitable rotating frame of reference. Show that
$L'^2 = L^2 - K'$, and $e' = \sqrt{1 + (2EL'^2/K^2)}$, where $L'$ and $e'$ are the angular
momentum and the eccentricity as seen from this rotating frame.

4.22 The nucleus of an atom of gold is bombarded with alpha particles each having a kinetic 
energy 7.6 MeV. Determine the spatial domain around the gold nucleus, which is not 
accessible to the alpha particles. 

4.23 If the sun is moving through the interstellar cloud with a speed of 23 km/s and the
interstellar cloud has a density of 5 hydrogen atoms per cm$^3$, how much would be the
yearly increase in the mass of the sun due to accretion?

4.24 Show that when measured with respect to the mean sea level the gain in tidal height 
at the (sublunar) high tide is about twice the drop in tidal height at its low tide. 

4.25 Assume that a close pair of identical satellites of radius $r_s$ and mass density $\rho_s$ is
revolving around a primary planet of radius $R_p$ and mass density $\rho_p$. If the pair
touches each other physically and are radially aligned for a moment, can there be
a situation for which the mutual gravitational attraction between the pair will not
be sufficient to resist the tidal force of the primary acting on them? Show that the
condition for winning over the tidal disruption does not depend on the size of the
satellites, but only on the ratios of densities and the orbital radius to $R_p$. The latter
ratio, called the Roche lobe ratio, is an ideal ratio for ring formation inside the Roche
lobe of any homogeneous spherical object.

4.26 A rotating and self-gravitating interstellar cloud is in virial equilibrium having a 
magnetic field trapped inside. Find the virial for the system. 


5 

Hamilton’s Equations 
of Motion 


5.0 INTRODUCTION 

In chapter 2, we were concerned exclusively with Lagrangian dynamics, that is, the equations
of motion were obtained from a knowledge of the Lagrangian of the system. For a
holonomic system described by a set of n generalised coordinates, there exists an equivalent
formulation of its dynamics, a formulation in terms of its Hamilton’s function or the
Hamiltonian. Just like the Lagrangian, the Hamiltonian of a system can be used to obtain
the equations of motion of the system. The chief disadvantage of Lagrange’s equations of
motion is that they are second order total differential equations in generalised coordinates.
Hamilton’s equations of motion, on the other hand, are first order total differential equations
in generalised coordinates and generalised momenta.

Sir William Rowan Hamilton (1805 - 1865), born in Dublin, was said to be a child prodigy. 
Having been brought up by his uncle, a distinguished philologist, William could read English 
at the age of three, became a good geographer at the age of four, could translate Latin, Greek 
and Hebrew at the age of five, and by 13, he mastered 13 languages that included Hindi, 
Marathi, Bengali, Sanskrit, Malayalam and Chinese. All his life he loved animals and, what 
is regrettably rarer, respected them as equals. Up to the age of 15, he showed little sign of 
interest in science or mathematics. However, he started reading Principia at the age of 16, 
and by 17 he mastered mathematics through integral calculus, gained enough knowledge in 
mathematical astronomy to enable him to calculate the timings of eclipses, and wrote Part 
I of his book, A Theory of Systems of Rays, which got published when he was 22. Hamilton
never attended any school before going to university, though he won admission to Trinity College
at 18 by scoring the highest marks in the admission test, which was contested by about 100
candidates.

By 21, he remodeled geometrical optics entirely. He demonstrated that all researches on
any system of optical rays can be reduced to the study of a single function, called the
‘characteristic function’ (we shall read more about it in chapters 6 and 10). He became the Royal
Astronomer of Ireland, the most prestigious Professor’s Chair won by an undergraduate
of 22. Within a couple of years, he turned towards mechanics and remodeled it entirely
in terms of his ‘characteristic function’. His contemporaries had mixed feelings about his
works; some said his whole scheme was just an intellectual exercise, others thought him to
be a genius and a real pioneer. It was only after the advent of quantum mechanics in the
twentieth century that his works got proper attention. Now everyone knows how crucial a
role is played by the concept of the Hamiltonian in quantum mechanics, Hamilton’s principle
in all field theories, Hamilton’s principal function in the path integral formulation of
quantum mechanics, and the Hamilton-Jacobi differential equation for a complete classical
solution to dynamical problems. Through the works of Gibbs, his incipient idea of phase
space became the only useful way of studying statistical mechanics.

However, for the purpose of the present chapter, the logical connection between the
Lagrangian and Hamiltonian is formally established through the so called Legendre
transformation, which we now proceed to describe.


5.1 LEGENDRE’S DUAL TRANSFORMATION 


Theorem: Let a function $F(u_1, u_2, \ldots, u_n)$ have an explicit dependence on the n independent
variables $u_1, u_2, \ldots, u_n$. Let the function F be transformed to another function $G =
G(v_1, v_2, \ldots, v_n)$ expressed explicitly in terms of a new set of n independent variables
$v_1, v_2, \ldots, v_n$, where these new variables are connected to the old variables by a given set of
relations

$$v_i = \frac{\partial F}{\partial u_i}, \qquad i = 1, \ldots, n \tag{5.1}$$

and the form of G is given by $G(v_1, v_2, \ldots, v_n) = u_i v_i - F(u_1, u_2, \ldots, u_n)$, summation over
the repeated index i being implied. Then the variables $u_1, \ldots, u_n$ satisfy the dual
transformation, namely, the relations

$$u_i = \frac{\partial G}{\partial v_i}, \qquad i = 1, \ldots, n$$

and

$$F(u_1, \ldots, u_n) = u_i v_i - G(v_1, \ldots, v_n) \tag{5.2}$$

This duality of transformation between the two functions $F(u_1, \ldots, u_n)$ and $G(v_1, \ldots, v_n)$
and also between the two sets of variables given by the Eqs (5.1) and (5.2) are referred to
as Legendre’s dual transformation.


Proof. Since the form of G is given by

$$G(v_1, \ldots, v_n) = u_i v_i - F(u_1, \ldots, u_n)$$

from the left-hand side, $\delta G = (\partial G/\partial v_i)\,\delta v_i$, and from the right-hand side

$$\delta G = u_i\,\delta v_i + v_i\,\delta u_i - \frac{\partial F}{\partial u_i}\,\delta u_i$$

so that

$$\frac{\partial G}{\partial v_i}\,\delta v_i = u_i\,\delta v_i + \left(v_i - \frac{\partial F}{\partial u_i}\right)\delta u_i$$

Since it is given that $v_i = \partial F/\partial u_i$, one must have its dual

$$u_i = \frac{\partial G}{\partial v_i}$$

as the $\delta v_i$’s are arbitrary because all $v_i$’s are independent. Thus the duality of the transformation
is proved. Note that $G = u_i v_i - F$ can simply be rearranged to write $F = u_i v_i - G$.
Further, it is easy to see that we could have started from Eq. (5.2) and $F = u_i v_i - G$
and proved Eq. (5.1) and $G = u_i v_i - F$ in exactly the same way.
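
The duality can be checked numerically on a simple case. The following is a minimal sketch, assuming the single-variable function $F(u) = au^2/2$ (an illustrative choice, not taken from the text); the constant `A` and all function names are hypothetical. It forms $v = \partial F/\partial u$, builds $G(v) = uv - F$, and then recovers $u$ from $\partial G/\partial v$ by a finite difference.

```python
# Legendre's dual transformation for F(u) = A*u**2/2 (illustrative choice).
# Here v = dF/du = A*u, so u = v/A and G(v) = u*v - F(u) = v**2/(2*A).

A = 3.0  # arbitrary positive constant, assumed for the example


def F(u):
    return 0.5 * A * u * u


def v_of_u(u):
    """Eq. (5.1): v = dF/du for the chosen F."""
    return A * u


def G(v):
    """G(v) = u*v - F(u), with u eliminated through u = v/A."""
    u = v / A
    return u * v - F(u)


def derivative(f, x, h=1e-6):
    """Central finite-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def dual_check(u):
    """Return (u recovered from dG/dv, F recovered from u*v - G)."""
    v = v_of_u(u)
    return derivative(G, v), u * v - G(v)


if __name__ == "__main__":
    u0 = 1.7
    u_back, F_back = dual_check(u0)
    print(u0, u_back)     # the two numbers should agree closely
    print(F(u0), F_back)  # likewise
```

The finite difference stands in for the analytic derivative only to keep the sketch independent of any symbolic library.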


5.1.1 Extension of the Theorem to Include Passive Variables 


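Now suppose that there is a further set of m independent passive variables $w_1, \ldots, w_m$
which are present in both F and G. Then there would be some extra conditions for Legendre’s
dual transformation to be satisfied. These conditions are

$$\frac{\partial F}{\partial w_i} = -\frac{\partial G}{\partial w_i}, \qquad i = 1, \ldots, m \tag{5.3}$$

Proof. With $G(v, w) = u_i v_i - F(u, w)$, the variation of the left-hand side is
$\delta G = (\partial G/\partial v_i)\,\delta v_i + (\partial G/\partial w_k)\,\delta w_k$, while that of the right-hand side is
$u_i\,\delta v_i + (v_i - \partial F/\partial u_i)\,\delta u_i - (\partial F/\partial w_k)\,\delta w_k$. Since $v_i = \partial F/\partial u_i$ and the
variations $\delta v_i$ and $\delta w_k$ are arbitrary, one must have

$$u_i = \frac{\partial G}{\partial v_i} \quad\text{and}\quad \frac{\partial G}{\partial w_i} = -\frac{\partial F}{\partial w_i}$$

which proves the assertion. Note that the latter relations have acquired a negative sign.

Example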

In thermodynamics the four thermodynamic potentials, namely,

the internal energy $U' = U'(S', V')$
the free energy $F' = F'(V', T')$
the enthalpy $H' = H'(S', P')$
and Gibbs’ potential $G' = G'(P', T')$

are all connected by Legendre’s dual transformation. Here the independent variables are
any two, out of the entropy S', volume V', temperature T' and pressure P'. A change in
the pair of independent variables defines a new potential function which is connected to the
old one by a suitable Legendre’s dual transformation.

For example, a dual transformation of $U'(S', V')$ can be the free energy $F'(V', T')$ where
V' remains as the passive variable and the variable S' is transformed to T'. Of course,
this is possible if a relation like

$$T' = \frac{\partial U'}{\partial S'}$$

exists for a change over of the active variable S' in U' to T' in F'. Then by Legendre’s
dual transformation theorem (except for a negative sign), we shall have

$$F'(V', T') = U'(S', V') - T'S' \quad\text{and}\quad \frac{\partial F'}{\partial V'} = \frac{\partial U'}{\partial V'}$$

Similarly one can find that

$$H' = U' + P'V' \quad\text{with}\quad P' = -\frac{\partial U'}{\partial V'} \quad\text{and}\quad V' = \frac{\partial H'}{\partial P'}$$

and

$$G' = F' + P'V' \quad\text{with}\quad P' = -\frac{\partial F'}{\partial V'} \quad\text{and}\quad V' = \frac{\partial G'}{\partial P'}$$

as dual transforms of U' and F' respectively.


5.2 HAMILTON’S FUNCTION AND HAMILTON’S EQUATIONS OF MOTION


Let us now apply Legendre’s dual transformation to the Lagrangian of a system
$L(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$ with $\dot{q}_i$ as the active variables and $q_i$ and $t$ as the passive
variables. The dual variables of $\dot{q}_i$, $i = 1, \ldots, n$ are given by the generalised momenta

$$p_i = \frac{\partial L}{\partial \dot{q}_i}, \qquad i = 1, \ldots, n \tag{5.4}$$

Hence the dual function of the Lagrangian L is

$$H = p_i \dot{q}_i - L(q, \dot{q}, t) \tag{5.5}$$

where

$$H = H(q_1, \ldots, q_n, p_1, \ldots, p_n, t) = H(q, p, t)$$

and in short, is called Hamilton’s function or the Hamiltonian of the system. The dual
transformation of Eq. (5.4) is

$$\dot{q}_i = \frac{\partial H}{\partial p_i}, \qquad i = 1, \ldots, n \tag{5.6}$$

and the Eqs (5.3) for the passive variables take the form

$$\frac{\partial L}{\partial t} = -\frac{\partial H}{\partial t} \tag{5.7}$$

and

$$\frac{\partial L}{\partial q_i} = -\frac{\partial H}{\partial q_i}, \qquad i = 1, \ldots, n \tag{5.8}$$

We know that provided (i) there are no non-potential forces, (ii) the system is holonomic
and bilateral, and (iii) Euler-Lagrange’s equations of motion are valid, one can write

$$\frac{\partial L}{\partial q_i} = \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) = \dot{p}_i$$

Substituting in Eq. (5.8) we get

$$\dot{p}_i = -\frac{\partial H}{\partial q_i}, \qquad i = 1, \ldots, n \tag{5.9}$$

The set of Eqs (5.6) and (5.9) together is called the Hamilton’s equations of motion or
the canonical equations of motion. Sir William Hamilton derived these equations in 1835.
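
As an illustration of how Eqs (5.6) and (5.9) are used in practice, the sketch below integrates them for one degree of freedom. It is only a minimal example under stated assumptions: the Hamiltonian $H = p^2/2m + mgq$ of a particle in uniform gravity, the classical Runge-Kutta stepping scheme, the finite-difference partial derivatives and all names are choices made here, not part of the text.

```python
def hamilton_rhs(H, q, p, t, h=1e-6):
    """Right-hand sides of Eqs (5.6) and (5.9) for one degree of freedom,
    with the partial derivatives of H estimated by central differences."""
    dH_dp = (H(q, p + h, t) - H(q, p - h, t)) / (2.0 * h)
    dH_dq = (H(q + h, p, t) - H(q - h, p, t)) / (2.0 * h)
    return dH_dp, -dH_dq  # (q_dot, p_dot)


def rk4_step(H, q, p, t, dt):
    """One classical fourth-order Runge-Kutta step for (q, p)."""
    k1q, k1p = hamilton_rhs(H, q, p, t)
    k2q, k2p = hamilton_rhs(H, q + 0.5 * dt * k1q, p + 0.5 * dt * k1p, t + 0.5 * dt)
    k3q, k3p = hamilton_rhs(H, q + 0.5 * dt * k2q, p + 0.5 * dt * k2p, t + 0.5 * dt)
    k4q, k4p = hamilton_rhs(H, q + dt * k3q, p + dt * k3p, t + dt)
    q_new = q + dt * (k1q + 2.0 * k2q + 2.0 * k3q + k4q) / 6.0
    p_new = p + dt * (k1p + 2.0 * k2p + 2.0 * k3p + k4p) / 6.0
    return q_new, p_new


def integrate(H, q0, p0, t_end, n_steps):
    """Advance (q, p) from t = 0 to t = t_end in n_steps equal steps."""
    q, p, t = q0, p0, 0.0
    dt = t_end / n_steps
    for _ in range(n_steps):
        q, p = rk4_step(H, q, p, t, dt)
        t += dt
    return q, p


def falling_particle(q, p, t, m=2.0, g=9.8):
    """Assumed example: H = p^2/(2m) + m*g*q, with q the height."""
    return p * p / (2.0 * m) + m * g * q


if __name__ == "__main__":
    # Released from rest at height 10; compare with q = 10 - g*t^2/2, p = -m*g*t.
    print(integrate(falling_particle, 10.0, 0.0, 1.0, 100))
```

Because the two first order equations are handled as a pair, any Hamiltonian of the form `H(q, p, t)` can be passed to `integrate` without rewriting the stepping code.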


5.3 PROPERTIES OF THE HAMILTONIAN AND OF HAMILTON’S 
EQUATIONS OF MOTION 


1. If the Lagrangian does not have any explicit dependence on time, the Hamiltonian 
also does not depend explicitly on time. This simply follows from Eq. (5.7) above. 

2. Consider

$$\frac{dH}{dt} = \frac{d}{dt}H(q_1, \ldots, q_n, p_1, \ldots, p_n, t) = \frac{\partial H}{\partial q_i}\dot{q}_i + \frac{\partial H}{\partial p_i}\dot{p}_i + \frac{\partial H}{\partial t} = -\dot{p}_i\dot{q}_i + \dot{q}_i\dot{p}_i + \frac{\partial H}{\partial t} = \frac{\partial H}{\partial t} \tag{5.10}$$

where we have used Hamilton’s equations of motion. Thus if the Lagrangian or equivalently
the Hamiltonian does not explicitly depend on time, the Hamiltonian is a constant of motion.
In fact, in this case,

$$H = p_i\dot{q}_i - L = E$$

is the energy integral for conservative systems, for which $H = T + V = E$. Otherwise,
in general if $H \neq H(t)$ and $L \neq L(t)$, there will exist a constant of motion, called the
Jacobi integral given by the function

$$H = \text{a const.} = J$$

which need not be identical with the actual energy E.

3. Hamilton’s equations of motion are first order total differential equations. But the total
number of equations of motion is 2n, unlike the n second order total differential equations
for Euler-Lagrange’s equations of motion. Thus, even in the Hamiltonian formulation, a
system with the number of DOF = n has 2n - 1 independent constants of motion, as it
should.

4. Hamilton’s equations of motion are symmetric in $q_i$ and $p_i$ except for a change in sign
in the second set. Thus it does not make any essential difference, which of the two sets of
quantities $q_i$’s and $p_i$’s is called the coordinates or the momenta. Their roles can be trivially
interchanged just by making a change of (relative) sign. Thus the new set of variables, say
$(Q_i, P_i)$ defined by

$$Q_i = -p_i \quad\text{and}\quad P_i = q_i$$

should leave Hamilton’s equations of motion unchanged and hence are equivalent to the set
$(q_i, p_i)$ for the description of the dynamics of the system. Thus generalised momenta and
coordinates are dynamically equivalent sets of variables. This is quite explicit in quantum
mechanics, even though this point was never appreciated during Hamilton’s life time, and
some people even branded him as a crazy man, with crazy ideas, that are purely mathematical
and which have no relevance to meaningful physics!

5. The Hamiltonian and Hamilton’s equations of motion can be derived only for holonomic
systems. This is a cognisable restriction for the dynamics of bodies of large mass,
whose motion is constrained by hard and immovable surfaces of various kinds. However,
for the dynamics of microscopic systems like the atoms, molecules, etc. the forces involved
are the known definite forces exerted by microscopic particles on one another and can be
accounted for as the applied forces on the system. In these circumstances, the above restriction,
and indeed the whole concept of constraints and constraint forces, becomes artificial
and unnecessary. So the Hamiltonian dynamics can fundamentally enjoy an unrestricted
applicability in the domain of microcosms.

6. The knowledge of the Hamiltonian of a system is extremely important, particularly if 
we are interested in quantising a dynamical system. As a rule, one starts with the classical 
Hamiltonian function and then replaces the generalised coordinates and momenta by the 
corresponding differential operators or the matrices, for setting up the Schrödinger equation
or Heisenberg’s matrix equation. 
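
Properties 2 and 4 above lend themselves to a quick numerical check. The sketch below is illustrative only: the bead sliding on a frictionless straight wire that rotates uniformly in a horizontal plane, the oscillator Hamiltonian, and every name and parameter value are assumptions chosen for the example and do not come from the text. For the bead, $L = \frac{1}{2}m(\dot{r}^2 + \omega^2 r^2)$ has no explicit time dependence, so $H = \frac{1}{2}m\dot{r}^2 - \frac{1}{2}m\omega^2 r^2$ is a Jacobi integral, whereas the actual energy is purely kinetic and grows with time. For the oscillator, the swapped variables $Q = -p$, $P = q$ are tested against Hamilton’s equations written with the same Hamiltonian.

```python
import math

# --- Property 2: Jacobi integral versus energy --------------------------
# Bead on a wire rotating at constant angular speed OMEGA (assumed example).
# Released at rest relative to the wire at R0, it follows r = R0*cosh(OMEGA*t).
M, OMEGA, R0 = 1.0, 2.0, 1.0


def bead_state(t):
    r = R0 * math.cosh(OMEGA * t)
    r_dot = R0 * OMEGA * math.sinh(OMEGA * t)
    return r, r_dot


def jacobi_integral(t):
    """H = p_r*r_dot - L = m*r_dot**2/2 - m*OMEGA**2*r**2/2."""
    r, r_dot = bead_state(t)
    return 0.5 * M * r_dot**2 - 0.5 * M * OMEGA**2 * r**2


def energy(t):
    """Actual (purely kinetic) energy of the bead in the inertial frame."""
    r, r_dot = bead_state(t)
    return 0.5 * M * (r_dot**2 + OMEGA**2 * r**2)


# --- Property 4: interchanging coordinates and momenta ------------------
def H_osc(q, p, m=1.5, k=4.0):
    return p * p / (2.0 * m) + 0.5 * k * q * q


def K_osc(Q, P):
    """The same Hamiltonian in the swapped variables Q = -p, P = q."""
    return H_osc(P, -Q)


def partial(f, x, y, index, h=1e-6):
    """Central-difference partial derivative of f(x, y) w.r.t. argument `index`."""
    if index == 0:
        return (f(x + h, y) - f(x - h, y)) / (2.0 * h)
    return (f(x, y + h) - f(x, y - h)) / (2.0 * h)


def swapped_flow_matches(q, p, tol=1e-6):
    q_dot = partial(H_osc, q, p, 1)    # q_dot =  dH/dp
    p_dot = -partial(H_osc, q, p, 0)   # p_dot = -dH/dq
    Q, P = -p, q
    Q_dot = partial(K_osc, Q, P, 1)    # Q_dot =  dK/dP
    P_dot = -partial(K_osc, Q, P, 0)   # P_dot = -dK/dQ
    # Q = -p and P = q require Q_dot = -p_dot and P_dot = q_dot.
    return abs(Q_dot + p_dot) < tol and abs(P_dot - q_dot) < tol


if __name__ == "__main__":
    for t in (0.0, 0.5, 1.0):
        print(t, jacobi_integral(t), energy(t))
    print(swapped_flow_matches(0.3, -0.7))
```

In the bead example the constraint (the rotating wire) does work on the bead, which is why the conserved quantity $J$ differs from $E$.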


5.4 ROUTHIAN


This is another potential function constructed out of the Lagrangian and plays a role somewhat
intermediate between the Lagrangian and the Hamiltonian. The Routhian is a function
of mixed variables $q_i$, $\dot{q}_i$ and $p_i$, where the number of $q_i$ coordinates is n = the number
of DOF and the rest n velocity-like independent variables are shared by $\dot{q}$’s and $p$’s. The
construction of a Routhian is meaningful only if there are some cyclic coordinates in the
Lagrangian. If the first k out of n coordinates are cyclic in L then the Routhian for such
a system is defined as

$$R = R(q_1, \ldots, q_n, p_1, \ldots, p_k, \dot{q}_{k+1}, \ldots, \dot{q}_n, t) = \sum_{i=1}^{k} p_i\dot{q}_i - L(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t) \tag{5.11}$$

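where $p_i$, $i = 1, \ldots, k$ are constants of motion. Obviously the first equality, which is merely
a functional definition of R gives

$$dR = \sum_{i=1}^{n}\frac{\partial R}{\partial q_i}\,dq_i + \sum_{i=1}^{k}\frac{\partial R}{\partial p_i}\,dp_i + \sum_{i=k+1}^{n}\frac{\partial R}{\partial \dot{q}_i}\,d\dot{q}_i + \frac{\partial R}{\partial t}\,dt$$

but the second equality in Eq. (5.11), that is, the defined explicit expression for R gives

$$dR = \sum_{i=1}^{k}\left(p_i\,d\dot{q}_i + \dot{q}_i\,dp_i\right) - \frac{\partial L}{\partial t}\,dt - \sum_{i=1}^{n}\left(\frac{\partial L}{\partial q_i}\,dq_i + \frac{\partial L}{\partial \dot{q}_i}\,d\dot{q}_i\right)$$

$$\phantom{dR} = -\sum_{i=k+1}^{n} p_i\,d\dot{q}_i + \sum_{i=1}^{k}\dot{q}_i\,dp_i - \frac{\partial L}{\partial t}\,dt - \sum_{i=1}^{n}\dot{p}_i\,dq_i$$

Now, comparing these two expressions for dR and treating the arguments of R as independent
variables, we have, for the first k coordinates

$$\dot{q}_i = \frac{\partial R}{\partial p_i}, \qquad \dot{p}_i = -\frac{\partial R}{\partial q_i} \qquad\text{for } i = 1, \ldots, k \tag{5.12}$$

But for the rest n - k coordinates, that is, from i = k + 1 to i = n, we get

$$\dot{p}_i = -\frac{\partial R}{\partial q_i} \quad\text{and}\quad p_i = -\frac{\partial R}{\partial \dot{q}_i}$$

Combining the last two equations, we then get

$$\frac{d}{dt}\left(\frac{\partial R}{\partial \dot{q}_i}\right) - \frac{\partial R}{\partial q_i} = 0 \qquad\text{for } i = k + 1, \ldots, n \tag{5.13}$$

Finally for the t variable

$$\frac{\partial R}{\partial t} = -\frac{\partial L}{\partial t}$$

Hence for the first k coordinates which are supposed to be cyclic, Eqs (5.12) are like
the Hamilton’s equations of motion with H replaced by R. They would directly conserve
momenta $p_i$ since R is cyclic in the corresponding $q_i$. The rest n - k coordinates in R
are seen in Eq. (5.13) to satisfy Lagrange like equations of motion in R instead of L. The
total energy in terms of the Routhian is therefore given by

$$E = \sum_{i=1}^{n} p_i\dot{q}_i - L = R + \sum_{i=k+1}^{n} p_i\dot{q}_i = R - \sum_{i=k+1}^{n}\dot{q}_i\frac{\partial R}{\partial \dot{q}_i} \tag{5.14}$$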


Since the first k coordinates are cyclic in R, the first k momenta are just constants
of motion and cease to remain as variables. So the Routhian function will then effectively
behave like a Lagrangian of a system having the number of DOF = n - k. In fact the
Routhian satisfies the Lagrange-like equations of motion in all these n - k coordinates.
This effective reduction in the number of DOF is the chief advantage of the use of the Routhian.
It is because of this reduction in the number of DOF that the Kepler problem in three
dimensions can be reduced to a planar problem with an effective number of DOF = 2, even
though the orbit still remains describable in terms of all the three coordinates, $r$, $\theta$ and
$\phi$. Thus the number of DOF of a system, basically determined by the number of constraint
relations, can effectively get reduced further, by the dynamical symmetries of the system.
The constants of motion can indeed serve as extra constraints, which when expressed in
terms of $q$’s and $\dot{q}$’s, may look like nonholonomic constraints. For example, in the Kepler
problem above, one can always define an arbitrary set of spherical polar coordinates, that
will lead to $p_\phi = mr^2\dot{\phi}$ = constant, which can be viewed as a nonholonomic constraint in
$r$ and $\phi$, but in reality the motion is free from any mechanical constraints.
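
The reduction can be made concrete with a small numerical sketch. It assumes the planar Kepler Lagrangian $L = \frac{1}{2}m(\dot{r}^2 + r^2\dot{\phi}^2) + K/r$ in plane polar coordinates, for which $\phi$ is cyclic and the Routhian is $R = p_\phi^2/(2mr^2) - \frac{1}{2}m\dot{r}^2 - K/r$; the unit values of $m$ and $K$ and all function names are hypothetical choices for illustration. The Lagrange-like Eq. (5.13) in $r$ and the energy expression (5.14) are evaluated by finite differences.

```python
import math


def routhian(r, r_dot, p_phi, m=1.0, K=1.0):
    """R = p_phi*phi_dot - L for the planar Kepler problem (phi cyclic)."""
    return p_phi**2 / (2.0 * m * r**2) - 0.5 * m * r_dot**2 - K / r


def radial_acceleration(r, r_dot, p_phi, m=1.0, K=1.0, h=1e-6):
    """Eq. (5.13) for r: d/dt(dR/dr_dot) = dR/dr.
    Since dR/dr_dot = -m*r_dot, this gives r_ddot = -(dR/dr)/m."""
    dR_dr = (routhian(r + h, r_dot, p_phi, m, K)
             - routhian(r - h, r_dot, p_phi, m, K)) / (2.0 * h)
    return -dR_dr / m


def energy(r, r_dot, p_phi, m=1.0, K=1.0, h=1e-6):
    """Eq. (5.14) with one non-cyclic coordinate: E = R - r_dot*dR/dr_dot."""
    dR_drdot = (routhian(r, r_dot + h, p_phi, m, K)
                - routhian(r, r_dot - h, p_phi, m, K)) / (2.0 * h)
    return routhian(r, r_dot, p_phi, m, K) - r_dot * dR_drdot


if __name__ == "__main__":
    r0 = 2.0
    p_circular = math.sqrt(1.0 * 1.0 * r0)  # p_phi^2 = m*K*r for a circular orbit
    print(radial_acceleration(r0, 0.0, p_circular))  # close to zero
    print(energy(r0, 0.3, p_circular))
```

Only $r$ and $\dot{r}$ appear as dynamical variables; the cyclic momentum $p_\phi$ enters as a fixed parameter, which is the effective reduction in the number of DOF described above.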


5.5 CONFIGURATION SPACE, PHASE SPACE AND STATE SPACE 

The n-dimensional space spanned by all the n generalised coordinates of any dynamical 
system is called the configuration space of that dynamical system. 

The phase space is, on the other hand, spanned by all the n generalised coordinates and
the corresponding n generalised momenta forming a 2n-dimensional space.

The state space is a (2n + 1)-dimensional space where one more dimension is added to
the phase space to include the parameter time. This is also sometimes called the extended
phase space.

One can also define an (n + 1)-dimensional extended configuration space or event space by
extending the n-dimensional configuration space to include the parameter time. This might 
look like a space-time continuum, but it is generally not so, as it is not a metric space. In 
fact none of the above types of fictitious mathematical constructs of hyperspaces defines 
any measure of scalar distance between two neighbouring points, so as to be called a metric, 
which should remain invariant under admissible coordinate transformations. 

The Lagrangian $L(q_1, \ldots, q_n, \dot{q}_1, \ldots, \dot{q}_n, t)$ is always described in the configuration space of
the requisite dimension set by the total number of generalised coordinates (including cyclic
ones) of the system. The set $(q_1, \ldots, q_n)$ specifies a definite point in the configuration space
which is the location of the system in the same space. The set of generalised velocities
$\dot{q}_1, \ldots, \dot{q}_n$ represents the instantaneous direction and magnitude of the motion of the point
$(q_1, \ldots, q_n)$ at time t. Euler-Lagrange’s equations of motion determine only the curvature
and torsion of the trajectory of the system in the configuration space provided the system
is represented by a particular point in this space at time t.

On the other hand, the Hamiltonian $H(q_1, \ldots, q_n, p_1, \ldots, p_n, t)$ is described in a
2n-dimensional phase space where each point has got coordinates and momenta uniquely specified
so that Hamilton’s equations of motion determine the course of evolution of the system
in the form of a definite trajectory (or ray) in the phase space. This trajectory (and hence
the motion of the system) is literally canonical, as it is predetermined, as if providentially,
by the specified Hamiltonian. But in the case of the Lagrangian and Euler-Lagrange’s
equations of motion, the generalised velocities can be arbitrarily specified through the initial
conditions for any given point in the configuration space and hence the course of evolution
is not determined by the choice of the point alone. In a phase space the trajectory of a phase
point is analogous to the trajectory of a fluid particle in the motion of a fluid system. This
is the reason why the occupant of a dynamical system in a phase space is often called a
phase fluid.

In the phase space or the configuration space the trajectories may or may not be closed. 
But in the state space, however, two different trajectories may intersect in case of collisions, 
but a single trajectory never forms a closed loop. All trajectories in state space are open. 
A closed system evolves like a bundle of fibres in the time direction. 

Example 

Let us take a simple pendulum of fixed length $l$ and bob mass $m$ (Fig. 5.1). Take $\theta$
to be the generalised coordinate. For small $\theta$ the Lagrangian of this system can be written
as

$$L(\theta, \dot{\theta}) = \frac{1}{2}m\left(l^2\dot{\theta}^2 - gl\theta^2\right)$$

where g is the constant acceleration due to gravity. This Lagrangian can be used to obtain
Euler-Lagrange’s equation of motion which has the form

$$\ddot{\theta} + \frac{g}{l}\theta = 0$$

Again the Lagrangian gives the generalised momentum as

$$p_\theta = \frac{\partial L}{\partial \dot{\theta}} = ml^2\dot{\theta}$$

Fig. 5.1 Motion of a simple pendulum in real space

Finally the expressions for the total energy and the Hamiltonian are easily obtained as

$$E = \frac{1}{2}m\left(l^2\dot{\theta}^2 + gl\theta^2\right) \quad\text{and}\quad H = \frac{p_\theta^2}{2ml^2} + \frac{1}{2}mgl\theta^2$$

The Hamiltonian yields Hamilton’s equations of motion as

$$\dot{p}_\theta = -\frac{\partial H}{\partial \theta} = -mgl\theta \quad\text{and}\quad \dot{\theta} = \frac{\partial H}{\partial p_\theta} = \frac{p_\theta}{ml^2}$$

These are the two first order total differential equations. One can combine these two coupled
equations and eliminate dt. We then get

$$\frac{dp_\theta}{d\theta} = -\frac{m^2gl^3\theta}{p_\theta}$$

which can be easily integrated to give

$$p_\theta^2 + m^2gl^3\theta^2 = \text{const.}$$

This is the equation of an ellipse in coordinates $p_\theta$ and $\theta$. Figure 5.2 shows the configuration
space, the phase space and the state space diagrams of this system.
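
The elliptical phase trajectory can be traced numerically. The following is a minimal sketch, assuming illustrative values of $m$, $l$ and $g$ and using the leapfrog (kick-drift-kick) scheme, which is a choice made here and not a method from the text. It integrates the two Hamilton’s equations of the small-angle pendulum and evaluates $p_\theta^2 + m^2gl^3\theta^2$ along the path.

```python
def pendulum_trajectory(theta0, m=0.5, l=1.2, g=9.81, dt=1e-3, n_steps=5000):
    """Integrate theta_dot = p/(m*l^2), p_dot = -m*g*l*theta from rest at
    theta0 with the leapfrog scheme; return the list of (theta, p) points."""
    theta, p = theta0, 0.0
    path = [(theta, p)]
    for _ in range(n_steps):
        p -= 0.5 * dt * m * g * l * theta   # half kick:  p_dot = -dH/dtheta
        theta += dt * p / (m * l * l)       # drift:      theta_dot = dH/dp
        p -= 0.5 * dt * m * g * l * theta   # half kick
        path.append((theta, p))
    return path


def ellipse_invariant(theta, p, m=0.5, l=1.2, g=9.81):
    """Left-hand side of p^2 + m^2*g*l^3*theta^2 = const."""
    return p * p + m * m * g * l**3 * theta * theta


if __name__ == "__main__":
    path = pendulum_trajectory(0.1)
    values = [ellipse_invariant(th, p) for th, p in path]
    spread = (max(values) - min(values)) / values[0]
    print(spread)  # small relative spread: the points stay near one ellipse
```

Plotting the `(theta, p)` pairs would reproduce the phase space panel of Fig. 5.2, and adding the time of each point as a third axis would give the open state space trajectory.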


5.6 LAGRANGIAN AND HAMILTONIAN OF RELATIVISTIC PARTICLES
AND LIGHT RAYS

One of the most famous equations of special relativistic mechanics, that is quoted even by
a layman is $E = mc^2$, given by Albert Einstein in 1905. Here E is the total energy
associated with mass m of a particle, both measured in a given inertial frame of reference.
The relativistic momentum of a free particle is $\mathbf{p} = m\mathbf{v}$. In both these expressions mass is
a frame dependent quantity. If a particle has got a rest mass $m_0$, its mass measured in a
frame moving with a relativistic velocity $\mathbf{v}$ is given by $m = m_0/\sqrt{1 - v^2/c^2}$, c being
the speed of light in vacuum.

Now, for a free particle the energy is

$$E = \mathbf{p}\cdot\mathbf{v} - L(\mathbf{r}, \mathbf{v}) \quad\text{or}\quad mc^2 = mv^2 - L(\mathbf{r}, \mathbf{v})$$

which gives

$$L(\mathbf{r}, \mathbf{v}) = m(v^2 - c^2) = -m_0c\sqrt{c^2 - v^2} \tag{5.15}$$

This is the Lagrangian of a relativistic free particle.

If the particle is moving in a conservative potential field given by $V(\mathbf{r})$, the relativistic
Lagrangian of the particle may be written as

$$L(\mathbf{r}, \mathbf{v}) = -m_0c^2\sqrt{1 - v^2/c^2} - V(\mathbf{r}) \tag{5.16}$$

In the non-relativistic limit, $(v \ll c)$, one gets from Eq. (5.16)

$$L(\mathbf{r}, \mathbf{v}) = \frac{1}{2}m_0v^2 - V(\mathbf{r}) - m_0c^2$$

Except for the constant term, $-m_0c^2$, this is the expected classical Lagrangian for a particle
moving in $V(\mathbf{r})$.
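
The non-relativistic limit can be checked numerically. In this minimal sketch the free-particle case $V = 0$ is taken, and the function names, the unit rest mass and the sample speeds are assumptions made for illustration. It compares $L + m_0c^2$ with $\frac{1}{2}m_0v^2$.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def relativistic_free_lagrangian(v, m0=1.0):
    """Eq. (5.16) with V = 0: L = -m0*c^2*sqrt(1 - v^2/c^2)."""
    return -m0 * C * C * math.sqrt(1.0 - (v / C) ** 2)


def kinetic_part(v, m0=1.0):
    """L + m0*c^2, which should approach m0*v^2/2 when v << c."""
    return relativistic_free_lagrangian(v, m0) + m0 * C * C


def relative_deviation(v, m0=1.0):
    """Fractional difference between L + m0*c^2 and the classical m0*v^2/2."""
    classical = 0.5 * m0 * v * v
    return abs(kinetic_part(v, m0) - classical) / classical


if __name__ == "__main__":
    for v in (1.0e6, 1.0e7, 1.0e8):
        print(v, relative_deviation(v))  # the deviation grows as v approaches c
```

The leading correction to the classical term is of relative order $v^2/4c^2$, so the agreement degrades steadily as the speed is raised.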

Fig. 5.2 Motion of a simple pendulum (1-D harmonic oscillator) in its 1-D configuration space (a), 2-D phase space
(b), and 3-D state space (c)

If a relativistic charged particle is moving in an electromagnetic field, the Lagrangian for
this particle can be obtained by subtracting the generalised potential $U(\mathbf{r}, \mathbf{v}) = e\phi - e(\mathbf{A}\cdot\mathbf{v})$
from the Lagrangian of a relativistic free particle given by Eq. (5.15). Thus we get, for
a relativistic charged particle

$$L(\mathbf{r}, \mathbf{v}) = -m_0c\sqrt{c^2 - v^2} - e\phi + e\mathbf{A}\cdot\mathbf{v} \tag{5.17}$$

where e is the electric charge on the particle, $\mathbf{A} = \mathbf{A}(\mathbf{r}, t)$ is the electromagnetic vector
potential and $\phi = \phi(\mathbf{r}, t)$ is the scalar electric potential at the location ($\mathbf{r}$) of the particle
at time t.

Next, we wish to derive the Lagrangian for light rays. For a freely propagating beam 
of light, whether traveling in a transparent optical medium or in vacuum, according to the 
special theory of relativity, the rest mass associated with the photon is zero. Hence from 
Eq. (5.15) the Lagrangian for light rays is 


$$L(\mathbf{r},\mathbf{v}) = 0 \tag{5.18}$$


However, this is not valid for a trapped beam of light, say inside a hot plasma. In that case 
the photon has a finite range, thus having a non-zero effective rest mass. 

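Once the Lagrangian is known, it is fairly easy to derive the corresponding Hamiltonian.
As an illustration of the procedure of constructing a Hamiltonian from a given Lagrangian,
we take the case of a relativistic free particle. First we define the canonical momentum
$$\mathbf{p} = \frac{\partial L}{\partial\mathbf{v}} = \frac{m_0\mathbf{v}}{\sqrt{1 - v^2/c^2}} \tag{5.19}$$
Since the Hamiltonian $H(\mathbf{r},\mathbf{p}) = \mathbf{p}\cdot\mathbf{v} - L$ is a function of $\mathbf{r}$ and $\mathbf{p}$ alone, we must
express $\mathbf{v}$ as a function of $\mathbf{r}$ and $\mathbf{p}$ only so that
$$H(\mathbf{r},\mathbf{p}) = \mathbf{p}\cdot\mathbf{v}(\mathbf{r},\mathbf{p}) - L(\mathbf{r},\mathbf{v}(\mathbf{r},\mathbf{p}))$$
From Eq. (5.19)
$$\sqrt{1 - v^2/c^2} = \left(1 + \frac{p^2}{m_0^2c^2}\right)^{-1/2}$$
Therefore
$$H(\mathbf{r},\mathbf{p}) = \sqrt{p^2c^2 + m_0^2c^4} \tag{5.20}$$
Hence, corresponding to the Lagrangian (Eq. (5.16)), the Hamiltonian should be
$$H(\mathbf{r},\mathbf{p}) = \sqrt{p^2c^2 + m_0^2c^4} + V(\mathbf{r}) \tag{5.21}$$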
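
The Legendre transform for the free particle can be spot-checked numerically. This is a sketch under stated assumptions: units with c = 1, one spatial dimension, and hypothetical function names. It builds H from $\mathbf{p}\cdot\mathbf{v} - L$ at a given speed and compares it with the closed form $\sqrt{p^2c^2 + m_0^2c^4}$ evaluated at the corresponding momentum.

```python
import math

C = 1.0  # units with c = 1, an illustrative choice

def momentum(m0, v):
    """Canonical momentum p = m0 v / sqrt(1 - v^2/c^2)."""
    return m0 * v / math.sqrt(1.0 - (v / C) ** 2)

def hamiltonian_from_legendre(m0, v):
    """H = p v - L with the free-particle Lagrangian L = -m0 c^2 sqrt(1 - v^2/c^2)."""
    lagrangian = -m0 * C**2 * math.sqrt(1.0 - (v / C) ** 2)
    return momentum(m0, v) * v - lagrangian

def hamiltonian_closed_form(m0, p):
    """H = sqrt(p^2 c^2 + m0^2 c^4)."""
    return math.sqrt(p**2 * C**2 + m0**2 * C**4)

for v in (0.1, 0.6, 0.99):
    p = momentum(1.0, v)
    print(v, hamiltonian_from_legendre(1.0, v), hamiltonian_closed_form(1.0, p))
```

Both columns should agree to rounding error, and both reduce to $\gamma m_0 c^2$.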


In a similar manner it can easily be shown that the Hamiltonian of a relativistic charged
particle moving in an electromagnetic field is
$$H(\mathbf{r},\mathbf{p}) = \sqrt{(\mathbf{p} - e\mathbf{A})^2c^2 + m_0^2c^4} + e\phi \tag{5.22}$$
where the canonical momentum is given by
$$\mathbf{p} = \frac{m_0\mathbf{v}}{\sqrt{1 - v^2/c^2}} + e\mathbf{A} = m\mathbf{v} + e\mathbf{A} \tag{5.23}$$




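Similarly for light rays the Hamiltonian is given by
$$H(\mathbf{r},\mathbf{v}) = \mathbf{p}\cdot\mathbf{v} - L = \mathbf{p}\cdot\mathbf{v}$$
But $\mathbf{v}$ should now be expressed as a function of $(\mathbf{r},\mathbf{p})$. In an optically transparent medium,
the speed of light is $v = c/\mu$, where $\mu$ is the refractive index of the medium. In general
$\mu$ is a function of position ($\mathbf{r}$) and wavelength ($\lambda$). In the photon picture of light, the
wavelength $\lambda$ is related to the momentum $\mathbf{p}$ of the photon by $|\mathbf{p}| = h/\lambda$, where h is
Planck's constant. Hence, one can write
$$H(\mathbf{r},\mathbf{p}) = \frac{c|\mathbf{p}|}{\mu(\mathbf{r},\lambda)} = \frac{cp}{\mu(\mathbf{r},p)} \tag{5.24}$$
Using this Hamiltonian, one can obtain the equations of motion for propagation of light
rays in any optical medium, given by
$$\dot{\mathbf{r}} = \frac{\partial H}{\partial\mathbf{p}} = \frac{c\mathbf{p}}{p\,\mu(\mathbf{r},p)} - \frac{c\mathbf{p}}{\mu^2}\frac{\partial\mu(\mathbf{r},p)}{\partial p} \tag{5.25}$$
$$\dot{\mathbf{p}} = -\frac{\partial H}{\partial\mathbf{r}} = \frac{cp}{\mu^2}\nabla\mu(\mathbf{r},p) \tag{5.26}$$
where $p = |\mathbf{p}|$.

It can be shown that $\dot{\mathbf{r}}$ in Eq. (5.25) corresponds to the group velocity of the wave
packet associated with the photon. Equations (5.26) can be used to derive the trajectory of
any light beam in any dispersive medium characterised by a given $\mu(\mathbf{r},p)$, or equivalently
$\mu(\mathbf{r},\lambda)$.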
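
As an illustration of how these ray equations can be used, the sketch below traces a ray through a stratified medium with $H = cp/\mu$. Everything specific here is an assumption made for the example: the index profile $\mu(z) = 1.5 - 0.1z$ is hypothetical and non-dispersive (so the $\partial\mu/\partial p$ term drops out), units have c = 1, and the integrator is a plain fourth-order Runge-Kutta step. Because $\mu$ depends on z only, $p_x$ and H are conserved, which implies Snell's law in the form $\mu\sin\alpha$ = const, with $\alpha$ the angle between the ray and the z-axis.

```python
import math

def mu(z):
    """Hypothetical non-dispersive index profile, chosen only for illustration."""
    return 1.5 - 0.1 * z

def dmu_dz(z):
    return -0.1

def deriv(state, c=1.0):
    """Right-hand sides of the ray equations for mu = mu(z), with d(mu)/dp = 0."""
    x, z, px, pz = state
    p = math.hypot(px, pz)
    n = mu(z)
    return (c * px / (p * n), c * pz / (p * n), 0.0, c * p * dmu_dz(z) / n**2)

def rk4_step(state, dt):
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))
    k1 = deriv(state)
    k2 = deriv(shifted(state, k1, 0.5 * dt))
    k3 = deriv(shifted(state, k2, 0.5 * dt))
    k4 = deriv(shifted(state, k3, dt))
    return tuple(s + dt * (a + 2 * b + 2 * c_ + d) / 6.0
                 for s, a, b, c_, d in zip(state, k1, k2, k3, k4))

def snell_invariant(state):
    """mu * sin(angle between the ray and the z-axis)."""
    x, z, px, pz = state
    return mu(z) * px / math.hypot(px, pz)

def trace(state, dt=1e-3, steps=2000):
    states = [state]
    for _ in range(steps):
        state = rk4_step(state, dt)
        states.append(state)
    return states

angle = math.radians(30.0)  # launch angle measured from the z-axis
initial = (0.0, 0.0, mu(0.0) * math.sin(angle), mu(0.0) * math.cos(angle))
states = trace(initial)
invariants = [snell_invariant(s) for s in states]
print("mu*sin(alpha) at start and end:", invariants[0], invariants[-1])
```

The ray climbs into a region of lower index and bends away from the z-axis while $\mu\sin\alpha$ stays fixed, which is the behaviour expected of a continuously refracting medium.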


5.7 RELATIVISTIC MASS TENSORS 


The idea of tensors has been introduced in Appendix A3, and the reader is asked to refer 
to it before continuing with the present section. 

In the previous section, we have already derived various forms of the Lagrangian for a
particle moving with relativistic speeds under the action of some external force fields. The
relativistic dynamics can also be studied by covariantly expressing Newton's second law of
motion, that is, by forcing the validity of the Newtonian definition of force, $\mathbf{F} = d\mathbf{p}/dt$,
where $\mathbf{p} = m\mathbf{v}$ is the relativistic momentum of the particle moving with velocity $\mathbf{v}$,
the relativistic mass $m = m_0\gamma$, $m_0$ being the rest mass of the particle, and
$\gamma = (1 - v^2/c^2)^{-1/2}$ is the so-called Lorentz $\gamma$-factor. Hence the ith component of the force
can be expressed as
$$F_i = \frac{d}{dt}(mv_i) = \gamma^3 m_0\left(\frac{\delta_{ij}}{\gamma^2} + \frac{v_iv_j}{c^2}\right)\frac{d^2x_j}{dt^2} = m_{ij}\frac{d^2x_j}{dt^2} \tag{5.27}$$
where the relativistic mass tensor
$$m_{ij} = \gamma^3 m_0\left(\frac{\delta_{ij}}{\gamma^2} + \frac{v_iv_j}{c^2}\right) \tag{5.28}$$
Obviously the direction of the force is no longer parallel to the direction of the acceleration,
because of the presence of the second term in Eq. (5.28) which can be neglected only if
$v_iv_j/c^2$ is small compared to $\gamma^{-2}$.


There are of course certain special cases in which $m_{ij}$ behaves like a scalar and the force
becomes parallel to the acceleration. For example, when the force $\mathbf{F}$ is acting parallel to
the direction of motion of the particle, that is, $\mathbf{F}$ is parallel to $\mathbf{v}$,
$$F_i = \gamma^3 m_0\frac{d^2x_i}{dt^2} = m_l\frac{d^2x_i}{dt^2} \tag{5.29}$$
where $m_l = \gamma^3 m_0$ is called the longitudinal mass of the particle. Again, when the force
$\mathbf{F}$ is acting in a direction normal to the direction of the motion,
$$F_i = \gamma m_0\frac{d^2x_i}{dt^2} = m_t\frac{d^2x_i}{dt^2} \tag{5.30}$$
with $m_t = \gamma m_0$ = the transverse mass of the particle. All it means is that the mass
tensor $m_{ij}$ in the form of Eq. (5.28) is in general not diagonal for an arbitrary
direction of motion with respect to the direction of the applied force. It becomes so only
when one of the principal axes (or the characteristic vectors) is aligned parallel or transverse
to the direction of the force, which produces the diagonal elements $m_l$, $m_t$ and $m_t$, the
transverse components being doubly degenerate. In this way, the mass tensor now can be
reduced to behave as a vector. Finally, one can form a scalar out of the mass tensor by way
of contraction of indices and the mass scalar for the Lorentz frame will turn out to be $m_0$,
the rest mass of the particle.
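
The longitudinal and transverse masses can be read off numerically from the mass tensor. The sketch below assumes the form $m_{ij} = \gamma^3 m_0(\delta_{ij}/\gamma^2 + v_iv_j/c^2)$, units with c = 1, and a velocity of 0.6c along the x-axis; the helper names are hypothetical. It applies the tensor to accelerations parallel, perpendicular and oblique to $\mathbf{v}$.

```python
import math

def mass_tensor(m0, v, c=1.0):
    """m_ij = gamma^3 m0 (delta_ij / gamma^2 + v_i v_j / c^2)."""
    gamma = 1.0 / math.sqrt(1.0 - sum(vi * vi for vi in v) / c**2)
    return [[gamma**3 * m0 * (float(i == j) / gamma**2 + v[i] * v[j] / c**2)
             for j in range(3)] for i in range(3)]

def apply(tensor, a):
    """Force components F_i = m_ij a_j."""
    return [sum(tensor[i][j] * a[j] for j in range(3)) for i in range(3)]

velocity = (0.6, 0.0, 0.0)          # gamma = 1.25 for this speed
tensor = mass_tensor(1.0, velocity)

force_parallel = apply(tensor, [1.0, 0.0, 0.0])    # acceleration along v
force_transverse = apply(tensor, [0.0, 1.0, 0.0])  # acceleration normal to v
force_oblique = apply(tensor, [1.0, 1.0, 0.0])     # acceleration at 45 degrees to v

print("longitudinal mass:", force_parallel[0])
print("transverse mass:  ", force_transverse[1])
print("oblique force:    ", force_oblique)
```

For the oblique case the two force components differ although the acceleration components are equal, so the force is not parallel to the acceleration.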


Now let us formulate a possible form for the mass tensor for any given Lagrangian
$L = L(q_1,\ldots,q_n,\dot q_1,\ldots,\dot q_n,t)$, for a nonrelativistic system. Using the operational identity,
$$\frac{d}{dt} = \frac{\partial}{\partial t} + \dot q_j\frac{\partial}{\partial q_j} + \ddot q_j\frac{\partial}{\partial\dot q_j}$$
the usual Euler-Lagrange's equations of motion, namely,
$$\frac{d}{dt}\left(\frac{\partial L}{\partial\dot q_i}\right) - \frac{\partial L}{\partial q_i} = 0$$
can be reduced to
$$\ddot q_j\left[\frac{\partial^2 L}{\partial\dot q_i\,\partial\dot q_j}\right] = \frac{\partial L}{\partial q_i} \tag{5.31}$$
where we have assumed that $\partial L/\partial\dot q_i$ ($= p_i$) is not an explicit function of $q_1,\ldots,q_n$ and t.
Since Euler-Lagrange's equations of motion can be written as
$$\frac{dp_i}{dt} = \frac{\partial L}{\partial q_i}$$
so that $\partial L/\partial q_i$ can be regarded as the Newtonian equivalent of some kind of generalised
force and $\ddot q_i$ as the corresponding generalised acceleration, an effective mass tensor can be
defined from Eq. (5.31) as
$$m_{ij} = \frac{\partial^2 L}{\partial\dot q_i\,\partial\dot q_j} \tag{5.32}$$


One can now easily see that the Lagrangian for the relativistic particle moving in a
potential field $V(\mathbf{r}, t)$, as given by Eq. (5.16), or the one moving in an electromagnetic field
with the generalised potential given by Eq. (5.17), yields the same result for the mass tensor
as given in Eq. (5.28) (see problem 5.11), provided the definition of $m_{ij}$ is taken to be the
one given above, namely in Eq. (5.32). In fact, for the motion of the relativistic charged
particle having charge e and moving in an electromagnetic field, one can show that
$$m_0\gamma\left(\delta_{ij} + \frac{\gamma^2}{c^2}v_iv_j\right)\frac{d^2r_j}{dt^2} = eE_i + e(\mathbf{v}\times\mathbf{B})_i \tag{5.33}$$
where $\mathbf{E}$ and $\mathbf{B}$ are the electric field and the magnetic induction respectively at the location
of the moving particle as seen by any inertial observer.

The first term on the LHS of Eq. (5.33) is $m_0\gamma\,(d^2r_i/dt^2)$, which is nothing but the
ordinary relativistic force with mass $m = m_0\gamma$. The corresponding acceleration is in the
direction parallel to the forces expressed on the RHS of Eq. (5.33). But the second term
on the LHS of Eq. (5.33) can be rewritten in the vector notation as $m_0\gamma^3(\mathbf{v}\cdot\mathbf{a})\mathbf{v}/c^2$, where
$\mathbf{a} = d^2\mathbf{r}/dt^2$ denotes the acceleration vector. This term can be taken to the RHS of Eq.
(5.33) and interpreted as a relativistic constraint force $\mathbf{f}_c = -m_0\gamma^3(\mathbf{v}\cdot\mathbf{a})\mathbf{v}/c^2$. This force
is antiparallel to $\mathbf{v}$ when the angle between $\mathbf{v}$ and $\mathbf{a}$ is acute, and is parallel to $\mathbf{v}$ otherwise.
Now the angle between $\mathbf{a}$ and $\mathbf{v}$ is acute only when the particle is accelerating forward
or speeding up and in that case the component of $\mathbf{v}$ perpendicular to $\mathbf{a}$ will undergo a
negative acceleration.


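It is also possible to consistently define a mass tensor from a given Hamiltonian
$H = H(q_1,\ldots,q_n,p_1,\ldots,p_n,t)$ in the following way. From Hamilton's equations of motion,
$$\ddot q_i = \frac{d\dot q_i}{dt} = \frac{d}{dt}\left[\frac{\partial H}{\partial p_i}\right] = \frac{\partial^2 H}{\partial p_i\,\partial p_j}\dot p_j + \frac{\partial^2 H}{\partial p_i\,\partial q_j}\dot q_j + \frac{\partial^2 H}{\partial p_i\,\partial t}$$
$$= -\frac{\partial^2 H}{\partial p_i\,\partial p_j}\frac{\partial H}{\partial q_j} + \frac{\partial^2 H}{\partial p_i\,\partial q_j}\frac{\partial H}{\partial p_j} + \frac{\partial^2 H}{\partial p_i\,\partial t} \tag{5.34}$$
From the first term on the RHS, the inverse mass tensor $m_{ij}^{-1}$ can be defined as
$$m_{ij}^{-1} = \frac{\partial^2 H}{\partial p_i\,\partial p_j} \tag{5.35}$$
because, when H does not depend explicitly on time, the system is conservative with the
generalised force $-\partial H/\partial q_j = Q_j$. Here again, we have assumed that $\partial H/\partial q_j$ is not a
function of $p_i$ so that the last two terms on the RHS of Eq. (5.34) vanish. We thus have
$$\ddot q_i = \frac{\partial^2 H}{\partial p_i\,\partial p_j}Q_j = m_{ij}^{-1}Q_j$$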

Furthermore, one can also verify that
$$\frac{\partial^2 L}{\partial\dot q_i\,\partial\dot q_j} = \left[\frac{\partial^2 H}{\partial p_i\,\partial p_j}\right]^{-1} \tag{5.36}$$
provided $m_{ij}$ is itself nonsingular. For example, if we consider the Lagrangian and the
Hamiltonian for a relativistic particle moving in a conservative force field we can easily
verify Eq. (5.36) using Eq. (5.28) and the expression for $m_{ij}^{-1}$ from its definition (Eq.
(5.35)),
$$m_{ij}^{-1} = m_0^{-1}\gamma^{-1}\left(\delta_{ij} - \frac{v_iv_j}{c^2}\right)$$
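
That the two tensors are inverse to each other can be confirmed with a few lines of arithmetic. The sketch below assumes $m_{ij} = m_0\gamma(\delta_{ij} + \gamma^2v_iv_j/c^2)$ and $m_{ij}^{-1} = (m_0\gamma)^{-1}(\delta_{ij} - v_iv_j/c^2)$, units with c = 1, and an arbitrary illustrative velocity; it multiplies the two 3 x 3 matrices and measures the deviation from the unit matrix.

```python
import math

def gamma_factor(v, c=1.0):
    return 1.0 / math.sqrt(1.0 - sum(x * x for x in v) / c**2)

def mass_tensor(m0, v, c=1.0):
    """m_ij = m0 gamma (delta_ij + gamma^2 v_i v_j / c^2)."""
    g = gamma_factor(v, c)
    return [[m0 * g * (float(i == j) + g * g * v[i] * v[j] / c**2)
             for j in range(3)] for i in range(3)]

def inverse_mass_tensor(m0, v, c=1.0):
    """(m^-1)_ij = (delta_ij - v_i v_j / c^2) / (m0 gamma)."""
    g = gamma_factor(v, c)
    return [[(float(i == j) - v[i] * v[j] / c**2) / (m0 * g)
             for j in range(3)] for i in range(3)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

velocity = (0.3, -0.4, 0.5)  # illustrative velocity with |v| < c
product = matmul(mass_tensor(2.0, velocity), inverse_mass_tensor(2.0, velocity))
deviation = max(abs(product[i][j] - float(i == j))
                for i in range(3) for j in range(3))
print("max deviation from the unit matrix:", deviation)
```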


Equation (5.35) is, however, most profitably used in the calculation of the effective mass
of an electron or hole moving inside a solid lattice for which the dispersion relation $\omega(\mathbf{k})$ or
$E(\mathbf{p})$ is known. Taking $p_i = \hbar k_i$ and $H = E$, Eq. (5.35) transforms into
$$m_{ij}^{-1} = \frac{1}{\hbar^2}\frac{\partial^2 E}{\partial k_i\,\partial k_j} \tag{5.37}$$


where k is the wave vector associated with the motion of an electron or hole in the lattice, 
and E is the energy of the same electron or hole as a function of k. 
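
A one-dimensional illustration of this effective-mass formula is sketched below. The band $E(k) = -2t\cos(ka)$ is a hypothetical tight-binding dispersion chosen only as an example, with $\hbar = t = a = 1$; the second derivative is estimated by central differences, so the result carries a small discretisation error.

```python
import math

HBAR = 1.0  # natural units, an illustrative choice

def band_energy(k, hopping=1.0, spacing=1.0):
    """Hypothetical 1-D tight-binding band E(k) = -2 t cos(k a)."""
    return -2.0 * hopping * math.cos(k * spacing)

def effective_mass(energy, k, h=1e-4):
    """1-D form of 1/m* = (1/hbar^2) d^2E/dk^2, using a central difference."""
    curvature = (energy(k + h) - 2.0 * energy(k) + energy(k - h)) / (h * h)
    return HBAR**2 / curvature

mass_band_bottom = effective_mass(band_energy, 0.0)
mass_band_top = effective_mass(band_energy, math.pi)
print("effective mass at k = 0:   ", mass_band_bottom)
print("effective mass at k = pi/a:", mass_band_top)
```

Near the bottom of this band the carrier behaves like a particle of positive mass $\hbar^2/(2ta^2)$, while near the top the curvature, and hence the effective mass, is negative, which is the usual motivation for describing such states as holes.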


5.8 SUMMARY 

Hamilton derived the equations of motion in 1835 in terms of a function called the Hamiltonian
$H = H(q,p,t)$, which can geometrically describe a surface in phase space. If one
draws a normal to this surface $H(q,p,t) = E$ (say, a constant) at a point $(q,p)$ in the
phase space, its projections onto the individual coordinate and momentum axes represent
the time rate of increment of the respective coordinates and momenta. So unlike the case
with the Lagrangian described in configuration space, the evolution at all points of the phase
space is completely specified with the specification of the Hamiltonian.

For conservative systems, this Hamiltonian function represents a constant of motion,
which can readily be identified with the total energy of the system. Moreover, the paths of
evolution in phase space always lie on the surface of the given constant energy.

The description in terms of the Hamiltonian is extremely useful even for quantising any
dynamical system under consideration. The knowledge of the explicit Hamiltonian is required
to construct Schrödinger's equation, as that of the Lagrangian is required for developing
field theory.

The concept of relativistic and electromagnetic mass tensors is introduced in the last 
section, which elucidates some aspects of the motion of relativistic and charged particles that 
are not found in usual text books. Also the construction of the Lagrangian and Hamiltonian 
for such systems in section 5.6 gives an idea of how the same results, namely the same 
equations of motion, can be derived using so many different techniques. While reading this 
chapter, one must keep in mind that Einstein’s summation convention is implied wherever 
it applies, unless stated otherwise. 




PROBLEMS 


5.1 If the Lagrangian $L = L_0 + L_1 + L_2 + \cdots$, where $L_r$ is a homogeneous function
of degree r in $\dot q_i$, with coefficients as any function of $q_i$, prove that

(a) the Hamiltonian is given by

$H = -L_0 + L_2 + 2L_3 + \cdots$

(b) $H = 0$, for $L = L_1$

5.2 Write down the complete set of Hamilton’s equations of motion for a dynamical system 
for which the Lagrangian is given by 

L = ~ V(q) ij = l,...,n 


5.3 If $z_k = \frac{1}{\sqrt{2}}(q_k + ip_k)$ and $z_k^* = \frac{1}{\sqrt{2}}(q_k - ip_k)$, show that Hamilton's equations of
motion can be expressed in the following compact form
$$\frac{dz_k}{dt} + i\frac{\partial H}{\partial z_k^*} = 0$$
where $i = \sqrt{-1}$.


5.4 If a dynamical system is subject to constraints of the form XfSqt = 0 

1,..., k] i = 1,..., n; n > fc), deduce Hmnilton’s equations 


U = 


where X i, are suitably constructed from Xf and \j are Lagrange’s undetermined 
multipliers. 


5.5 The Hamiltonian for a 3-D isotropic harmonic oscillator is given by
$$H = \frac{1}{2}\sum_{i=1}^{3}\left(p_i^2 + \mu^2q_i^2\right)$$
Show that
$$F_1 = q_2p_3 - q_3p_2, \qquad F_2 = q_3p_1 - q_1p_3, \qquad F_3 = q_1p_2 - q_2p_1$$
$$G_1 = \mu q_1\cos(\mu t) - p_1\sin(\mu t)$$
$$G_2 = \mu q_2\cos(\mu t) - p_2\sin(\mu t)$$
$$G_3 = \mu q_3\cos(\mu t) - p_3\sin(\mu t)$$
are the constants of motion.




5.6 For the following Lagrangians, find the corresponding Hamiltonians:

(i) $L(x,\dot x) = \tfrac{1}{2}\dot x^2 - \tfrac{1}{2}\omega^2x^2 - \alpha x^3 + \beta x\dot x^2$, for an anharmonic oscillator,

(ii) $L(\theta,z,\dot\theta,\dot z) = \tfrac{1}{2}m(l^2\dot\theta^2 - 2l\dot\theta\dot z\sin\theta) + mgl\cos\theta + \tfrac{1}{2}m\dot z^2 + mgz$, for a pendulum
$(l,\theta)$ hung from the ceiling of a moving lift, the instantaneous position of the fulcrum
being denoted by $z(t)$,

(iii) $L(q,\dot q,t) = \tfrac{1}{2}G(q,t)\dot q^2 + F(q,t)\dot q - V(q,t)$, for particle motion in resistive
media embedded in a conservative field,

(iv) $L(x,\dot x) = \tfrac{1}{2}m\dot x^2[1 + (df/dx)^2] - mgf(x)$, for a bead of mass m sliding
smoothly along a wire of shape $z = f(x)$, the x-axis and z-axis being respectively
horizontal and vertical.

5.7 Find the Routhian for the following Lagrangians:

(i) $L = \tfrac{1}{2}\mu(\dot r^2 + r^2\dot\theta^2) + GMm/r$; $\mu = mM/(m + M)$, G, M, m are
constants.

(ii) $L = \tfrac{1}{2}I_3(\dot\psi + \dot\phi\cos\theta)^2 + \tfrac{1}{2}I_1(\dot\theta^2 + \dot\phi^2\sin^2\theta) - mgl\cos\theta$;
$I_1$, $I_3$, m, g, l are constants. Find also the effective potentials for the r-motion in
the first case and for the $\theta$-motion in the second case.

5.8 Show that for any closed system, the translational and rotational invariance of the 
Hamiltonian H(r,p) for infinitesimal translation and rotation leads to the conservation 
of the total linear and angular momentum respectively. 

5.9 Set up Hamilton's equations of motion for the following Lagrangians:

(i) $L(\mathbf{r},\mathbf{v}) = -m_0c\sqrt{c^2 - v^2} + e\mathbf{A}\cdot\mathbf{v} - e\phi$

(ii) $L(q,\dot q,t) = m(\dot q^2\sin^2\omega t + q\dot q\,\omega\sin 2\omega t + q^2\omega^2)/2$.

5.10 Find the equation for the trajectory of a light ray through the atmosphere for which the
refractive index (n) decreases linearly with height z as $n(z) = 1 + n_0\exp(-z/H)$
and explain the phenomenon of mirage. Take $n_0$ = 1.000292, H = 8.0 km.

5.11 Using the definition of $m_{ij}$ given by Eq. (5.32) and the Lagrangian for the motion
of a relativistic charged particle, as given in (i) of problem 5.9, show that it satisfies
the equation of motion given by Eq. (5.33).





6 

Principle of Least Action 
and Hamilton’s Principle 


6.0 INTRODUCTION 

In this chapter we are going to introduce a new way of formulating dynamical problems. This 
technique goes by the name of the variational principle. It involves a good deal of variational 
calculus. We know that Leibniz and Newton had independently invented differential calculus 
some time before 1675. The foundation of integral and variational calculus was laid by Jean 
Bernoulli around 1690, and further developed by Euler in 1734 and by Lagrange and Euler 
in 1762. 

Suppose we ask the following question: how do you define a circle? In fact, it can be
defined in many different ways. For example, one of you may define the circle as a locus of
points in a plane equidistant from a given point. When expressed mathematically it may
read as $(x - x_1)^2 + (y - y_1)^2 = a^2$, where $(x_1, y_1)$ is the given point called the centre
of the circle, and a the given distance called the radius of the circle. In polar coordinates
with the origin at the centre, the equation looks very simple, $r = a$. Another one of you
may define it as a locus of a point at which the angle made with two given points in a plane
is always a right angle. If the distance between the two given points is 2a, and one of the
points is chosen to be the origin of a plane polar coordinate system, the equation of the
circle becomes $r = 2a\cos\theta$. One of you may also define a circle as a curve of constant
curvature drawn in a plane. Mathematically speaking, this definition implies the equation
of the circle to be $d\theta/ds = a^{-1}$ or $a\,d^2y/dx^2 = \{1 + (dy/dx)^2\}^{3/2}$. Unlike the first
two, this one is a differential representation. In the same spirit, the variational definition
of a circle would be the one that encloses an area of given magnitude with the smallest
possible arc length drawn in a plane. Again, mathematically speaking, the equation would
be $\delta\oint\sqrt{r^2 + (dr/d\theta)^2}\,d\theta = 0$, with the constraint relation for the area given by $\oint r^2\,d\theta$
= constant. This is known as the problem of isoperimetry proposed by Jean Bernoulli
around 1690. It should be noted that all the four definitions are equivalent.
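
The variational definition lends itself to a quick numerical illustration. The sketch below is not a proof: it only samples one hypothetical family of closed curves, $r(\theta) = a(1 + \epsilon\cos n\theta)$, with a adjusted so that the enclosed area $\tfrac12\oint r^2\,d\theta$ stays equal to $\pi$, and evaluates the arc length $\oint\sqrt{r^2 + (dr/d\theta)^2}\,d\theta$ by a simple periodic sum.

```python
import math

def perimeter_at_fixed_area(eps, n=3, area=math.pi, samples=4000):
    """Arc length of r = a (1 + eps cos(n theta)), with a chosen so the enclosed area is `area`."""
    # Area = (1/2) * integral of r^2 d(theta) = pi a^2 (1 + eps^2 / 2)
    a = math.sqrt(area / (math.pi * (1.0 + 0.5 * eps * eps)))
    h = 2.0 * math.pi / samples
    total = 0.0
    for k in range(samples):
        theta = k * h
        r = a * (1.0 + eps * math.cos(n * theta))
        dr = -a * eps * n * math.sin(n * theta)
        total += math.sqrt(r * r + dr * dr) * h
    return total

for eps in (0.0, 0.05, 0.1):
    print(f"eps = {eps}: perimeter = {perimeter_at_fixed_area(eps):.6f}")
```

Within this family the circle ($\epsilon = 0$) has the shortest perimeter, and the excess grows like $\epsilon^2$, as expected when the first variation vanishes.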

In order to make the analogy complete with our business of dynamics, we can readily
recognise the differential form to represent Newton's second law of motion, or Euler-Lagrange's
equations of motion or even Hamilton's equations of motion. The first two forms
of our example look like algebraic solutions to the differential equations. The motivation for
the search for a possible variational form of the equations of motion came from the work of
Fermat, who in 1657 suggested that given the initial and final points, light follows a path
through any given optical medium in such a way that the sum of the piecewise product of
the refractive index n and the path lengths is an extremum. The question was: for matter
particles, can there be a suitable replacement for the refractive index, so that the sum of
the piecewise product of this replaced quantity and the path length over the entire path
between two fixed points could be an extremum? Maupertuis was the first person who got
an answer to it. However, it was later known that Leibniz also had thought of Maupertuis'
principle a few decades earlier than Maupertuis.

Nevertheless, this problem was so exciting that it involved people like Euler, Lagrange,
Hamilton, Jacobi and Noether, each contributing substantially to the field. Today, Hamilton's
principle plays a key role in field theory, be it classical, quantum mechanical, or
relativistic quantum mechanical. Their starting point is Hamilton's principle.


6.1 PRINCIPLE OF LEAST ACTION 


A French mathematician Pierre de Maupertuis in 1740 enunciated the famous principle of
least action. This can be stated as follows. Consider a particle moving from point 1 to
point 2 in space. Then out of all possible paths between these two fixed points in space, the
actual path traversed by the particle is the one for which the integral called the action,
$$\int_1^2 mv\,ds \tag{6.1}$$
is an extremum, that is, this integral has the largest or the smallest value for the actual
path. This can also be expressed by requiring that for the actual path taken by the particle
the first variation of the above integral should vanish, that is,
$$\delta S = \delta\int_1^2 mv\,ds = 0 \tag{6.2}$$
Here, of course, we impose the condition that a particle's energy is a constant of motion.

This principle was first published as an exact dynamical theorem by Leonhard Euler in
1744, who proved it for a single particle moving in a plane. Finally, Joseph Lagrange
formulated the principle of least action in a form applicable to general cases like many
particle systems (1760-61). Lagrange stated this principle as follows,
$$\delta\left(m_1\int_1^2 v_1\,ds_1 + m_2\int_1^2 v_2\,ds_2 + \cdots\right)_{E = \text{const.}} = 0$$
or
$$\delta\sum_i m_i\int_1^2 v_i\,ds_i = 0 \tag{6.3}$$
Before we proceed with Lagrange's proof we make a study of the basics of calculus of
variation.





A variation of the path from any given path in configuration space is always considered
to be infinitesimal and the variation in the (generalised) coordinates is always taken at
the same instant. At any given instant of time, for every point on any given path, there
exists a corresponding point on a neighbouring path that differs from the original point by an
infinitesimal separation in the generalised coordinates, the two points having the coordinates
$q_i(t)$ and $q_i(t) + \delta q_i(t)$. This $\delta q_i(t)$ is called the variation in the generalised coordinate
$q_i(t)$ at time t. In Fig. 6.1 the solid curve is the curve $q_i$ between $t = t_1$ and $t = t_2$
and the dotted curve is the variation of the above curve. So at any instant t in the range
$t_1 < t < t_2$, the varied path has slightly different coordinates from those of the original
curve. Any change in the coordinate that occurs naturally with time, be it represented by
the solid curve or the dotted curve in Fig. 6.1, is the real change in the coordinate and
is represented by $dq_i$ along the respective path. Figure 6.2 is a magnified view of a small
portion of the two paths shown in Fig. 6.1 so that the paths AB and CD are the segments of
any two varied paths where point C corresponds to the variation of point A at time t and
point D stands for the variation of point B at time $t + dt$. We can now reach point D from
point A via two routes: either by the route ACD or by the route ABD. The coordinates of
the points A, B and C in Fig. 6.2 are as follows

Point A : $(t,\ q_i)$

Point B : $(t + dt,\ q_i + dq_i)$

Point C : $(t,\ q_i + \delta q_i)$

Now point D can have its coordinates as either $\{t + dt,\ (q_i + dq_i) + \delta(q_i + dq_i)\}$ if
we move from B to D, or $\{t + dt,\ (q_i + \delta q_i) + d(q_i + \delta q_i)\}$ if we move from C to
D. But they should physically correspond to the same quantity. Hence, equating these two
expressions we can easily get,
$$\delta(dq_i) = d(\delta q_i) \tag{6.4}$$
Hence $\delta$ and d commute. Again all the variations take place at the same instant t, therefore
the variation of t is identically zero, that is, $\delta t = 0$, by definition for any value of t.

Fig. 6.2 Infinitesimal segments of the real and varied paths

Now
$$\delta(dq_i) = \delta(\dot q_i\,dt) = \delta\dot q_i\,dt + \dot q_i\,\delta(dt) = \delta\dot q_i\,dt$$
Since $\delta(dq_i) = d(\delta q_i)$, we thus have
$$d(\delta q_i) = \delta\dot q_i\,dt$$
or
$$\frac{d}{dt}(\delta q_i) = \delta\dot q_i \tag{6.5}$$
Thus the variation in $\dot q_i$ is related to the variation in $q_i$ by the total time derivative of the
latter.

We must now understand what we mean by the first variation of the integral
$$I = \int_{P_1}^{P_2} F(q_i,\dot q_i,t)\,dt \tag{6.6}$$
Here F is a differentiable function over the n-dimensional configuration space and $P_1$, $P_2$
are two fixed points in the configuration space and the integral is to be carried out along a
path C joining $P_1$ and $P_2$. The parameter t parametrises the path C. We are interested in
the variation of the value of the above integral as the path C is varied slightly, keeping the
end points $P_1$ and $P_2$ fixed and also keeping the values of the parameter t at $P_1$ and $P_2$
fixed (that is, all varied paths start from $P_1$ at the same initial value of t and end at $P_2$
at the same final value of t).

Now choosing one path in the configuration space is equivalent to fixing the functional
dependences $q_i(t)$, $i = 1,\ldots,n$. Since the path passes through $P_1$ and $P_2$, we must have,
$$q_i(t_1) = q_i' \quad\text{and}\quad q_i(t_2) = q_i'' \tag{6.7}$$
where $q_i'$ and $q_i''$ are the coordinates of the points $P_1$ and $P_2$, and $t_1$ and $t_2$ are the values
of parameter t at $P_1$ and $P_2$ respectively, on the given path. In order to account for the
possible variations in path, we make the functions $q_i$ depend on an additional parameter
u such that a given value of u corresponds to a unique path, that is, in general,
$$q_i = q_i(t,u) \qquad i = 1,\ldots,n \tag{6.8}$$
with
$$q_i(t, u = 0) = q_i(t)$$
for a special path characterised by $u = 0$. A neighbouring path or a curve $C(u)$
corresponding to the parameter value u is given by
$$q_i = q_i(t,u) = q_i(t) + u\,\eta_i(t) \tag{6.9}$$
where
$$\eta_i(t) = \left(\frac{\partial q_i}{\partial u}\right)_{u=0}$$
Here we have expanded $q_i(t,u)$ in a Taylor series around $u = 0$ and kept the terms up to
the first order only, since the variation is small. In order that $C(u)$ also pass through $P_1$
and $P_2$, we require,
$$\eta_i(t_1) = \eta_i(t_2) = 0 \tag{6.10}$$
Further, differentiating Eq. (6.9) with respect to t, we get
$$\dot q_i(t,u) = \dot q_i(t) + u\,\dot\eta_i(t) \tag{6.11}$$
where we have put
$$\dot\eta_i(t) = \frac{d\eta_i}{dt}$$
The integral evaluated along the curve $C(u)$ between $P_1$ and $P_2$ is obviously a function of
u so that we should write
$$I(u) = \int_{t_1}^{t_2} F(q_i(t,u),\dot q_i(t,u),t)\,dt$$
or, using Eqs (6.9) and (6.11),
$$I(u) = \int_{t_1}^{t_2} F(q_i(t) + u\eta_i(t),\ \dot q_i(t) + u\dot\eta_i(t),\ t)\,dt \tag{6.12}$$
The integral (6.6) can be denoted as $I(0)$ and hence, expanding $I(u)$ in Taylor's series


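around $u = 0$ we get
$$I(u) = I(0) + u\left(\frac{dI}{du}\right)_{u=0} + O(u^2) \tag{6.13}$$
where $O(u^2)$ represents terms of order of magnitude of $u^2$. This suggests the definition of
$\delta I$, the first order variation in I due to an infinitesimal change in the path characterised
by u, and is given by
$$\delta I = \delta\int_{t_1}^{t_2} F(q_i,\dot q_i,t)\,dt = u\left(\frac{dI}{du}\right)_{u=0} \tag{6.14}$$
Differentiating the RHS of Eq. (6.12) with respect to u (assuming $t_1$, $t_2$ fixed) and then
putting $u = 0$ we get
$$\left(\frac{dI}{du}\right)_{u=0} = \int_{t_1}^{t_2}\left(\frac{\partial F}{\partial q_i}\eta_i + \frac{\partial F}{\partial\dot q_i}\dot\eta_i\right)dt$$
Therefore,
$$\delta I = u\left(\frac{dI}{du}\right)_{u=0} = \int_{t_1}^{t_2} u\left(\frac{\partial F}{\partial q_i}\eta_i + \frac{\partial F}{\partial\dot q_i}\dot\eta_i\right)dt \tag{6.15}$$
Now it is easy to see from Eqs (6.9) and (6.11) that
$$\delta q_i(t) = u\,\eta_i(t) \quad\text{and}\quad \delta\dot q_i(t) = u\,\dot\eta_i(t)$$
giving
$$\delta I = \int_{t_1}^{t_2}\left(\frac{\partial F}{\partial q_i}\delta q_i + \frac{\partial F}{\partial\dot q_i}\delta\dot q_i\right)dt = \int_{t_1}^{t_2}\delta F\,dt \tag{6.16}$$
The quantity $\delta I$ is called the first variation of the integral (6.6). Now consider the variation
of the following integral given by
$$\delta I' = \delta\int_{P_1}^{P_2} F(q,\dot q,t)\,dq = \delta\int_{P_1}^{P_2} F\,\frac{dq}{dt}\,dt$$
Using Eq. (6.16) we get
$$\delta I' = \int_{P_1}^{P_2}\delta(F\dot q)\,dt = \int_{P_1}^{P_2}\delta F\,\dot q\,dt + \int_{P_1}^{P_2} F\,\delta\dot q\,dt$$
$$= \int_{P_1}^{P_2}\delta F\,dq + \int_{P_1}^{P_2} F\,\frac{d}{dt}(\delta q)\,dt = \int_{P_1}^{P_2}\delta F\,dq + \int_{P_1}^{P_2} F\,d(\delta q)$$
$$= \int_{P_1}^{P_2}\delta F\,dq + \int_{P_1}^{P_2} F\,\delta(dq) \tag{6.17}$$
where we have used Eqs (6.5) and (6.4) (in that order).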

The results expressed in Eqs (6.16) and (6.17) can be easily appreciated if we recall the
definition of the integral as the summed products of F and $\Delta t$ or F and $\Delta q$ over the
entire interval under consideration. In that case, the variation of the integral with the fixed
limits of integration will legitimately be transferred to the possible variation of the products
$F\Delta t$ or $F\Delta q$. Since the variation of dt is zero, we have only one term in Eq. (6.16) and
two terms in Eq. (6.17).
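
The meaning of the first variation can be made concrete with a small numerical experiment. In the sketch below every specific choice is an assumption made for illustration: the integrand $F = \tfrac12\dot q^2 - \tfrac12 q^2$, the reference path $q(t) = t^2$ on $0 \le t \le 1$, and the variation shape $\eta(t) = \sin\pi t$, which vanishes at both ends. The code compares the actual change $I(u) - I(0)$ with $\delta I = u\int(\partial F/\partial q\,\eta + \partial F/\partial\dot q\,\dot\eta)\,dt$.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

# Illustrative integrand F(q, qdot, t) and its partial derivatives
def F(q, qdot, t):
    return 0.5 * qdot**2 - 0.5 * q**2

def dF_dq(q, qdot, t):
    return -q

def dF_dqdot(q, qdot, t):
    return qdot

# Reference path and a variation shape that vanishes at both end points
def q(t): return t * t
def qdot(t): return 2.0 * t
def eta(t): return math.sin(math.pi * t)
def etadot(t): return math.pi * math.cos(math.pi * t)

def integral(u):
    """I(u): the integral of F along the varied path q + u*eta."""
    return simpson(lambda t: F(q(t) + u * eta(t), qdot(t) + u * etadot(t), t), 0.0, 1.0)

def first_variation(u):
    """delta I = u * integral of (dF/dq * eta + dF/dqdot * etadot) dt."""
    return u * simpson(lambda t: dF_dq(q(t), qdot(t), t) * eta(t)
                       + dF_dqdot(q(t), qdot(t), t) * etadot(t), 0.0, 1.0)

u = 1e-3
actual_change = integral(u) - integral(0.0)
remainder = actual_change - first_variation(u)
print("I(u) - I(0) =", actual_change)
print("delta I     =", first_variation(u))
print("remainder   =", remainder)
```

The remainder is of order $u^2$, so for small u the first variation captures essentially all of the change. Here $\delta I$ is not zero because $q(t) = t^2$ is not an extremal path for this F.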

We shall use Eqs (6.16) and (6.17) extensively in this chapter. Note that while calculating
$(dI/du)_{u=0}$ we have assumed that the limits of integration (that is, end points of all the
paths) are constants independent of u. Otherwise, according to Leibniz's rule of differentiation
under the integral, derivatives of these limits have to be accounted for. Indeed, as we
shall see in section 6.2, the variations at the end points need not always vanish, in which
case the corresponding differentials appear in the expression for the first variation of the
integral. If the end points of all the varied paths are not the same we have $(\delta q_i)_{\text{initial}} \neq 0$.
Moreover under more general situations the two varied paths need not even start precisely
at the same instant nor may they end at the same instant. The allowance for such variations
in the timings of the initial and final instants (though generally very small and usually
denoted by $(\Delta t)_{\text{initial}}$ and $(\Delta t)_{\text{final}}$) sometimes plays a crucial role in the variational studies.

Now we come back to Lagrange's proof of the principle of least action. This involves the
Cartesian coordinates $x_i$ in spite of the fact that the motions may take place in presence
of constraints, so that the arbitrary variations of all $x_i$ are not strictly allowed, as the $x_i$'s are
not all independent of one another. So only if there are no constraints is the proof given
below mathematically flawless.

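Let $\hat{\mathbf{t}}_i$ be the unit tangent vector to the ith curve representing the path for the ith
particle at the point under consideration. Thus the variation of Lagrange's action for the
whole system of N particles can be written as (as the Einstein summation convention
implies)
$$\delta\left(m_i\int v_i\,ds_i\right) = \delta\left(m_i\int(v_i\hat{\mathbf{t}}_i)\cdot(ds_i\,\hat{\mathbf{t}}_i)\right) = \delta\left(m_i\int\dot{\mathbf{r}}_i\cdot d\mathbf{r}_i\right)$$
$$= m_i\int\delta\dot{\mathbf{r}}_i\cdot d\mathbf{r}_i + m_i\int\dot{\mathbf{r}}_i\cdot\delta(d\mathbf{r}_i)$$
$$= m_i\int\delta\dot{\mathbf{r}}_i\cdot d\mathbf{r}_i + m_i\int\dot{\mathbf{r}}_i\cdot d(\delta\mathbf{r}_i)$$
$$= m_i\int\delta\dot{\mathbf{r}}_i\cdot\dot{\mathbf{r}}_i\,dt + m_i\int d(\dot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i) - m_i\int(\ddot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i)\,dt$$
$$= \int\delta T\,dt + \left(\int\nabla_iV\cdot\delta\mathbf{r}_i\,dt - \int\nabla_iV\cdot\delta\mathbf{r}_i\,dt\right) + \int m_i\,d(\dot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i) - \int m_i(\ddot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i)\,dt$$
$$= \int(\delta T + \delta V)\,dt - \int(m_i\ddot{\mathbf{r}}_i + \nabla_iV)\cdot\delta\mathbf{r}_i\,dt + \int m_i\,d(\dot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i)$$
$$= \int\delta E\,dt + \int m_i\,d(\dot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i) - \int(m_i\ddot{\mathbf{r}}_i - \mathbf{F}_i)\cdot\delta\mathbf{r}_i\,dt \tag{6.18}$$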




Here $\mathbf{F}_i = -\nabla_iV$ is the conservative external force that is applied on the ith particle,
and E is the total energy of the system, $E = T + V$. Now, if

(i) the energy is kept constant for all the varied paths, that is, only those variations are
taken for which total energy E of the system is the same for all the varied paths and is a
constant of motion for each of the paths,

(ii) the end points are fixed, that is, $\delta\mathbf{r}_i = 0$ at both the ends, that is,
$$\int_1^2 m_i\,d(\dot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i) = m_i(\dot{\mathbf{r}}_i\cdot\delta\mathbf{r}_i)\Big|_1^2 = 0$$
and

(iii) D'Alembert's principle is valid, that is,
$$(m_i\ddot{\mathbf{r}}_i - \mathbf{F}_i)\cdot\delta\mathbf{r}_i = 0$$
then all the three terms on RHS of Eq. (6.18) vanish giving
$$\delta\left(m_i\int v_i\,ds_i\right) = 0 \tag{6.19}$$
Again since $m_iv_i\,ds_i = m_i\dot{\mathbf{r}}_i\cdot d\mathbf{r}_i = \mathbf{p}_i\cdot\mathbf{v}_i\,dt$, Eq. (6.19) can also be expressed as
$$\delta\int_1^2(\mathbf{p}_i\cdot\mathbf{v}_i)\,dt = 0$$
In terms of generalised coordinates this means
$$\delta\int_1^2 p_i\dot q_i\,dt = \delta\int_1^2 p_i\,dq_i = 0 \tag{6.20}$$
The principle of least action expressed as in Eq. (6.19) is sometimes called Lagrange's
principle of least action. Since $p_i\dot q_i = 2T$, another equivalent form of Eq. (6.20) is
$$\delta\left(\int_1^2 2T\,dt\right)_{E = \text{const.}} = 0 \tag{6.21}$$
which is sometimes called Jacobi's principle of least action. One more equivalent form that
ensues from Eq. (6.19) is
$$\delta\left(\int_1^2\sqrt{2m_i(E_i - V_i)}\,ds_i\right)_{E = \text{constant}} = 0 \tag{6.22}$$
where $E_i = \tfrac{1}{2}m_iv_i^2 + V_i$ and $V_i$ are the total energy and the potential energy respectively
of the ith particle. Some books refer to this as Jacobi's principle of least action.

Remarks: 

1. The condition (iii) for the applicability of the principle of least action should not be
interpreted as a validity check on Newton's second law of motion as $\mathbf{F}_i$ in Eq. (6.18) do not
contain any constraint forces and hence do not represent the total force. The above forces $\mathbf{F}_i$
are the externally applied forces and are conservative in nature. The validity of D'Alembert's
principle requires that (i) the quantities must be evaluated in the inertial frames only,
and (ii) the system must be D'Alembertian, that is, holonomic or at most a special class
of non-holonomic systems in which the velocity dependence in the constraint relation is
homogeneous. One should, however, remember that Lagrange's proof itself becomes invalid
for any constrained system.

2. The end points do not have the same interpretation for all the principles of least action.
For example, $\delta\int_1^2 p_i\,dq_i = 0$ means that the integration is to be performed over $dq_i$ and
the end points are $(q_i)_i$ and $(q_i)_f$ respectively, whereas for Jacobi's principle of least action
$\delta\int_1^2 2T\,dt = 0$, the integration is to be performed with respect to time and the end point
timings are fixed. Moreover, it is also required that the condition (ii) be satisfied, that is,
the terminal coordinates must be fixed.

3. We also require that $\int_1^2\delta E\,dt = 0$. The easiest way to satisfy this condition is to set
$\delta E = 0$ for the entire route, implying that the system must be conservative.

4. Most of these principles of least action can be expressed as $\delta\int_1^2 mv\,ds = 0$, i.e., the
specification of the integrand requires the knowledge of the momentum p as a function of
E and spatial coordinate $\mathbf{r}$, but not the knowledge of the explicit time dependence of $\mathbf{r}(t)$.
So if one is basically interested in the trajectories of the particles, rather than a complete
solution to the motion, these principles of least action are ideal for such a purpose.


6.2 HAMILTON’S PRINCIPLE 


We have seen that all the principles of least action stated above in different forms have very 
limited applicability. These seem to be far more restricted than D’Alembert’s principle. 
First, one has to do away with the requirement that the system should be conservative. In 
other words, the term $\int_1^2 \delta E\,dt$ in Eq. (6.18) should be expressed in a different way. This
is done in the following way.

Consider a term like $\delta\int_1^2 E\,dt$. Obviously a definite integral $\int_1^2 E\,dt$ should be a function
of the limits of integration. Mathematically speaking, the variation of this integral can, in
general, have two contributions: one coming from a term $\int_1^2 \delta E\,dt$, the variation in
energy between the real and the varied path integrated between the time limits, and the
second, $E_f\Delta_f t - E_a\Delta_a t$, originating from the time variation, if any, between the two paths
at the terminal points. Therefore,

$$\int_1^2 \delta E\,dt = \delta\int_1^2 E\,dt - \left[E_f\Delta_f t - E_a\Delta_a t\right]$$

where $\Delta_f t$ and $\Delta_a t$ are the variations in the limits of integration over t. In the derivation
of Eq. (6.18) we have everywhere used the criterion that all $\delta$ variations are virtual (no
change in real time). Here we have relaxed the condition only for the end points (in time).
Now writing the LHS of Eq. (6.18) as

$$\delta\int_1^2 m_i\mathbf v_i\cdot d\mathbf r_i = \delta\int_1^2 m_i\mathbf v_i\cdot\mathbf v_i\,dt = \delta\int_1^2 2T\,dt$$

and rearranging Eq. (6.18) and using T + V = E, we get

$$\delta\int_1^2 (T - V)\,dt = \int_1^2 m_i\,d(\dot{\mathbf r}_i\cdot\delta\mathbf r_i) - \int_1^2 (m_i\ddot{\mathbf r}_i - \mathbf F_i)\cdot\delta\mathbf r_i\,dt - \left[E_f\Delta_f t - E_a\Delta_a t\right] \tag{6.23}$$

the left-hand side of which is $\delta\int_1^2 L\,dt$, where L is the Lagrangian of the system.

Now we can have $\delta\int_1^2 L\,dt = 0$ from Eq. (6.23) provided the following conditions are
simultaneously satisfied, namely,

(i) all the coordinates of the end points are fixed,

(ii) $\Delta_a t = \Delta_f t = 0$, that is, both the terminal time instants are fixed but now it does
not matter whether energy is varied between the real and the varied paths or not, and 

(iii) D’Alembert’s principle of motion is valid (tentatively); the exact requirement of this 
third condition needs further investigation. 

The above Lagrangian $L = T - V$ can easily be generalised to $L = T - U$,
as we know that U differs from V by a gyroscopic force term which does not do any
work, so its inclusion on both sides of Eq. (6.23) is permissible. Unless given a priori, one
always calculates L from its Cartesian description of velocities which are known functions
of generalised coordinates and velocities. But as soon as one writes L in terms of the
generalised coordinates and velocities, all $q_i$’s and $\dot q_i$’s become independent of one another
for all holonomic bilateral systems, and the use of the variational principle with all $\delta q_i$’s
being mutually independent becomes fully justified. However, since one cannot, in principle,
start with a quantity $\delta\mathbf r_i$ for a constrained system, it is impossible to rigorously prove this
principle.

The above form of the least action principle, namely, 


$$\delta\int_{t_1}^{t_2} L(q,\dot q,t)\,dt = 0 \tag{6.24}$$


is called Hamilton’s principle of least action (or Hamilton’s principle in short) where the 
end points of the paths are fixed both in space and time but the energy at any point of the 
varied path need not be the same as at the corresponding point on the real path. Hamilton 
suggested this principle in 1834. This principle is so important that it is worth stating again 
in words. 

Hamilton’s principle: 

A dynamical system moves from one configuration to another in such a way that the
variation of the integral $\int_{t_1}^{t_2} L\,dt$ (L being the given Lagrangian of the system) between the
actual path taken and any neighbouring virtual path, coterminous in both space and time
with the actual path, is zero, or in other words, $\int_{t_1}^{t_2} L\,dt$ is stationary.

The point to be noted here is that Hamilton introduced it as a principle to be obeyed by 
nature and did not specify the exact domain of its validity. From its inception, this principle 
has assumed an axiomatic status. It is certainly valid for an unconstrained system which 
is obviously D’Alembertian, but whether or not it applies to the general D’Alembertian 
systems is yet to be seen. 
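
The stationarity asserted by Hamilton’s principle can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes a one-dimensional harmonic oscillator with $L = \tfrac12 m\dot x^2 - \tfrac12 kx^2$, the real path $x(t) = \cos\omega t$ between t = 0 and t = T, and coterminous varied paths $x(t) + \epsilon\sin(\pi t/T)$. The values m = k = T = 1, the shape of the variation and the helper names `simpson` and `action` are assumptions made only for this illustration.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for j in range(1, n):
        total += (4 if j % 2 else 2) * f(a + j * h)
    return total * h / 3

def action(eps, m=1.0, k=1.0, T=1.0):
    """Action of L = m*xdot^2/2 - k*x^2/2 along x(t) = cos(w t) + eps*sin(pi t/T)."""
    w = math.sqrt(k / m)

    def lagrangian(t):
        x = math.cos(w * t) + eps * math.sin(math.pi * t / T)
        xdot = -w * math.sin(w * t) + eps * (math.pi / T) * math.cos(math.pi * t / T)
        return 0.5 * m * xdot ** 2 - 0.5 * k * x ** 2

    return simpson(lagrangian, 0.0, T)

S0 = action(0.0)
for eps in (0.1, 0.01):
    first_order = (action(eps) - action(-eps)) / (2 * eps)  # estimate of dS/d(eps) at eps = 0
    second_order = (action(eps) - S0) / eps ** 2            # coefficient of eps^2
    print(eps, first_order, second_order)
```

For these assumed parameters the first-order estimate should vanish to within numerical error, while the second-order coefficient should approach $(\pi^2 - 1)/4$. The action therefore changes only at second order in $\epsilon$ about the real path, which is what "stationary" means here.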

Thus there are basically two working forms of the principle of least action.

1. Lagrange’s principle of least action

$$\delta\int_1^2 p_k\,dq_k = 0 \tag{6.25}$$

which is restricted to the conservative systems only, and

2. Hamilton’s principle

$$\delta\int_{t_1}^{t_2} L(q,\dot q,t)\,dt = 0 \tag{6.24}$$

which is certainly restricted to the D’Alembertian systems. Whether further restrictions are
to be imposed or not is considered in section 6.4.


6.3 COMPARISON BETWEEN FERMAT’S PRINCIPLE OF LEAST ACTION
IN OPTICS AND MAUPERTUIS’ PRINCIPLE OF LEAST ACTION IN 
MECHANICS 

The famous French mathematician Pierre de Fermat (1657) gave his principle of least action 
for the path of light rays, which states simply that 

$$\delta\int \frac{ds}{v} = 0 \tag{6.26}$$

where v is the speed of light at a point in an optical medium and ds is the displacement
measured along the path of the ray at the corresponding point. Since light travels
with a fantastically great speed, one need not bother about its actual motion in time; the 
determination of its trajectory is sufficient, which is what Fermat’s principle offers us. 

Maupertuis’ principle of least action for material particles, on the other hand, requires
the condition

$$\delta\int_1^2 v\,ds = 0 \tag{6.27}$$

to be satisfied if the particle is moving in a conservative force field and if the varied paths
correspond to the same energy of the system as the real path. So this principle can be used
to determine the trajectory of a material particle. Now the most interesting thing to note is
that these two principles of least action have exactly opposite dependence on the speed v
in their integrands. It certainly raises the doubt as to whether or not light can be regarded
as a particle in the true mechanical sense.

In the last quarter of the seventeenth century, the Dutch physicist Christiaan Huygens
and the English physicist Sir Isaac Newton were caught in a debate as to the nature of
light rays: whether light rays are propagating waves or travelling corpuscles. Snell’s law of
refraction of light (given by the Dutch physicist Willebrord Snell in 1621) was found to be
derivable not only from Fermat’s principle and Huygens’ wave theory of light but also from
Newton’s corpuscular theory. Now the question is, how could it happen? In order to satisfy
Snell’s law of refraction, Fermat’s principle suggests that the speed of light has to be lower
in the denser medium and the refractive index of any homogeneous medium $n = (c/v) > 1$,
where c is the speed of light in vacuum and v the speed of light in the refractive medium.
On the other hand, the corpuscular theory of light, following Maupertuis’ principle of least
action, suggested the opposite relation, namely a higher speed of the corpuscular light beam in
the denser medium (see problem 6.6). This debate remained unsettled until Foucault and
Fizeau, around 1850, measured the speed of light in water and found it to be lower than in
air; Newton’s corpuscular theory was abandoned thereafter. So the trajectory of the light rays
follows from Fermat’s principle of least action, not from Maupertuis’.
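
The opposite dependence on v can be made concrete with a small numerical sketch (an illustration under stated assumptions, not the book’s own calculation). A path crosses a plane interface between two regions of constant speed $v_1$ and $v_2$; the crossing point is chosen so as to minimise either $\int ds/v$ (Fermat) or $\int v\,ds$ (Maupertuis). The geometry, the speeds 1.0 and 0.75, and the function names `crossing_point` and `sine_ratio` are all hypothetical choices for this example.

```python
import math

def crossing_point(w1, w2, a=1.0, b=1.0, d=1.0, iters=200):
    """Minimise w1*len1 + w2*len2 over the point x where the path crosses the interface.

    The path runs from (0, a) above the interface y = 0 to (d, -b) below it.
    Ternary search is adequate because the objective is convex in x.
    """
    def cost(x):
        return w1 * math.hypot(x, a) + w2 * math.hypot(d - x, b)

    lo, hi = 0.0, d
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if cost(m1) < cost(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)

def sine_ratio(w1, w2, a=1.0, b=1.0, d=1.0):
    """Return sin(i)/sin(r) for the minimising path."""
    x = crossing_point(w1, w2, a, b, d)
    sin_i = x / math.hypot(x, a)
    sin_r = (d - x) / math.hypot(d - x, b)
    return sin_i / sin_r

v1, v2 = 1.0, 0.75                     # hypothetical speeds in the two media
fermat = sine_ratio(1 / v1, 1 / v2)    # integrand ds/v
maupertuis = sine_ratio(v1, v2)        # integrand v ds
print(fermat, v1 / v2)                 # Fermat: sin i / sin r = v1 / v2
print(maupertuis, v2 / v1)             # Maupertuis: sin i / sin r = v2 / v1
```

Both principles give a law of the Snell form, but with inverted speed ratios, so the same observed bending implies a slower speed in the denser medium under Fermat’s principle and a faster one under Maupertuis’.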


6.4 DERIVATION OF EULER-LAGRANGE EQUATIONS OF MOTION 
FROM HAMILTON’S PRINCIPLE 


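In order to use Hamilton’s principle as expressed in Eq. (6.24), namely,

$$\delta\int_{t_1}^{t_2} L(q,\dot q,t)\,dt = 0$$

where $q$ and $\dot q$ stand for the sets of n generalised coordinates and n generalised
velocities respectively, the only thing one needs is the specification of the Lagrangian of the
system in its explicit form. Thus we construct an n-dimensional configuration space and
varied paths are drawn in it. As the end points of all varied paths are coterminous both in
coordinates and time, we get, as in Eq. (6.17)

$$\delta\int_1^2 L\,dt = \int_1^2 \delta L\,dt + \int_1^2 L\,\delta(dt)$$

The second term on RHS vanishes because $\delta(dt) = 0$ for all points of the varied paths
including the end points. We have,

$$\delta L = \frac{\partial L}{\partial q_i}\delta q_i + \frac{\partial L}{\partial\dot q_i}\delta\dot q_i + \frac{\partial L}{\partial t}\delta t$$

provided all $q_i$, $\dot q_i$ and t are strictly independent arguments, which is true only for a
bilateral holonomic system. Again, the third term on RHS vanishes as $\delta t = 0$ and we can
write

$$\delta L = \frac{\partial L}{\partial q_i}\delta q_i + \frac{\partial L}{\partial\dot q_i}\delta\dot q_i$$

It should be noted here that although $q_i$ and $\dot q_i$ are all independent of one another, $\delta q_i$
and $\delta\dot q_i$ are not, because of the variational identities $\delta\dot q_i = d(\delta q_i)/dt$. Hence even though
$L = L(q,\dot q,t)$, $\delta\int L\,dt$ becomes finally dependent on the $\delta q_i$’s, $t_a$, $t_f$, $q_{ia}$ and $q_{if}$. Thus we
have, on integrating the second term by parts,

$$\delta\int_1^2 L\,dt = \left[\frac{\partial L}{\partial\dot q_i}\delta q_i\right]_1^2 + \int_1^2\left(\frac{\partial L}{\partial q_i} - \frac{d}{dt}\frac{\partial L}{\partial\dot q_i}\right)\delta q_i\,dt$$

The first term on RHS vanishes because $\delta q_i$ at the terminal points 1 and 2 are zero. Thus
Hamilton’s principle requires

$$\int_1^2\left(\frac{\partial L}{\partial q_i} - \frac{d}{dt}\frac{\partial L}{\partial\dot q_i}\right)\delta q_i\,dt = 0$$

Since $q_i$’s are all independent of each other, $\delta q_i$ can be arbitrary and the above equality is
satisfied only if

$$\frac{d}{dt}\left(\frac{\partial L}{\partial\dot q_i}\right) - \frac{\partial L}{\partial q_i} = 0, \qquad i = 1,\ldots,n$$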
which are the well-known Euler-Lagrange’s equations of motion for any bilateral holonomic 
system having no nonpotential forces and n DOF. Obviously the definition of L does not 
and cannot contain any effect of nonpotential forces. Euler had used the above variational 
procedure in order to derive the same equations of motion as were derived by Lagrange 
starting from D’Alembert’s principle. This is the reason why we call them Euler-Lagrange’s 
equations of motion. 
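
A discrete analogue of this derivation may help fix ideas. The sketch below is an illustration under stated assumptions, not the authors’ method: the action of a harmonic oscillator, $L = \tfrac12 m\dot x^2 - \tfrac12 kx^2$, is replaced by a sum over n time steps with the end points held fixed, and the conditions $\partial S/\partial x_j = 0$ for the interior points are solved directly. The parameter values, end points and the name `stationary_path` are hypothetical.

```python
import math

def stationary_path(xa, xb, T, n, m=1.0, k=1.0):
    """Solve dS/dx_j = 0 (j = 1..n-1) for the discretised action of L = m*xdot^2/2 - k*x^2/2.

    With step h = T/n these conditions reduce to the linear recurrence
    x[j+1] = (2 - (k/m) h^2) x[j] - x[j-1], with x[0] = xa and x[n] = xb held fixed.
    """
    h = T / n
    c = 2.0 - (k / m) * h * h

    def march(x1):
        xs = [xa, x1]
        for _ in range(n - 1):
            xs.append(c * xs[-1] - xs[-2])
        return xs

    # The end value depends linearly on x[1], so two trial marches fix the right x[1].
    end0, end1 = march(0.0)[-1], march(1.0)[-1]
    x1 = (xb - end0) / (end1 - end0)
    return march(x1)

xa, xb, T, n = 1.0, 0.5, 1.0, 1000       # hypothetical end points and grid
path = stationary_path(xa, xb, T, n)
w = 1.0                                   # sqrt(k/m) for the default m = k = 1
# Solution of the Euler-Lagrange equation with the same end points, evaluated at t = T/2.
exact_mid = (xa + xb) * math.sin(w * T / 2) / math.sin(w * T)
print(path[n // 2], exact_mid)
```

The discrete stationarity conditions are a finite-difference form of $m\ddot x + kx = 0$, so the mid-point of the discrete path should agree with the continuous solution up to an error of order $h^2$.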

For nonholonomic systems, the $q_i$’s are not all independent of each other and hence the $\delta q_i$’s
cannot be taken as independent variations of the $q_i$’s. So Hamilton’s principle is unusable,
and hence in practice cannot be applied to nonholonomic systems. In this sense Hamilton’s 
principle is more restricted than D’Alembert’s principle for tackling mechanical problems. 
However, the variational principles do not exclude their applicability to systems having 
infinite DOF. All modern field theories, being examples of infinite DOF and spanning almost 
the entire physics, are, in fact, founded upon the versatile use of Hamilton’s principle. This is 
considered to be the greatest advantage of Hamilton’s principle over D’Alembert’s principle. 


6.5 DERIVATION OF HAMILTON’S EQUATIONS OF MOTION FOR 
HOLONOMIC SYSTEMS FROM HAMILTON’S PRINCIPLE 

We start by requiring that

$$\delta\int_1^2 L\,dt = \int_1^2 (\delta L)\,dt = 0$$

The relation between the Lagrangian and the Hamiltonian is given by the usual Legendre
transformation

$$L(q,\dot q,t) = p_i\dot q_i - H(q,p,t)$$

Therefore, in terms of the Lagrangian and the Hamiltonian variables,

$$\begin{aligned}
\delta L &= p_i\,\delta\dot q_i + \dot q_i\,\delta p_i - \frac{\partial H}{\partial q_i}\delta q_i - \frac{\partial H}{\partial p_i}\delta p_i \\
&= \frac{d}{dt}\left(p_i\,\delta q_i\right) - \left(\dot p_i + \frac{\partial H}{\partial q_i}\right)\delta q_i + \left(\dot q_i - \frac{\partial H}{\partial p_i}\right)\delta p_i
\end{aligned}$$

the variations of L now being expressible in terms of $\delta q_i$ and $\delta p_i$ through the Hamiltonian
function. Both can be arbitrary, provided we consider the variation of the paths in a
2n-dimensional phase space and no longer in the n-dimensional configuration space. Through
this, we are now allowing $\delta\dot p_i = d(\delta p_i)/dt$ together with $\delta\dot q_i = d(\delta q_i)/dt$. Thus we have,

$$\delta\int_1^2 L\,dt = \Big[p_i\,\delta q_i\Big]_1^2 - \int_1^2\left(\dot p_i + \frac{\partial H}{\partial q_i}\right)\delta q_i\,dt + \int_1^2\left(\dot q_i - \frac{\partial H}{\partial p_i}\right)\delta p_i\,dt$$

The first term on RHS vanishes since $\delta q_i = 0$ at the terminal points 1 and 2. Since
the system is holonomic and is now described in the phase space, $q_i$’s and $p_i$’s are all
independent, and $\delta q_i$’s and $\delta p_i$’s are arbitrary at all points of the path. It should be noted
that the vanishing of $\delta q_i$ at the terminal points does not imply that $\delta p_i$’s are also zero at
the terminal points. In fact they do not generally vanish. All the above integrals can vanish
only if

$$\dot p_i = -\frac{\partial H}{\partial q_i} \qquad\text{and}\qquad \dot q_i = \frac{\partial H}{\partial p_i}$$

which are Hamilton’s equations of motion. 
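
As a brief illustration of how these first-order equations are used in practice, the sketch below integrates them for a harmonic oscillator with $H = p^2/2m + \tfrac12 kq^2$. It is a minimal example under stated assumptions: the leapfrog (Störmer-Verlet) scheme is one common choice for separable Hamiltonians and is not taken from the text, and the parameter values and the name `leapfrog` are hypothetical.

```python
import math

def leapfrog(q, p, dH_dq, dH_dp, dt, steps):
    """Integrate qdot = dH/dp, pdot = -dH/dq with the leapfrog (Stormer-Verlet) scheme.

    Valid as written for separable Hamiltonians H = T(p) + V(q).
    """
    for _ in range(steps):
        p -= 0.5 * dt * dH_dq(q)   # half kick:  pdot = -dH/dq
        q += dt * dH_dp(p)         # full drift: qdot = +dH/dp
        p -= 0.5 * dt * dH_dq(q)   # half kick
    return q, p

m, k = 1.0, 1.0                    # hypothetical oscillator parameters

def H(q, p):
    return p * p / (2 * m) + 0.5 * k * q * q

steps = 10000
dt = 2 * math.pi * math.sqrt(m / k) / steps   # the run covers one full period
q1, p1 = leapfrog(1.0, 0.0, lambda q: k * q, lambda p: p / m, dt, steps)
print(q1, p1, H(q1, p1) - H(1.0, 0.0))
```

After one period the phase point should return close to its starting values (q, p) = (1, 0), with the value of H essentially unchanged.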


6.6 INVARIANCE OF HAMILTON’S PRINCIPLE UNDER GENERALISED
COORDINATE TRANSFORMATION 


We have already seen earlier that if we add the total time derivative of any function of the 
form $F(q_1,\ldots,q_n,t)$ to the Lagrangian of a holonomic system, Euler-Lagrange’s equations
of motion remain unchanged. This fact can be demonstrated in a more straightforward
manner in the following way. Let

$$L'(q_1,\ldots,q_n;\dot q_1,\ldots,\dot q_n;t) = L(q_1,\ldots,q_n;\dot q_1,\ldots,\dot q_n;t) + \frac{dF(q_1,\ldots,q_n,t)}{dt}$$

Therefore,

$$\delta\int_1^2 L'\,dt = \delta\int_1^2 L\,dt + \delta\Big[F(q_1,\ldots,q_n,t)\Big]_1^2$$

The last term on the RHS vanishes because

$$\delta F = \frac{\partial F}{\partial q_i}\delta q_i + \frac{\partial F}{\partial t}\delta t$$

is zero when evaluated at both the end points. Therefore,

$$\delta\int_1^2 L'\,dt = \delta\int_1^2 L\,dt = 0$$

Thus the Lagrangian $L' = L + dF/dt$ is also subject to Hamilton’s principle and therefore
must lead to the same form of Euler-Lagrange’s equations of motion as L. 
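
The argument above says that the added term contributes only the end-point values of F to the action, whatever the path. A short numerical sketch, under assumptions chosen purely for illustration (a harmonic-oscillator Lagrangian, the gauge function $F = x^2 t$, two arbitrary trial paths with common end points, and the helper names `simpson` and `actions`), makes this visible.

```python
def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for j in range(1, n):
        total += (4 if j % 2 else 2) * f(a + j * h)
    return total * h / 3

def actions(path, velocity, T=1.0, m=1.0, k=1.0):
    """Return (S, S') for L = m*v^2/2 - k*x^2/2 and L' = L + dF/dt with F = x^2 * t."""
    def L(t):
        x, v = path(t), velocity(t)
        return 0.5 * m * v * v - 0.5 * k * x * x

    def L_prime(t):
        x, v = path(t), velocity(t)
        dF_dt = 2 * x * v * t + x * x      # total time derivative of F(x, t) = x^2 t
        return L(t) + dF_dt

    return simpson(L, 0.0, T), simpson(L_prime, 0.0, T)

# Two trial paths sharing the end points x(0) = 0 and x(1) = 1.
S_a, Sp_a = actions(lambda t: t, lambda t: 1.0)
S_b, Sp_b = actions(lambda t: t * t, lambda t: 2 * t)
print(Sp_a - S_a, Sp_b - S_b)   # both should equal F(1, 1) - F(0, 0) = 1
```

Because the difference S' - S is the same constant for every coterminous path, it drops out of the variation, which is why L and L' lead to the same equations of motion.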




Let us consider a general coordinate transformation from a set of independent coordinates
and time $(q_1,\ldots,q_n,t)$ to another independent set $(Q_1,\ldots,Q_n,\tau)$ given by

$$q_i = q_i(Q_1,\ldots,Q_n,\tau) \quad\text{and}\quad t = t(Q_1,\ldots,Q_n,\tau), \qquad i = 1,\ldots,n \tag{6.28}$$

The same old Lagrangian L changes only its form to $\bar L(Q_1,\ldots,Q_n,\tau)$ on mere substitution
of Eq. (6.28) in L. Now, if the total physical content or the message of Hamilton’s principle
is not to change under this transformation (6.28), then we must have

$$\delta\int_1^2 L(q_1,\ldots,q_n,t)\,dt = \delta\int_1^2 \bar L(Q_1,\ldots,Q_n,\tau)\frac{dt}{d\tau}\,d\tau = \delta\int_1^2 L'(Q_1,\ldots,Q_n,\tau)\,d\tau$$

with the result that the new Lagrangian $L'$ must be given by

$$L'(Q_1,\ldots,Q_n,\tau) = \bar L(Q_1,\ldots,Q_n,\tau)\frac{dt}{d\tau} = L(q_1,\ldots,q_n,t)\frac{dt}{d\tau} \tag{6.29}$$

This is the required condition on the Lagrangian for any generalised coordinate and time
transformation, which ought to preserve the form of Euler-Lagrange’s equations of motion.


6.7 HAMILTON’S PRINCIPAL AND CHARACTERISTIC FUNCTIONS 


Let us write Hamilton’s principle for any holonomic bilateral system as

$$\delta W = \delta\int_1^2 L\,dt = 0$$

The first of the above equalities defines a function

$$W = \int_1^2 L\,dt \tag{6.30}$$

The function W as defined above is called Hamilton’s principal function. We have already
seen that Hamilton’s principle is valid if both the terminal points are fixed in space and
time. Hence W must be a function of the sets of generalised coordinates of the initial and
final points (that is, of points 1 and 2) in the n-dimensional configuration space and of the
initial and final instants of time at which the particle was at the initial and final points
respectively. Thus we can write,

$$W = W(q_{1a},\ldots,q_{na};\,q_{1f},\ldots,q_{nf};\,t_a,t_f) \tag{6.31}$$

for any bilateral holonomic system having the number of DOF = n. However, this is also
valid if we are working with a phase space of dimension 2n.


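In terms of generalised coordinates and momenta, we can now evaluate the general
variation of W in the phase space in the following way. First we start with,

$$\begin{aligned}
\delta\int_1^2 p_i\,dq_i &= \int_1^2 \delta p_i\,dq_i + \int_1^2 p_i\,\delta(dq_i) = \int_1^2 \delta p_i\,\dot q_i\,dt + \int_1^2 p_i\,d(\delta q_i) \\
&= \int_1^2 \delta p_i\,\frac{\partial H}{\partial p_i}\,dt + \int_1^2 d(p_i\,\delta q_i) - \int_1^2 \dot p_i\,\delta q_i\,dt \\
&= \int_1^2\left(\frac{\partial H}{\partial p_i}\delta p_i + \frac{\partial H}{\partial q_i}\delta q_i\right)dt + \Big[p_i\,\delta q_i\Big]_a^f \\
&= \int_1^2 \delta H\,dt + \Big[p_i\,\delta q_i\Big]_a^f
\end{aligned}$$

where Hamilton’s equations of motion have been used. Hence

$$\begin{aligned}
\delta W &= \delta\int_1^2 \left[p_i\,dq_i - H\,dt\right] \\
&= \int_1^2 \delta H\,dt - \delta\int_1^2 H\,dt + \Big[p_i\,\delta q_i\Big]_a^f \\
&= -\Big[H\,\Delta t\Big]_a^f + \Big[p_i\,\delta q_i\Big]_a^f \\
&= -H_f\,\Delta_f t + H_a\,\Delta_a t + p_{if}(\delta q_i)_f - p_{ia}(\delta q_i)_a
\end{aligned} \tag{6.32}$$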

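where the suffix f corresponds to the final terminal point and the suffix a corresponds
to the initial terminal point. Here $\Delta_f t$, $\Delta_a t$, $(\delta q_i)_f$ and $(\delta q_i)_a$ for i = 1,...,n can be
arbitrary and hence are independent of each other. Therefore, any arbitrary variation of
W can be fully interpreted using a subset of the phase space. In fact the n-dimensional
configuration space will suffice to describe the complete time evolution of the function W.
Hence, we must have, for $\delta W$, from Eq. (6.31),

$$\delta W = \frac{\partial W}{\partial q_{ia}}(\delta q_i)_a + \frac{\partial W}{\partial q_{if}}(\delta q_i)_f + \frac{\partial W}{\partial t_a}\Delta_a t + \frac{\partial W}{\partial t_f}\Delta_f t \tag{6.33}$$

Comparing term by term for Eqs (6.32) and (6.33), we get,

$$\frac{\partial W}{\partial q_{ia}} = -p_{ia} \tag{6.34a}$$

$$\frac{\partial W}{\partial q_{if}} = p_{if} \tag{6.34b}$$

$$\frac{\partial W}{\partial t_f} + H\left(q_{if},\frac{\partial W}{\partial q_{if}},t_f\right) = 0 \tag{6.34c}$$

$$\frac{\partial W}{\partial t_a} - H\left(q_{ia},-\frac{\partial W}{\partial q_{ia}},t_a\right) = 0 \tag{6.34d}$$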

Thus Hamilton’s principal function can be obtained by solving (simultaneously) the last two 
of the above equations which are the first order partial differential equations, and using the 
first two as supplements. 

Now Hamilton’s characteristic function is defined in a similar way from Lagrange’s
principle of least action, namely, $\delta\int_1^2 p_i\,dq_i = 0$, which is valid if $\delta H = 0$, or H = E = const,
over the varied paths whose end point coordinates are fixed. Hamilton’s characteristic
function is defined as

$$S = \int_1^2 p_i\,dq_i = S(q_{1a},\ldots,q_{na};\,q_{1f},\ldots,q_{nf};\,E)$$

with the differential conditions

$$\frac{\partial S}{\partial q_{ia}} = -p_{ia} \tag{6.35a}$$

$$\frac{\partial S}{\partial q_{if}} = p_{if} \tag{6.35b}$$

and H = E corresponding to the first order partial differential equations

$$H\left(q_{ia},-\frac{\partial S}{\partial q_{ia}}\right) = E \tag{6.36a}$$

and

$$H\left(q_{if},\frac{\partial S}{\partial q_{if}}\right) = E \tag{6.36b}$$

It should be pointed out that the notations S and W are just the opposite in Goldstein’s
book.


It was initially Hamilton who suggested that a pair of the partial differential equations
(with the initial and final sets of coordinates and time as independent variables),
as given by Eqs (6.34c,d) or (6.36a,b), are to be solved for either the W function or the S
function, as the case may be. One has to first obtain the correct complete integral for the
required set of simultaneous partial differential equations, because a complete integral to
the initial-value-related equation will contain n arbitrary nontrivial constants, and similarly
the final-value-related equation will contain another n arbitrary nontrivial constants.
The form of the integral has to be such that the constants of integration exactly take the
place of the initial values $q_{ia}$ and $p_{ia}$, thus duly satisfying the other two sets of equations,
namely, Eqs (6.34a,b) or Eqs (6.35a,b). This is really a very difficult task, making Hamilton’s
prescription the most formidable one, even though the solution, if obtained, would represent
the most complete one indeed.

At this point of impasse, Jacobi (1845) demonstrated two things: first, that for conservative
systems one can choose either the W formalism or the S formalism without sacrificing
any degree of completeness of the solutions thus obtained; and second, that for a given
formalism the two partial differential equations, namely the initial- and the final-value-related
equations, need not be solved simultaneously. Without losing generality, he claimed, it is
sufficient to solve only the final-value-related equations as the equations in the running
variables, so that the final solutions would look like either $S = S(q_i, E)$ or $W = W(q_i, t)$,
as the case may be. The initial-value-related equations are therefore all redundant. Jacobi
further pointed out that there was also no need to look for any special complete integral
with the right type of the constants of motion. In fact, any complete integral with a set of
n + 1 arbitrary constants coupled with the equations

$$\frac{\partial W}{\partial q_i} = p_i \qquad\text{and}\qquad \frac{\partial W}{\partial\alpha_i} = \beta_i$$

was shown to be equivalent to Hamilton’s 2n equations of motion, both solving the problem
equally exactly and completely. Here $\alpha_i$ are the n nontrivial constants of integration that
come with the complete integral for W. The proof and details of Jacobi’s method will be
considered in chapter 10.

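Therefore, we have, for the S formalism:

$$\frac{\partial S}{\partial q_i} = p_i \tag{6.37a}$$

and

$$H\left(q_i,\frac{\partial S}{\partial q_i}\right) = E \tag{6.37b}$$

and for the W formalism:

$$\frac{\partial W}{\partial q_i} = p_i \tag{6.38a}$$

and

$$H\left(q_i,\frac{\partial W}{\partial q_i},t\right) + \frac{\partial W}{\partial t} = 0 \tag{6.38b}$$

The first-order partial differential equations given by Eqs (6.37b) and (6.38b) are respectively
called the time-independent and the time-dependent Hamilton-Jacobi equations.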
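
The relations (6.34a-d) can be checked on the simplest possible case. For a free particle of mass m in one dimension the action along the real (straight) path is $W = m(q_f - q_a)^2/2(t_f - t_a)$; this closed form, the sample end points and the helper names `W` and `partial` are assumptions introduced only for this sketch, which estimates the partial derivatives by central finite differences.

```python
def W(qa, ta, qf, tf, m=1.0):
    """Hamilton's principal function of a free particle: the action along the straight path."""
    return 0.5 * m * (qf - qa) ** 2 / (tf - ta)

def partial(f, args, index, h=1e-5):
    """Central finite-difference estimate of the partial derivative of f in args[index]."""
    up = list(args)
    dn = list(args)
    up[index] += h
    dn[index] -= h
    return (f(*up) - f(*dn)) / (2 * h)

m = 1.0
point = (0.3, 0.0, 2.1, 1.5)                 # hypothetical (qa, ta, qf, tf)
qa, ta, qf, tf = point
p = m * (qf - qa) / (tf - ta)                # constant momentum of the free particle
H = p * p / (2 * m)                          # its (constant) Hamiltonian

residuals = [
    partial(W, point, 0) + p,                # dW/dq_a = -p_a
    partial(W, point, 2) - p,                # dW/dq_f = +p_f
    partial(W, point, 3) + H,                # dW/dt_f + H = 0
    partial(W, point, 1) - H,                # dW/dt_a - H = 0
]
print(residuals)
```

All four residuals should vanish to within the finite-difference error, the third one being the time-dependent Hamilton-Jacobi equation for this W.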


6.8 NOETHER’S THEOREM 


We have already introduced Noether’s theorem in section 2.13. Here we shall give its formal 
statement and the proof. 

Theorem 

If for an infinitesimal transformation of the generalised coordinates $q_i$ of a holonomic
system and of time t of the form

$$q_i' = q_i + \epsilon\,\psi_i(q,t), \qquad t' = t + \epsilon\,\chi(q,t), \qquad \epsilon \to 0 \tag{6.39}$$

Hamilton’s principal function is invariant, that is,

$$\int_{t_1}^{t_2} L\left(q,\frac{dq}{dt},t\right)dt = \int_{t_1'}^{t_2'} L\left(q',\frac{dq'}{dt'},t'\right)dt'$$

then the quantity

$$\frac{\partial L}{\partial\dot q_i}\left(\dot q_i\,\chi - \psi_i\right) - L\,\chi \tag{6.40}$$

is an integral of motion.

Proof 

We know that the variation of Hamilton’s principal function stands for the variation
of the terminal coordinates and time, that is,

$$\delta W = \delta\int_{t_1}^{t_2} L(q,\dot q,t)\,dt = \left[\frac{\partial W}{\partial q_i}\delta q_i + \frac{\partial W}{\partial t}\delta t\right]_1^2$$

If we now interpret $\delta q_i$ and $\delta t$, which are infinitesimal variations or displacements in
the values of the coordinates and time of the original frame, to be effectively equivalent to
the ones generated by the infinitesimal coordinate and time transformations given by Eq.
(6.39), then $\delta q_i = -\epsilon\,\psi_i(q,t)$ and $\delta t = -\epsilon\,\chi(q,t)$ for all points of the path including

the end points. This is called a changeover from an active to a passive viewpoint. Let us 
digress here for a moment in order to clarify this point with an example. 

Suppose a rigid body rotates about an axis represented by the unit vector $\hat{\mathbf n}$ by an
amount $\Delta\theta$ in time $\Delta t$. The position vector of any particle at $\mathbf r$ will now actively change
to $\mathbf r + \Delta\mathbf r = \mathbf r + (\hat{\mathbf n}\times\mathbf r)\Delta\theta$. But the same change in coordinates can be effected by
rotating the coordinate axes about $\hat{\mathbf n}$ by an angle $-\Delta\theta$. The former viewpoint is called
an active viewpoint and the latter a passive one. Note that there will be a change of sign.

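So the requirement of the invariance of W under a passive coordinate and time transformation
given by Eq. (6.39) can be effectively viewed as one under an active displacement in
both coordinates and time, with a change of sign. Thus, $\delta W = 0$ under the transformation
Eq. (6.39) will effectively mean

$$\left[\frac{\partial W}{\partial q_i}\,\epsilon\,\psi_i(q,t) + \frac{\partial W}{\partial t}\,\epsilon\,\chi(q,t)\right]_{t_2} = \left[\frac{\partial W}{\partial q_i}\,\epsilon\,\psi_i(q,t) + \frac{\partial W}{\partial t}\,\epsilon\,\chi(q,t)\right]_{t_1}$$

Now since $\epsilon$ is arbitrary, $p_i = \partial W/\partial q_i$ and $H = -\partial W/\partial t$, we must have, for all points
of the path,

$$p_i\,\psi_i(q,t) - H\,\chi(q,t) = \text{const.}$$

or

$$\frac{\partial L}{\partial\dot q_i}\,\psi_i(q,t) - \left(\frac{\partial L}{\partial\dot q_i}\dot q_i - L\right)\chi(q,t) = \text{const.}$$

or

$$\frac{\partial L}{\partial\dot q_i}\left(\dot q_i\,\chi(q,t) - \psi_i(q,t)\right) - L\,\chi(q,t) = \text{const.}$$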

This theorem is valid for all bilateral holonomic systems. Noether’s theorem in the above
form can further be generalised to the case when the transformation of coordinates and time
changes Hamilton’s principal function in the following way:

$$\int_{t_1}^{t_2} L\left(q,\frac{dq}{dt},t\right)dt = \int_{t_1'}^{t_2'}\left[L\left(q',\frac{dq'}{dt'},t'\right) + \epsilon\,\frac{dF(q',t')}{dt'}\right]dt'$$

that is, when W is invariant, up to a Lagrangian gauge transformation, under the change to a
new set of coordinates and time. Obviously, this would simply add F(q,t) to Eq. (6.40) giving

$$\frac{\partial L}{\partial\dot q_i}\left(\dot q_i\,\chi - \psi_i\right) - L\,\chi + F = \text{const.} \tag{6.41}$$

This is the most general form of Noether’s theorem first given by Emmy Noether in 1918.

Now for any closed system, all the conservation laws due to homogeneity of space and 
time and isotropy of space would follow immediately from Eqs (6.40) and (6.41), as shown 
below. 


(i) For the homogeneity of time, we put $q_i' = q_i$ and $t' = t + \epsilon$ so that $\psi_i = 0$ and
$\chi = 1$, and Eq. (6.40) gives

$$\frac{\partial L}{\partial\dot q_i}\dot q_i - L = \text{const.}$$

This is the law of conservation of energy for a closed system.

(ii) For the homogeneity of space, we use Cartesian coordinates with the infinitesimal
coordinate transformation $x_i' = x_i + \epsilon$ and $t' = t$, that is, $\psi_i = 1$ and $\chi = 0$ for any
particular i, giving

$$\frac{\partial L}{\partial\dot x_i} = \text{const.} \qquad\text{or}\qquad p_i = \text{const.}$$

So the component $p_i$ of the linear momentum corresponding to any Cartesian coordinate
$x_i$ is conserved for a closed system.

(iii) For the isotropy of space, we choose any particular generalised coordinate $q_i$ as the
angle $\theta$ and $q_i' = q_i + \epsilon$, $t' = t$ with $\psi_i = 1$ and $\chi = 0$ giving

$$\frac{\partial L}{\partial\dot q_i} = \frac{\partial L}{\partial\dot\theta} = \text{const.}$$

Thus the angular momentum corresponding to the $\theta$-rotation is constant.

(iv) For invariance under the Galilean transformation, we must use the form given by
Eq. (6.41). Taking $t' = t$, $x' = x - \epsilon t$ so that $\chi = 0$, $\psi = -t$, and the required
$F = -mx$, for a single particle moving in the x-direction, Noether’s theorem in the form
of Eq. (6.41) says that

$$-m\dot x\,t + mx = \text{const.}$$

Actually, for a system of particles,

$$\sum_i m_i x_i - \sum_i p_i\,t = \text{const.}$$

which is known as Galilean translational invariance.
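
The last integral of motion is easy to test numerically. The following sketch is an illustration under stated assumptions: a closed system of two masses on a line joined by a spring, integrated with the velocity-Verlet scheme; the masses, spring constants, initial data and the name `simulate` are hypothetical. It evaluates $\sum m_i x_i - t\sum p_i$ at the start and at the end of the run.

```python
def simulate(steps=20000, dt=1e-3):
    """Velocity-Verlet integration of two masses on a line joined by a spring (a closed system)."""
    m = [1.0, 3.0]                 # hypothetical masses
    x = [0.0, 1.5]                 # initial positions
    v = [0.4, -0.1]                # initial velocities
    k, rest = 2.0, 1.0             # hypothetical spring constant and natural length

    def accel(pos):
        f = k * (pos[1] - pos[0] - rest)      # force on particle 0; particle 1 feels -f
        return [f / m[0], -f / m[1]]

    def boost_charge(t):
        # sum(m_i x_i) - t * sum(p_i): the Galilean integral of motion
        mx = sum(mi * xi for mi, xi in zip(m, x))
        p_total = sum(mi * vi for mi, vi in zip(m, v))
        return mx - t * p_total

    g0 = boost_charge(0.0)
    a = accel(x)
    for _ in range(steps):
        x = [xi + vi * dt + 0.5 * ai * dt * dt for xi, vi, ai in zip(x, v, a)]
        a_new = accel(x)
        v = [vi + 0.5 * (ai + bi) * dt for vi, ai, bi in zip(v, a, a_new)]
        a = a_new
    return g0, boost_charge(steps * dt)

g_start, g_end = simulate()
print(g_start, g_end)
```

Although the individual coordinates and momenta oscillate, the two printed values should coincide to within rounding error, expressing the uniform motion of the centre of mass.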


6.9 LORENTZ INVARIANCE OF HAMILTON’S PRINCIPAL FUNCTION 
FOR THE RELATIVISTIC MOTION OF A FREE PARTICLE 

The special relativistic metric in 4-space is given by

$$ds^2 = c^2dt^2 - dx^2 - dy^2 - dz^2 = c^2dt^2\left(1 - \frac{v^2}{c^2}\right)$$

Therefore,

$$ds = c\,dt\sqrt{1 - \frac{v^2}{c^2}}$$

Define

$$W_{12} = \int_1^2 L\,dt \tag{6.42}$$

and use the expression for the Lagrangian for a free particle given by Eq. (5.15) to get

$$W_{12} = -\int_1^2 m_0c\,ds \tag{6.43}$$

Since $m_0$ and ds are Lorentz invariants and c is a constant, $W_{12}$ for the motion of a free
particle is an invariant quantity.

Now $\delta W = 0$ for the motion of any free particle would imply

$$\delta\int_1^2 ds = 0 \tag{6.44}$$


which is the condition for motion along the ‘straightest’ or geodesic paths, by definition. 
Thus all free particles follow geodesics in the space-time continuum. In fact, it can be shown 
that Euler-Lagrange’s equations of motion for a free particle, described in the configuration 
space (which is now a metric space, by default), are also mathematically equivalent to the 
geodesic equations of motion of the same test particle in the metric space. This is how the 
dynamics and the geometry fused together in the hands of Einstein. 
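
A small numerical sketch can illustrate Eq. (6.44). Under assumptions made only for this example (one space dimension, units with c = 1, a particle at rest as the real world line, neighbouring world lines $x(t) = \epsilon\sin(\pi t/T)$ with the same end events, and the helper name `interval`), the integral $\int ds$ is compared between the straight and the varied world lines.

```python
import math

def interval(eps, T=1.0, n=2000):
    """Integral of ds = sqrt(1 - xdot^2) dt (units with c = 1) along x(t) = eps*sin(pi t/T)."""
    def ds_dt(t):
        xdot = eps * (math.pi / T) * math.cos(math.pi * t / T)
        return math.sqrt(1.0 - xdot * xdot)

    h = T / n
    total = ds_dt(0.0) + ds_dt(T)
    for j in range(1, n):
        total += (4 if j % 2 else 2) * ds_dt(j * h)   # composite Simpson weights
    return total * h / 3

straight = interval(0.0)
for eps in (0.05, 0.1, 0.2):
    print(eps, straight - interval(eps))   # positive, and roughly proportional to eps^2
```

The deficit grows as $\epsilon^2$ for small $\epsilon$, so the first-order variation of $\int ds$ vanishes on the straight world line. In this timelike case the straight line gives the largest interval, and hence the smallest value of $W_{12} = -m_0c\int ds$.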


6.10 SIGNIFICANCE OF HAMILTON’S PRINCIPLE

1. Hamilton’s principle is a novel and powerful technique for solving a wide variety 
of dynamical problems. Unlike all the other techniques, this one does not start with a 
differential equation, rather it starts with an integral which is then optimised against some 
possible variations of the path. The original motivation of Maupertuis was to glorify God’s 
grand design through his action principle. He argued that differential equations, such as 
Newton’s equations of motion (or Euler-Lagrange’s or Hamilton’s equations of motion of 
later days), assign the system under consideration the amount of force or acceleration at 
every instant of time so that the system evolves in time bit by bit, not knowing where exactly 
it will finally arrive. On the other hand, the action principles require the end points be known 
first, then out of all possible paths, nature follows the one for which a particular integral is 
an extremum. Any such explanation of natural events is called a teleological explanation, in 
which a well-defined purpose works behind every perceptible motion or change. 

However, a purely mechanistic explanation for this kind of teleological argument is also
possible. Mathematically speaking, the process of integration is the inverse of that of
differentiation. The very process of taking the ‘variation’ of an integral returns the original
differential equation. It is not surprising that they yield the same result mathematically.
Physically speaking, the principles of classical mechanics are time symmetric and
deterministic. One would have been surprised, had the integral variational techniques not provided
the same results as their differential counterparts.

2. Hamilton’s principle can be used for holonomic systems as well as for systems having 
infinite degrees of freedom. 

3. Once the Lagrangian is correctly formulated, Hamilton’s principle brings out all the 
essential dynamical features of the system, even though the choice of Lagrangian is not 
unique. In chapter 2, we have seen that changing the Lagrangian by a gauge term essentially 
implies changing the energy and momenta while keeping the basic form of Euler-Lagrange’s 
equations of motion intact. Hence, the specification of the Lagrangian ought to convey more 
complete information than merely the differential equations of motion. A dynamical system 
chooses the natural path in such a way that the variations of the kinetic energy integral
$\int_1^2 T\,dt$ and that of the potential energy integral $\int_1^2 V\,dt$ between two different, closely
separated paths are equalised as closely as possible. By no means is this to mean that the
system cannot be conservative. One may recall that for conservative systems T + V = E
is conserved along any path of evolution, but we are considering $\delta\int (T - V)\,dt = 0$ for
two neighbouring paths, the former change being along the path, the latter change being 
across the path. The former may or may not preserve (T + V), whereas the latter always 
preserves (T - V) at any given point on the real path under infinitesimal variations. This 
is the microscopic essence of Hamilton’s principle. 


6.11 SUMMARY

Maupertuis’ principle of least action can be viewed as generating the equations for the 
trajectories of particles, rather than the time evolution of their orbits. This is not surprising, 
because the main inspiration came from Fermat’s principle and from the fact that light rays 
travel so fast that they leave only their tracks behind. The time evolution of the trajectories
becomes irrelevant for light rays. Jacobi’s principle of action also leads to the equations
for the trajectories of particles of matter in phase space. It is only Hamilton’s principle 
that deals with the time evolution of the trajectories and is capable of giving the complete 
information. 

It is shown that Hamilton’s principle can generate Euler-Lagrange’s equations of motion 
and also Hamilton’s equations of motion. In the first case, the varied paths are chosen in 
extended configuration space, but in the second in extended phase space. In both cases, 
only the variations in terminal coordinates (not in the terminal momenta in general) vanish. 

Hamilton’s principal function for fixed initial values of generalised coordinates and time 
represents evolving surfaces in the configuration space. At any instant, the system is located 
at a definite point on this constantly evolving surface. The momentum of this system at the 
given instant points in a direction perpendicular to this surface, and therefore the trajectory 
of the particle always remains perpendicular to the evolving surface of the principal function. 


Copyrighted material 



220 Classical Mechanics 


However, if we plot Hamilton’s principal function in the extended configuration space, it 
becomes a fixed surface. Any two points on this surface are assured of being connected by
at least one real path among all the possible varied ones. This natural path is invariably a 
kind of geodesic described on the surface between the two given points. 


PROBLEMS 


6.1 Using the variational principles, derive the equations for a stable equilibrium
configuration of a uniform rope or a necklace hung between any two given points in the
constant gravity field of the earth.

6.2 Show that the arc of least distance, called the geodesic, between two given points on 
the earth’s surface having the same nonzero geographical latitude always appears to 
be convex towards the nearer geographical pole. Assume the earth to be spherical. 

6.3 A particle of mass m moving in the potential field due to constant gravity, $V(z) = -mgz$,
travels from the point z = 0 to the point $z = z_0$ in time $t_0$. Find the
exact time dependence of the position of the particle, assuming it to be of the form
$z(t) = At^2 + Bt + C$, and determining the constants A, B and C such that
Hamilton’s principle is obeyed.

6.4 For a coordinate and time transformation of the Minkowskian type given by
$x = X\cosh\theta + T\sinh\theta$, $t = X\sinh\theta + T\cosh\theta$, $\theta$ being a constant, show that
the relativistic Lagrangian for a free particle $L = -m_0\sqrt{1 - (dx/dt)^2}$ (c is taken
to be unity) remains invariant.

6.5 Show that for a given Lagrangian of the form

$$L(q,\dot q,t) = \tfrac12 a_{ij}(q)\,\dot q_i\dot q_j - V(q_1,\ldots,q_n)$$

and a time transformation given by $t = \lambda\tau$, with a scaling factor $\lambda$, the invariance
of Hamilton’s principal function $W = \int_1^2 L\,dt$ with respect to $\lambda$ variation leads to
the vanishing of the Hamiltonian of the system.

6.6 Show that Maupertuis’ principle of least action $\delta\int v\,ds = 0$ for any particle of
mass m and energy E passing through a surface of discontinuity that induces an
abrupt change in the potential energy from a constant value $V_1$ to another constant
value $V_2$ leads to instantaneous changes in the speeds across the boundary given
by $v_2 : v_1 = \sin i : \sin r$, where i is the angle of incidence and r the angle of
emergence, both measured with respect to the normal. Derive the result from first
principles also.

6.7 A particle is moving in a plane wave like external field given by the Lagrangian
$L = \tfrac{1}{2}mv^2 - V(\mathbf{r} - \mathbf{u}t)$, $\mathbf{u}$ being the velocity of wave propagation. Use
a transformation that leaves the potential V invariant and show that the ensuing
conservation law is $E - \mathbf{u}\cdot\mathbf{p} = \text{const}$.




6.8 Suppose that a particle is moving in a potential field that is a homogeneous function
of $\mathbf{r}$ and is given by $U(\alpha\mathbf{r}) = \alpha^n U(\mathbf{r})$, $\alpha$ being arbitrary and n the degree of
homogeneity. If action has to remain invariant under transformations $\mathbf{r}' = \beta\mathbf{r}$ and
$t' = \beta^2 t$, show that $\mathbf{p}\cdot\mathbf{r} - 2Et$ is a conserved quantity.

6.9 Take the Schwarzschild metric

$$ ds^2 = c^2\left(1 - \frac{2GM}{rc^2}\right)dt^2 - \frac{dr^2}{1 - (2GM/rc^2)} - r^2\left(d\theta^2 + \sin^2\theta\,d\phi^2\right) $$

around a static body of mass M, and calculate the Lagrangian for the motion of a
test particle of mass m moving in the field of M.

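6.10 Taking the Lagrangian for a free particle in the form

$$ L = \tfrac{1}{2}\,g_{ij}\,\dot x_i \dot x_j $$

apply Hamilton’s principle or straightway use Euler-Lagrange’s equations of motion
to show that $\ddot x_i + C_{ijk}\,\dot x_j \dot x_k = 0$ become the equations of motion, where

$$ C_{ijk} = \frac{1}{2}\,g^{il}\left[\frac{\partial g_{kl}}{\partial x_j} + \frac{\partial g_{jl}}{\partial x_k} - \frac{\partial g_{jk}}{\partial x_l}\right] $$

are the so called Riemann-Christoffel symbols.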





7 

Brachistochrones, Tautochrones 
and the Cycloid Family 


7.0 INTRODUCTION 

About thirty years before Newton’s Principia was published, Fermat proposed his principle 
of least action in optics. It was then not known whether light propagated as waves or as 
streams of particles. About twenty years after Fermat’s proposition, Huygens hypothesised
that light travels as waves, and Fermat’s principle took the form shown in Eq.
(6.26). Obviously, $\delta\int ds/v = \delta\int dt = 0$, suggesting that the paths of light rays are the
quickest possible routes between any two given points. For matter particles, this can be
realised through Hamilton’s principle if and only if the Lagrangian of the matter particle 
is a constant of motion. For natural motions in the presence of external fields, the above 
condition is usually not satisfied, but the motion can be constrained to follow paths of 
widely varying shapes. It then became an interesting problem to find the required shape 
of the constrained path in the field of uniform gravity that would take minimum time to 
cover between two given points. Within ten years of the publication of the Principia, this 
problem came as an open challenge from Jean Bernoulli. One evening, Newton also came 
to know about it and sent the solution to the proposer without signing the reply. Finding 
paths of quickest descent is an interesting topic by itself. We plan to present it in the form 
of a separate chapter, rather than as a section to the previous chapter. 

One comes across the name of Bernoulli so often that one forgets that not all Bernoullis
are the same person. There are essentially three Bernoullis, father, uncle and son, who
were active from the late seventeenth century through the late eighteenth century.
Their first names are not spelt consistently in the existing literature. Jacques (also spelt
as Jacob or Jakob) (1654 - 1705) and Jean (or John or Johannes) (1667 - 1748) are the two 
brothers born in Basle, Switzerland. The elder brother was the discoverer of logarithmic 
spirals, caustics and evolutes, transcendental curves, isoperimetry, infinite series and finite 
sums, the problems of catenary and isochrones, etc. The younger brother is known to be 
the inventor of integral calculus, calculus of variations, the problem of brachistochrones, 
and he defended Leibniz very strongly in his controversial claim of being the true inventor 
of differential calculus. Jean is the father of Daniel (1700 - 1782) who gave us the famous 
Bernoulli equation of motion of perfect fluids. 




7.1 THE ‘CHRONE’ FAMILY OF CURVES 

In 1696, Jean Bernoulli proposed the famous brachistochrone problem (brachisto = shortest;
chrone = time):

Given two points, to find the curve(s) joining them along which a particle starting from 
rest slides under constant gravitational force, in the least possible time. 

The solution, namely a cycloid, was obtained by Jean Bernoulli himself, his elder brother
Jacques Bernoulli, Leibniz, Newton and L’Hospital. In order to tackle this problem more
rigorously the calculus of variations was further developed by Euler and Lagrange.

More generally, a brachistochrone is a curve joining two points along which a particle 
moves under the action of a given conservative force field in the least possible time. 

In 1686, Leibniz posed the problem of tautochrone or isochrone (tauto or iso meaning 
same or identical). This is a curve such that a particle starting from rest takes the same 
time to slide along the curve to a special point on that curve, irrespective of its starting point 
on the curve. 

The tautochrone problem was also solved analytically by Leibniz and Jacob Bernoulli in
1690. However, Huygens had used the idea of tautochronous motion in devising a pendulum in
1657 and gave a geometrical proof of this curve being a cycloid in 1673.

Brachistochrones and tautochrones are found to be identical for only three types of
conservative force fields. One is the constant gravity field near the surface of the earth, the
other two are the attractive and repulsive Hooke’s type of force field. For example, the
attractive Hooke’s type of force field is found inside a homogeneous and spherically
symmetric gravitating body, whereas the repulsive type is the centrifugal force field in a plane
perpendicular to the axis of rotation. In a constant force field the required curve is a cycloid,
whereas it is a hypocycloid for an attractive type of Hooke’s force and an epicycloid for a
repulsive force field, the magnitude of the force being proportional to the distance from a
fixed point.


7.2 BRACHISTOCHRONE FOR UNIFORM FORCE FIELD 


A particle of mass m is released from rest in a uniform field of force $\mathbf{F} = m\mathbf{g}$, $\mathbf{g}$ being a
constant vector. The time taken for the particle to move from point 1 to point 2 (see Fig.
7.1) is given by

$$ t_{12} = \int_1^2 \frac{ds}{v} = \frac{1}{\sqrt{2g}}\int_1^2 \sqrt{\frac{1 + y'^2}{x}}\;dx $$

where $y' = dy/dx$, $ds = \sqrt{dx^2 + dy^2}$ and $v = \sqrt{2gx}$, x being measured in the
downward direction (that is along $\mathbf{g}$) from point 1 and y in the horizontal direction to the
right (see Fig. 7.1). The path is, by definition, a brachistochrone if the variation

$$ \delta t_{12} = 0 \tag{7.1} $$

or

$$ \delta\int_1^2 \sqrt{\frac{1 + y'^2}{x}}\;dx = 0 $$

Fig. 7.1 Possible variational changes in the travel time $t_{12}$ between two
fixed terminal points, marked by 1 and 2


Now, from the Euler-Lagrange theorem we know that for any given function $f(y, dy/dx, x)$,
the variation

$$ \delta\int_1^2 f(y, y', x)\,dx = 0 $$

if and only if the function f satisfies the following differential equation

$$ \frac{d}{dx}\left(\frac{\partial f}{\partial y'}\right) - \frac{\partial f}{\partial y} = 0 $$

Here $f(y, y', x)$ replaces the Lagrangian $L(q, \dot q, t)$, x replaces the time t and y behaves as
a coordinate and hence $y' = dy/dx$ is a velocity component in the usual Euler-Lagrange
equation of motion.

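In the present case,

$$ f(y, y', x) = \sqrt{\frac{1 + y'^2}{x}} $$

disregarding the constant factor $1/\sqrt{2g}$. We thus have

$$ \frac{\partial f}{\partial y} = 0 \quad\text{and}\quad \frac{\partial f}{\partial y'} = \frac{y'}{\sqrt{x(1 + y'^2)}} $$

Therefore the Euler-Lagrange condition reduces to

$$ \frac{d}{dx}\left(\frac{\partial f}{\partial y'}\right) = 0 $$

or

$$ \frac{\partial f}{\partial y'} = \frac{y'}{\sqrt{x(1 + y'^2)}} = \text{constant (independent of } x) = \frac{1}{\sqrt{2a}} $$

where a is a constant. Therefore,

$$ y = \int \sqrt{\frac{x}{2a - x}}\;dx $$

Substituting

$$ x = 2a\sin^2\frac{\theta}{2} $$

we get

$$ y = a(\theta - \sin\theta) $$

Therefore, a brachistochrone in a constant gravity field is given by the following
parametric equations:

$$ x = a(1 - \cos\theta) $$

and

$$ y = a(\theta - \sin\theta) \tag{7.2} $$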


The function (7.2) is displayed in Fig. 7.2. It is called a cycloid. A cycloid is the path
traced by a point on the circumference of a disc rolling without slipping along a line.
The cycloid given in Fig. 7.2 has the cusps pointing upwards and touching the y-axis at an
interval of $2\pi a$. In this case the disc has to roll upside down on the y-axis which is, here, a
horizontal line marked on a ceiling-like plane.
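As a rough numerical illustration of the result (7.2), the following sketch compares the travel time along one half-arch of the cycloid with the time along a straight frictionless incline joining the same end points. The function names, the choice $a = 1$, $g = 9.81$ and the midpoint-rule quadrature are assumptions made here for illustration only; they are not part of the text.

```python
import math

def cycloid_time(theta_end, a=1.0, g=9.81, n=20000):
    """Time from the cusp (theta = 0) to theta_end along
    x = a(1 - cos theta), y = a(theta - sin theta), x measured downward.
    Evaluates t = integral of ds / v with v = sqrt(2 g x) by the midpoint rule."""
    h = theta_end / n
    total = 0.0
    for k in range(n):
        th = (k + 0.5) * h                       # midpoint avoids the 0/0 at the cusp
        ds = 2.0 * a * math.sin(th / 2.0) * h    # ds = 2a sin(theta/2) d(theta)
        v = math.sqrt(2.0 * g * a * (1.0 - math.cos(th)))
        total += ds / v
    return total

def straight_line_time(x_end, y_end, g=9.81):
    """Time to slide from rest down a straight incline from (0, 0) to (x_end, y_end),
    x being the vertical drop."""
    length = math.hypot(x_end, y_end)
    accel = g * x_end / length                   # component of g along the incline
    return math.sqrt(2.0 * length / accel)

a, g = 1.0, 9.81
theta_end = math.pi                              # lowest point of the arch
x_end = a * (1.0 - math.cos(theta_end))
y_end = a * (theta_end - math.sin(theta_end))

t_cyc = cycloid_time(theta_end, a, g)
t_line = straight_line_time(x_end, y_end, g)
print("cycloid:", t_cyc, "straight line:", t_line)
```

For this particular end point the cycloid time should agree with $\pi\sqrt{a/g}$ and the straight-line time with $\sqrt{4 + \pi^2}\,\sqrt{a/g}$, so the cycloid is the quicker of the two.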


7.3 CYCLOID AS A TAUTOCHRONE 

We now show that a cycloidal slide track (sliding without friction) in a constant gravity field
corresponds to a tautochrone provided the special point is chosen so as to satisfy $y' = 0$, y
being the measure of the ordinate. This is any one of the bottommost points of the cycloid,
symmetrically situated between any two consecutive cusps of the cycloid. Let us now shift
the origin of the previous cycloid (the one shown in Fig. 7.2) to the special point chosen
above and define x as the horizontal axis and y as the vertically upward axis. The resulting
cycloid (shown in Fig. 7.3) is described by the functions

$$ x = a(\theta + \sin\theta) $$

and

$$ y = a(1 - \cos\theta) \tag{7.3} $$

where $\theta$ is the amount of roll of the generating disc of radius a.

Fig. 7.2 A circle rolling, without slipping, on a straight line acts as the
generator of a cycloid


Let the coordinate of any point P on the track be described by the arclength s measured
from the origin and the tangent at P make an angle $\beta$ with the x axis. We have, for the
differential arclength,

$$ ds = \sqrt{dx^2 + dy^2} = 2a\cos\frac{\theta}{2}\;d\theta $$

or

$$ s = \int ds = 4a\sin\frac{\theta}{2} \tag{7.4} $$

Fig. 7.3 Change in the angle of slope of the cycloid ($\beta$) is related to the
angle of rotation ($\theta$) of the generating circle, in the ratio 1 : 2

Again, by definition of the slope of the track at the point P (see Fig. 7.3)

$$ \tan\beta = \frac{dy}{dx} = \tan\frac{\theta}{2} $$

Therefore, we have

$$ \beta = \frac{\theta}{2} $$

implying that the tangent at any point on the track makes an angle $\theta/2$ with the x axis,
where $\theta$ is the angle of rotation of the generating disc, due to which the reference point has
moved from the origin O to the present location P.

Now the equation of motion of any particle running down the track can be obtained from
the knowledge of the acceleration of the particle at any arbitrary point of the track, say at
P, which is given by

$$ \ddot s = -g\sin\beta = -g\sin\frac{\theta}{2} = -\frac{g}{4a}\,s \qquad \text{by Eq. (7.4)} $$

or

$$ \ddot s + \frac{g}{4a}\,s = 0 $$

which represents a simple harmonic motion in s with a period

$$ T = 2\pi\sqrt{\frac{4a}{g}} \tag{7.5} $$




We know that the period of a simple harmonic motion is independent of its amplitude.
Hence the time taken by a particle to slide (from rest) from any point on the track to its
bottommost point must be independent of the location of the starting point. This time is
given by

$$ \frac{T}{4} = \pi\sqrt{\frac{a}{g}} $$

Hence any cycloidal track, oriented in the above mentioned manner, becomes a
tautochrone for the constrained motion in a constant force field.
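The tautochrone property can be checked numerically without assuming the simple harmonic form, by integrating $dt = ds/v$ directly along the track $x = a(\theta + \sin\theta)$, $y = a(1 - \cos\theta)$ with $v = \sqrt{2g(y_0 - y)}$ from energy conservation. The sketch below is only an illustration: the function name, the substitution $\theta = \theta_0(1 - u^2)$ (used to tame the integrable singularity at the starting point) and the parameter values are choices made here, not taken from the text.

```python
import math

def descent_time(theta0, a=1.0, g=9.81, n=4000):
    """Time to slide from rest at parameter theta0 (0 < theta0 < pi) to the
    bottom (theta = 0) of the cycloid x = a(theta + sin theta), y = a(1 - cos theta)."""
    h = 1.0 / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * h                        # midpoint rule in u on (0, 1)
        theta = theta0 * (1.0 - u * u)           # theta runs from theta0 down to 0
        dtheta = 2.0 * theta0 * u * h            # |d(theta)| for this step
        ds = 2.0 * a * math.cos(theta / 2.0) * dtheta
        # height already lost: y0 - y = a(cos theta - cos theta0),
        # written as a product of sines to avoid cancellation near the start
        drop = 2.0 * a * math.sin((theta0 + theta) / 2.0) * math.sin((theta0 - theta) / 2.0)
        total += ds / math.sqrt(2.0 * g * drop)
    return total

for theta0 in (0.2, 1.0, 2.0, 3.0):
    print(theta0, descent_time(theta0))
```

Each starting point should give the same descent time, in agreement with $\pi\sqrt{a/g}$, one quarter of the period (7.5).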

This is obviously an important result. We know that a simple pendulum describes a
circular arc rather than a cycloidal track. So its period of oscillation is amplitude-dependent,
although the dependence is very weak for smaller amplitudes. Now if the bob of a simple
pendulum could be made to follow a cycloidal track about its bottommost point O, which
must be situated at a point equidistant from the vertically oriented cusps as shown in
Fig. 7.2, rather than a circular track, its motion will be perfectly tautochronous (under
the constant gravity field of the earth). The radius of curvature of the cycloid near the
point O is 4a, hence one understands why the complete period of oscillation for the above
tautochronous motion has an equivalent suspension length of 4a, as obtained in Eq. (7.5).

It was Galileo who, while seated at a dinner party sometime in 1583, observed that the
lamp hanging from the ceiling oscillates with a period independent of the amplitude of its
oscillation, provided the amplitude was not too large. Later in 1657, Christiaan Huygens
deduced from geometrical arguments that the required curve for a truly tautochronous
motion of a pendulum is a cycloid. But then the question was how to make a pendulum
describe a cycloid rather than a circular arc. Fortunately, the trick lies in a definite property
of the cycloids. It is known that the involute of a cycloid is another cycloid and we know
that generally an involute is generated by the free end of a taut thread wrapped against
its evolute curve (see problem number 1.39 and Appendix A2). So an idealised suspension
thread of a simple pendulum can be wrapped and unwrapped against cheeks of appropriate
shape. Actually the point of suspension is chosen to be any one cusp of an evolute cycloid
with the cusp pointing upward. The other end of the thread that holds the heavy bob keeps
the thread always straight and describes the involute of the curve defined by the cheek. In
this case both are cycloids, and hence such a pendulum will execute perfectly tautochronous
oscillations at all amplitudes.

Given a curve, it can be used as an evolute in order to generate its involute, which again 
can be used as an evolute in order to generate its own involute. This process leads to a 
family of curves called tesserals. Obviously, cycloids belong to a family of tesserals. 


7.4 BRACHISTOCHRONE FOR SPHERICALLY SYMMETRIC
POTENTIAL FIELD V(r)

Consider a region of space in which there acts a spherically symmetric potential V(r). We
give here a formulation (solution) of the brachistochrone problem for a particle moving
under such a general central force.




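The equation for the brachistochrone is generated by the variational equation

$$ \delta\int \frac{ds}{v} = \delta\int \sqrt{\frac{dr^2 + r^2\,d\theta^2}{2[E - V(r)]}} = \delta\int \sqrt{\frac{r'^2 + r^2}{2[E - V(r)]}}\;d\theta = 0 $$

where $r' = dr/d\theta$, E = total energy per unit mass, V(r) being the potential. Having
defined

$$ f(r, r', \theta) = \sqrt{\frac{r'^2 + r^2}{E - V(r)}} $$

the above variational equation would now hold only if it satisfies the Euler-Lagrange
equations in f, given by

$$ r'' - \left(\frac{2}{r} + \frac{V'}{2(E - V)}\right) r'^2 - r - \frac{V' r^2}{2(E - V)} = 0 \tag{7.6} $$

where $r'' = d^2r/d\theta^2$ and $V' = dV/dr$.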


Equation (7.6) is a second order differential equation in $\theta$ and not very easy to solve
for the equation of the brachistochrone $r = r(\theta)$. However, we know that the
Euler-Lagrange equation must lead to an energy like integral of motion called the Jacobi integral,
if $f(r, r', \theta)$ is not an explicit function of $\theta$, which means that in the present case such an
integral exists in the form of

$$ r'\,\frac{\partial f}{\partial r'} - f = \text{const.} $$

or

$$ -\frac{r^2}{\sqrt{(r'^2 + r^2)\,[E - V(r)]}} = \text{const.} $$

This integral being a function of the first order differential coefficients readily corresponds
to

$$ r'^2 + r^2 = \frac{K r^4}{E - V(r)} \tag{7.7} $$

where

$$ K = \frac{E - V(r_0)}{r_0^2} = \text{const.} $$

$r_0$ being the distance from the centre of the potential to the nearest (in some cases, farthest)
point on the trajectory at which point $r' = dr/d\theta = 0$. Being a first order differential
equation, Eq. (7.7) is easy to solve for the brachistochrones in any given spherically
symmetric potential V(r), by the method of quadratures, for example.




7.5 BRACHISTOCHRONES AND TAUTOCHRONES INSIDE A GRAVITATING
HOMOGENEOUS SPHERE


Let us now analyse the nature of brachistochrones and tautochrones for a special case of
the central forces, namely for the one under the action of the gravitational potential inside
a homogeneous sphere of radius R and mass M, the gravitational potential at any distance
r < R from the centre of the sphere being given by

$$ V(r) = -\frac{GM}{2R^3}\,(3R^2 - r^2), \qquad r < R $$

If a particle starts from rest from any point on the surface of the sphere, its specific energy
is $E = V(R) = -GM/R$. Therefore, at any point on the curve which lies totally inside
the sphere, the energy condition is,

$$ E - V(r) = \frac{GM}{2R^3}\,(R^2 - r^2) $$

Thus the equation for the brachistochrone inside a gravitating homogeneous sphere will
have the form given by Eq. (7.7),

$$ \frac{(R^2 - r^2)(r'^2 + r^2)}{r^4} = \frac{2KR^3}{GM} = \frac{R^2 - r_0^2}{r_0^2} $$

and as before $r_0$ refers to that point on the track for which $r' = dr/d\theta = 0$. Now

$$ \frac{R^2 - r^2}{r^2}\left[1 + \frac{r'^2}{r^2}\right] = \text{const.} \tag{7.8} $$

is the required equation for the brachistochrone, defining the track of the brachistochrone
if the particle starts from rest on the surface of a homogeneous sphere of radius R.


Integration of Eq. (7.8) gives the equation of the track as that of a hypocycloid which
can be generated by the locus of a point fixed on a smaller circle of radius $(R - r_0)/2$,
which is made to roll without slipping in the same plane but on the inside of a firmly fixed
bigger circle of radius R (see Fig. 7.4). If the ratio $R/r_0$ is irrational, the repeating arches
of the hypocycloid never close on themselves.


If the above procedure is taken to be the formal geometrical definition of a hypocycloid,
we have R as the radius of the fixed circle, and define $a = (R - r_0)/2$ to be the radius
of the rolling circle, a parameter $\alpha = a/R < 1$, and an angle $\theta$ between the x axis
and the line joining the centres of the fixed and the rolling circle, so that the parametric
equation of the hypocycloid is given by

$$ x(\theta) = R\left[(1 - \alpha)\cos\theta + \alpha\cos\left(\frac{1 - \alpha}{\alpha}\,\theta\right)\right] $$

$$ y(\theta) = R\left[(1 - \alpha)\sin\theta - \alpha\sin\left(\frac{1 - \alpha}{\alpha}\,\theta\right)\right] \tag{7.9} $$

Fig. 7.4 A circle rolling on the inside of a bigger circle generates a hypocycloid


Here (x, y) are the Cartesian coordinates of the locus, measured from the centre of the
fixed circle. Also note that in terms of plane polar coordinates the centre of the
rolling circle is at the radial distance $R - a$ and the polar angle $\theta$. The first terms in Eq. (7.9)
correspond to the coordinates of the centre of the rolling circle, and the second terms are the
coordinates of a fixed point on the rim of the rolling circle with respect to the centre of the
rolling circle. By the time the centre of the rolling circle rotates by an angle $\theta$ the rolling
circle itself rotates in its own frame by an angle $R\theta/a = \theta/\alpha = \phi$, say, but with respect to
an outside inertial frame the amount of rotation is only $\phi - \theta$, hence the odd argument in
the second terms of (7.9).
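A quick numerical sanity check that the parametric curve (7.9) is compatible with the brachistochrone condition (7.8) can be sketched as follows. Here $r'$ in (7.8) is the derivative with respect to the polar angle of the moving point, which is not the same as the roll parameter $\theta$ of (7.9), so the code uses the identities $r'^2 + r^2 = (ds/d\varphi)^2$ and $r^2\,d\varphi = x\,dy - y\,dx$. The function names, the step size of the finite differences and the sample values $R = 1$, $\alpha = 0.2$ are illustrative assumptions.

```python
import math

def hypocycloid(theta, R, alpha):
    """Point of the hypocycloid of Eq. (7.9) for roll parameter theta."""
    psi = (1.0 - alpha) * theta / alpha
    x = R * ((1.0 - alpha) * math.cos(theta) + alpha * math.cos(psi))
    y = R * ((1.0 - alpha) * math.sin(theta) - alpha * math.sin(psi))
    return x, y

def brachistochrone_invariant(theta, R, alpha, h=1e-6):
    """Left-hand side of Eq. (7.8), (R^2 - r^2)(r'^2 + r^2)/r^4, evaluated on the
    curve using central differences in the parameter theta."""
    x, y = hypocycloid(theta, R, alpha)
    xp, yp = hypocycloid(theta + h, R, alpha)
    xm, ym = hypocycloid(theta - h, R, alpha)
    dx = (xp - xm) / (2.0 * h)
    dy = (yp - ym) / (2.0 * h)
    r2 = x * x + y * y
    speed2 = dx * dx + dy * dy          # (ds/dtheta)^2
    areal = x * dy - y * dx             # r^2 d(phi)/d(theta)
    return (R * R - r2) * speed2 / (areal * areal)

R, alpha = 1.0, 0.2
r0 = R * (1.0 - 2.0 * alpha)            # closest approach, since a = (R - r0)/2
expected = (R * R - r0 * r0) / (r0 * r0)
values = [brachistochrone_invariant(t, R, alpha) for t in (0.2, 0.5, math.pi * alpha, 1.0)]
print(expected, values)
```

The sampled values should all coincide with $(R^2 - r_0^2)/r_0^2$; points exactly at the cusps are avoided because the expression becomes 0/0 there.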

However we shall use here another form of the hypocycloid equation given in terms of the
radial coordinate r (measured from the centre of the fixed circle to the point on the path)
and the arclength s along the path (measured from the point of closest approach to the
centre of the fixed circle):

$$ r^2 = \frac{s^2}{4\alpha(1 - \alpha)} + r_0^2 \tag{7.10} $$

The speed of the particle starting from rest on the boundary of the fixed circle is given
by

$$ \dot s^2 = \frac{g}{R}\,(R^2 - r^2) \tag{7.11} $$

Replacing $r^2$ in Eq. (7.11) by Eq. (7.10) and differentiating Eq. (7.11) with respect to
time one gets,

$$ \ddot s + \frac{g\,s}{4\alpha(1 - \alpha)R} = 0 \tag{7.12} $$

This again, being the equation of a simple harmonic motion, represents a tautochronous
motion executed on the above hypocycloid, where g is the value of the acceleration due to
gravity on the surface of a nonrotating homogeneous sphere of radius R.

Like the cycloids, hypocycloids also belong to a family of tesserals, that is, the curves of
the same family serving as both evolutes and involutes. If someday our technology permits us
to run a locomotive service along a hypocycloid tunnel joining any two points on the surface
of the earth, it will then make us save the maximum possible time for shuttling between
these two points at the minimum expense of fuel. When compared with straight line tunnels
between any two points on the surface of the earth, the saving of time may sometimes be
by a factor of 1.5 or more (see problem 7.4). It should be further noted that motion along
a straight tunnel is also tautochronous, but such a track is not a brachistochrone. Hence
not all tautochrones are brachistochrones, and brachistochrones are tautochrones only
under the three special types of force fields mentioned in Section 7.1.
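The time saving quoted above can be estimated from Eq. (7.12). The sketch below rests on two assumptions that are not spelled out in the text: the cusp-to-cusp transit is taken as half the period of the simple harmonic motion (7.12), with $\alpha = \Theta/2\pi$ for two surface points separated by the central angle $\Theta$ (one arch spans an arc $2\pi a$ of the fixed circle), and the transit time through a straight tunnel is taken as the standard result $\pi\sqrt{R/g}$. The function names and the earth values used are illustrative.

```python
import math

def hypocycloid_transit_time(separation, R, g):
    """Cusp-to-cusp time along a hypocycloidal tunnel joining two surface points
    separated by the central angle `separation` (radians): half the period
    of the simple harmonic motion in Eq. (7.12)."""
    alpha = separation / (2.0 * math.pi)     # arch spans an arc 2*pi*a = R*separation
    return math.pi * math.sqrt(4.0 * alpha * (1.0 - alpha) * R / g)

def straight_transit_time(R, g):
    """End-to-end time through a straight tunnel (standard result, assumed here):
    half the period of a simple harmonic motion with omega^2 = g/R."""
    return math.pi * math.sqrt(R / g)

R_earth, g_surface = 6.371e6, 9.81           # metres, m/s^2 (illustrative values)
for degrees in (10, 40, 90, 180):
    sep = math.radians(degrees)
    t_hypo = hypocycloid_transit_time(sep, R_earth, g_surface)
    t_line = straight_transit_time(R_earth, g_surface)
    print(degrees, t_hypo / 60.0, t_line / 60.0, t_line / t_hypo)
```

Under these assumptions the ratio of the two times is $1/\sqrt{4\alpha(1 - \alpha)}$, which exceeds 1.5 for sufficiently close end points and tends to unity for antipodal points, where the hypocycloid degenerates into the diameter.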


7.6 TAUTOCHRONOUS MOTION IN A CENTRIFUGAL FORCE FIELD
AND EPICYCLOIDS

In this case also, the brachistochrones and tautochrones are identical and describe an
epicycloid generated by the rolling of a circle of radius a along the outside of a fixed circle of
radius R, that is concentric with the axis of rotation, both the circles lying in the same
plane perpendicular to the axis of rotation (see Fig. 7.5).

The equations are similar except for some changes in the +/- signs. For example, the
parametric equation for an epicycloid would be

$$ x(\theta) = R\left[(1 + \alpha)\cos\theta - \alpha\cos\left(\frac{1 + \alpha}{\alpha}\,\theta\right)\right] $$

$$ y(\theta) = R\left[(1 + \alpha)\sin\theta - \alpha\sin\left(\frac{1 + \alpha}{\alpha}\,\theta\right)\right] \tag{7.13} $$

where R is the radius of the fixed circle on the outer periphery of which another circle of
radius $a = \alpha R$ is made to roll without slipping. $\theta$ is the angle of transportation of the
centre of this rolling circle measured at the centre of the fixed circle with respect to the
direction of the x-axis. It is obvious from Fig. 7.5 as well as from Eq. (7.13) that R is the
radial distance of the cusps of the epicycloid and $r_0 = R(1 + 2\alpha)$ is the radial distance
for the farthest point on the epicycloid measured from the centre of the fixed circle.

Now the centrifugal potential field generated by a rotating disc is given by
$V(r) = -\tfrac{1}{2}\omega_0^2 r^2$, $\omega_0$ being the angular speed of rotation of the disc. A particle dropped
gently on this rotating disc at r = R with no initial relative motion will trace an epicycloid on
the surface of the disc, having cusps always at r = R, and the equation for the tautochronous
motion along the epicycloid will be given by

$$ \ddot s + \frac{\omega_0^2}{4\alpha(1 + \alpha)}\,s = 0 \tag{7.14} $$

where

$$ s = \frac{\sqrt{(r_0^2 - r^2)(r_0^2 - R^2)}}{R} $$

Epicycloids also belong to a family of tesserals. 
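The relation between the arclength s and the radial distance r quoted with Eq. (7.14) can be verified numerically for the epicycloid (7.13). The following sketch measures s from the farthest point of an arch ($\theta = \pi\alpha$) by summing short chords and compares it with the closed form. The function names, the chord-summation scheme and the sample values are assumptions made for illustration.

```python
import math

def epicycloid(theta, R, alpha):
    """Point of the epicycloid of Eq. (7.13) for roll parameter theta."""
    psi = (1.0 + alpha) * theta / alpha
    x = R * ((1.0 + alpha) * math.cos(theta) - alpha * math.cos(psi))
    y = R * ((1.0 + alpha) * math.sin(theta) - alpha * math.sin(psi))
    return x, y

def arclength_from_apex(theta, R, alpha, n=20000):
    """Arclength from the farthest point of the arch (theta = pi*alpha) to theta,
    approximated by a polyline of n chords."""
    start = math.pi * alpha
    h = (theta - start) / n
    total = 0.0
    x0, y0 = epicycloid(start, R, alpha)
    for k in range(1, n + 1):
        x1, y1 = epicycloid(start + k * h, R, alpha)
        total += math.hypot(x1 - x0, y1 - y0)
        x0, y0 = x1, y1
    return total

def arclength_formula(theta, R, alpha):
    """s = sqrt((r0^2 - r^2)(r0^2 - R^2)) / R with r0 = R(1 + 2 alpha)."""
    x, y = epicycloid(theta, R, alpha)
    r2 = x * x + y * y
    r0 = R * (1.0 + 2.0 * alpha)
    return math.sqrt((r0 * r0 - r2) * (r0 * r0 - R * R)) / R

R, alpha, theta = 1.0, 0.25, 1.2
print(arclength_from_apex(theta, R, alpha), arclength_formula(theta, R, alpha))
```

The two numbers should agree to within the discretisation error of the chord sum, for any $\theta$ lying on the same arch as the chosen farthest point.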


7.7 SUMMARY 

Brachistochrones and tautochrones are the curves of constraints for respectively ‘least’ and 
‘equal’ travel times for material particles moving under any given force field. They can 
be identical only for three types of force fields, namely cycloids for constant force fields, 
hypocycloids for attractive Hooke’s type of force fields, and epicycloids for centrifugal force 
fields in the equatorial plane. 




PROBLEMS 

7.1 Two identical cycloids made of wood are fixed next to each other on the ceiling of a
room, with all the cusps touching the ceiling. A simple pendulum with a flexible but
inextensible cord is suspended from the wedge-like meeting point of the two adjacent
cusps. In order to make the pendulum swing with a period totally independent of the
amplitude, what should be the length of suspension l of the pendulum if the cycloids are
generated from the rolling of a circle of radius R?

7.2 Derive the equations for the hypocycloids and epicycloids as given by Eq. (7.9) and
Eq. (7.13) from geometrical considerations, that is, just considering the rolling of a
circle of radius $a = \alpha R$ on the inside/outside periphery of a fixed circle of radius R.
Prove that the length s measured from any central point (equidistant from any pair
of two consecutive cusps) satisfies an identical relation

$$ s = \frac{\sqrt{(r_0^2 - r^2)(r_0^2 - R^2)}}{R} $$

$r_0$ being the distance of the central point from O, the centre of the fixed circle, and r
being the radial distance to any point on the hypocycloid/epicycloid measured from
O.

7.3 It is shown in the text that Eq. (7.9) for the hypocycloid corresponds to a tautochrone 
in an attractive Hooke’s type of external force field. Now show that they also satisfy 
the condition for brachistochrone, namely Eq. (7.8). 

7.4 Assume that the earth is spherical with radius R and that its interior is
homogeneous having a uniform density $\rho$. A deep underground hypocycloidal tunnel is made
through the earth connecting two given places on the surface of the earth at an angular
separation of $\theta$ (with respect to the centre of the earth). Find the total length L, and
the maximum depth H of the tunnel as a function of R and $\theta$ only. Compare the
time of transit of any freely moving bogie of a train along this hypocycloidal track to
that along the straight tunnel connecting the same two points.

7.5 If, instead of a particle sliding down without sliding friction, a thin disc of radius r 
rolls down without slipping under the action of the constant gravity field, show that 
the track should still be cycloidal for executing a tautochronous motion. 

7.6 How does the combined force field of the earth due to its own gravitation and axial
rotation, acting at any interior point of the earth, affect the period of tautochronous
motion (in the equatorial plane) that satisfies the brachistochrone condition? If the
earth be given sufficient angular speed of rotation, could the brachistochrones ever
change from hypocycloids to epicycloids?

7.7 A bead of mass m subject to the force $\mathbf{F} = cy\,\hat{\mathbf{j}}$ slides from $x = -x_0$, $y = y_0$
to $x = x_0$, $y = y_0$ on a frictionless wire fixed in the x-y plane. The initial
speed is $v_0 = y_0\sqrt{c/m}$. Show that the trip time is minimised if the shape of the
wire is circular with radius $\sqrt{x_0^2 + y_0^2}$. Determine the travel time and find a physical
situation where the above can be realised.

7.8 Solve the problem of brachistochrone for the surface constraint as a vertical cylinder
between two points $(\rho_0, \phi_0, z_0)$ and $(\rho_0, 0, 0)$, $z_0 > 0$, in the earth’s uniform field of
gravitation.





8 

Canonical Transformations 


8.0 INTRODUCTION 

We now move to a new topic. This and the next chapter are supposed to complement each 
other. In most textbooks they are dealt with in a single chapter. So it would be advisable 
to read both the chapters first, before trying to solve the problems. 

We have already seen that the generalised coordinates and their corresponding generalised
momenta could be physically or dimensionally anything except for their pairwise conjugate
relationship, which maintains the dimension of the product of any pair of conjugate variables
to that of action, or simply $[ML^2T^{-1}]$. In this chapter, we shall, in the name of so-called
canonical transformations, transform more freely not only the generalised coordinates and
generalised momenta but also the value and the form of the Hamiltonian. The only
requirement we shall insist upon is to retain the form of Hamilton’s canonical equations
of motion, so that the new or transformed Hamiltonian also satisfies the 2n equations of
motion in the new coordinates and new momenta. The Hamiltonian is regarded merely as
a mathematical function of a set of coordinate and momentum like parameters.

In the process we can hit, by chance or by some systematic procedure, a canonical
transformation that can transform the old or normal looking Hamiltonian to any desired simple
form of our choice, say proportional to the new momentum or the new coordinate.
Naturally in such cases, solving the new Hamilton’s equations of motion becomes trivial. Then
using the inverse transformations, these solutions can readily be transformed to represent
the solutions in the old coordinates and old momenta. In this way we can spare ourselves
from solving directly Hamilton’s equations of motion written in terms of the old coordinates
and momenta. This is a kind of technical revolution for solving dynamical problems much
like the invention of the Laplace transforms for solving complicated differential equations.

Canonical transformations were introduced by Hamilton, but were developed more fully by
Jacobi at about the same time as Hamilton was working on them himself. Their generality and
far-reaching consequences were not immediately appreciated by the scientific community.
It was only when group properties of transformations in general became well-known to the 
physicists that the importance of canonical transformations was fully realised. Nevertheless, 
through the use of phase space in statistical mechanics, the idea of energy (Hamiltonian) as a 
constant of motion played a key role in realising the microcanonical ensemble representations 
of the thermodynamical systems. We shall briefly discuss the nature of Hamiltonian flows 
towards the end of the chapter. 




8.1 BACKGROUND AND DEFINITION 

For a holonomic bilateral dynamical system with n degrees of freedom, the choice of the n
generalised coordinates is quite arbitrary. A given set of generalised coordinates need not
reflect all the cyclic coordinates in the Lagrangian or the Hamiltonian expressed in terms
of these generalised coordinates and generalised velocities or momenta. It is, therefore,
very often needed to transform from one set of generalised coordinates, say $(q_1,\ldots,q_n)$, to
another set of generalised coordinates, say $(Q_1,\ldots,Q_n)$, connected by what is called a point
transformation given by

$$ Q_i = Q_i(q_1,\ldots,q_n), \qquad i = 1,\ldots,n \tag{8.1} $$

The Jacobian of this transformation should not vanish in order that the inverse
transformations,

$$ q_i = q_i(Q_1,\ldots,Q_n), \qquad i = 1,\ldots,n \tag{8.2} $$

also exist.

Since the set $(q_1,\ldots,q_n)$ represents a definite point in the configuration space spanned by
these n q-coordinates, it will lead to another definite point expressed by the transformations
(8.1) in the configuration space spanned by the n Q-coordinates.

The Lagrangian $L(q_1,\ldots,q_n; \dot q_1,\ldots,\dot q_n; t)$ can be expressed in terms of the new
coordinates and coordinate velocities so that (see section 6.6)

$$ L(q, \dot q, t) = L(Q, \dot Q, t) $$

The new momenta are given by

$$ P_i = \frac{\partial L}{\partial \dot Q_i} = P_i(q_1,\ldots,q_n; p_1,\ldots,p_n; t) $$

Note that the new momenta are defined in terms of the old coordinates, old momenta and
time only. Thus the general form of the point transformation is

$$ Q_i = Q_i(q_1,\ldots,q_n; t) \quad\text{and}\quad P_i = P_i(q_1,\ldots,q_n; p_1,\ldots,p_n; t) \tag{8.3} $$

with the corresponding inverse transformations

$$ q_i = q_i(Q_1,\ldots,Q_n; t) \quad\text{and}\quad p_i = p_i(Q_1,\ldots,Q_n; P_1,\ldots,P_n; t) \tag{8.4} $$


We must also require that Eqs (8.1) and (8.2) leave the form of Lagrange’s equations of 
motion invariant. It can easily be shown that Eqs (8.1) and (8.2) satisfy this requirement. 

Point transformations (8.1) and (8.2) are defined over the configuration space. Hamilton’s
equations of motion, however, describe the evolution of the state of a system in its phase
space. Thus, whenever Hamilton’s equations of motion are used to describe the system’s
dynamics, the transformations (8.3) and (8.4) must be replaced by a suitable phase space
transformation, namely

$$ Q_i = Q_i(q_1,\ldots,q_n; p_1,\ldots,p_n; t), \qquad P_i = P_i(q_1,\ldots,q_n; p_1,\ldots,p_n; t) \tag{8.5} $$

If we now want to have a physically meaningful phase space transformation, it would be
quite logical to demand that the transformations (8.5) should leave the form of Hamilton’s
canonical equations of motion invariant. Equivalently, we may require the invariance of
Hamilton’s principle under the transformations (8.5), that is,

$$ 0 = \delta\int_{t_1}^{t_2}\Big[\sum_i p_i\dot q_i - H(q,p,t)\Big]\,dt = \delta\int_{t_1}^{t_2}\Big[\sum_i P_i\dot Q_i - K(Q,P,t)\Big]\,dt \tag{8.6} $$

where $K(Q,P,t)$ is the transformed Hamiltonian. The condition given by Eq. (8.6) leads
immediately to Hamilton’s canonical equations of motion

$$ \dot P_i = -\frac{\partial K}{\partial Q_i} \quad\text{and}\quad \dot Q_i = \frac{\partial K}{\partial P_i} \tag{8.7} $$

which must be satisfied by the new coordinates and momenta $(Q_i, P_i)$.

The phase space transformations (8.5) that preserve the forms of the canonical equations
of motion (that is, of the Hamiltonian equations of motion) are called Canonical
Transformations (CT). Some textbooks, however, prefer to define CTs through the invariance of
elementary Poisson Brackets. The two definitions are equivalent.
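The Poisson bracket criterion mentioned above lends itself to a quick numerical test. The sketch below treats a single degree of freedom and assumes the usual convention $\{F, G\} = \partial F/\partial q\,\partial G/\partial p - \partial F/\partial p\,\partial G/\partial q$, with the elementary bracket $\{Q, P\} = 1$ as the condition to be checked. The two trial transformations and all function names are hypothetical examples chosen here, not taken from the text.

```python
import math

def poisson_bracket(F, G, q, p, h=1e-5):
    """{F, G} at the phase point (q, p) for one degree of freedom,
    with partial derivatives estimated by central differences."""
    dF_dq = (F(q + h, p) - F(q - h, p)) / (2.0 * h)
    dF_dp = (F(q, p + h) - F(q, p - h)) / (2.0 * h)
    dG_dq = (G(q + h, p) - G(q - h, p)) / (2.0 * h)
    dG_dp = (G(q, p + h) - G(q, p - h)) / (2.0 * h)
    return dF_dq * dG_dp - dF_dp * dG_dq

# Trial transformation 1 (illustrative): Q = arctan(q/p), P = (q^2 + p^2)/2
def Q1(q, p):
    return math.atan2(q, p)

def P1(q, p):
    return 0.5 * (q * q + p * p)

# Trial transformation 2 (illustrative): Q = q^2, P = p
def Q2(q, p):
    return q * q

def P2(q, p):
    return p

q0, p0 = 0.7, 1.3
bracket_1 = poisson_bracket(Q1, P1, q0, p0)
bracket_2 = poisson_bracket(Q2, P2, q0, p0)
print(bracket_1, bracket_2)
```

The first bracket should come out as unity at any phase point with $p > 0$, whereas the second equals $2q$, so by this criterion only the first trial transformation qualifies as canonical.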

We can also think of transformations 


$$q_i = f_i(Q_1, \ldots, Q_n, t) \quad\text{and}\quad p_i = P_i$$


such that only coordinates are transformed in the phase space without touching the sector 
for momenta. Such transformations are by nature point transformations, and are called 
extended point transformations. 


8.2 GENERATING FUNCTIONS 

The essential requirement for the canonical transformation is that every natural path in
the phase space spanned by $(p, q)$ should, when mapped onto the transformed phase
space spanned by $(P, Q)$, again be a natural path satisfying the corresponding Hamilton’s
equations of motion.

We know that Hamilton’s equations of motion result from Hamilton’s principle, which
requires that both the terminal coordinates and the terminal times be kept fixed and the path
at all other points be arbitrarily varied from the natural one. Hence we can always add to
the integrand of the transformed quantities in Eq. (8.6), namely $\sum_i P_i \dot{Q}_i - K(P,Q,t)$, a total
time derivative of any function whose variations vanish at the limits of the integration. Since
the variations of $t, q_1, \ldots, q_n$, and hence of $t, Q_1, \ldots, Q_n$, have to vanish at the end points for
the validity of Hamilton’s principle, such a term must be the time derivative of a suitable
function that depends on $t, q_1, \ldots, q_n, Q_1, \ldots, Q_n$, say $F_1(q_1, \ldots, q_n; Q_1, \ldots, Q_n; t)$, which
automatically satisfies

$$\delta \int_{t_1}^{t_2} \frac{dF_1}{dt}\, dt = \delta \big[ F_1 \big]_{t_1}^{t_2} = 0$$

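Thus we can generalise the requirement given by Eq. (8.6) on transformations (8.5) as

$$0 = \delta \int_{t_1}^{t_2} \Big[ \sum_i p_i \dot{q}_i - H(p,q,t) \Big]\, dt = \delta \int_{t_1}^{t_2} \Big[ \sum_i P_i \dot{Q}_i - K(P,Q,t) + \frac{d}{dt} F_1(q,Q,t) \Big]\, dt$$

where the two integrands are in general related by

$$\lambda \Big[ \sum_i p_i \dot{q}_i - H(q_1, \ldots, q_n; p_1, \ldots, p_n; t) \Big] = \sum_i P_i \dot{Q}_i - K(Q_1, \ldots, Q_n; P_1, \ldots, P_n; t) + \frac{dF_1(q,Q,t)}{dt} \tag{8.8}$$

Here $\lambda$ is a constant scale factor, called the valence of the CT. When $\lambda \neq 1$, the
transformations (8.8) are called extended canonical transformations. Normally $\lambda$ is set to
unity and the transformation (8.8) is called a univalent or ordinary canonical transformation,
or simply a canonical transformation. Henceforth we shall use $\lambda = 1$, for which the
conditional requirement (8.8) becomes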


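$$p_i \dot{q}_i - H(q,p,t) = P_i \dot{Q}_i - K(Q,P,t) + \frac{dF_1(q,Q,t)}{dt} = P_i \dot{Q}_i - K(Q,P,t) + \frac{\partial F_1}{\partial t} + \frac{\partial F_1}{\partial q_i} \dot{q}_i + \frac{\partial F_1}{\partial Q_i} \dot{Q}_i \tag{8.9}$$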


Henceforth Einstein’s summation convention is implied for all the repeated indices used in 
any term. Rewriting Eq. (8.9) in differential form 


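$$p_i\, dq_i - H(q,p,t)\, dt = P_i\, dQ_i - K(Q,P,t)\, dt + \frac{\partial F_1}{\partial t}\, dt + \frac{\partial F_1}{\partial q_i}\, dq_i + \frac{\partial F_1}{\partial Q_i}\, dQ_i$$

and treating $q$, $Q$, $t$ as independent variables, we now get

$$p_i = \frac{\partial F_1}{\partial q_i}, \qquad P_i = -\frac{\partial F_1}{\partial Q_i} \quad\text{for } i = 1, \ldots, n, \qquad K(Q,P,t) = H(q,p,t) + \frac{\partial F_1}{\partial t} \tag{8.10}$$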

The function $F_1(q,Q,t)$ is called the generating function of the canonical transformation
because it specifies the required equations of the transformation, namely the ones given by
Eq. (8.5).

Therefore, for univalent CTs, the physical dimension of the transformed Hamiltonian
remains unchanged.

Furthermore, if $F_1$ is not an explicit function of time, that is, $F_1 = F_1(q, Q)$, the value of
$K$, the transformed Hamiltonian (also jocularly called the ‘Kamiltonian’), is the same as that of
the Hamiltonian $H$ in the original phase space, for every point of the phase space spanned
by $(Q, P)$. But in general, the value of the Hamiltonian changes under any time-dependent
canonical transformation.

8.2.1 How to Obtain the Required CT When the Generating Function is Given 

When the generating function is given one can uniquely construct the transformations (8.5) 
and the transformed Hamiltonian in the following four steps: 

(a) Construct the $n$ equations for the old momenta using the first of the set of relations
(8.10), that is, obtain

$$p_i(q,Q,t) = \frac{\partial F_1}{\partial q_i}, \qquad i = 1, \ldots, n \tag{8.11}$$

From these relations, by algebraic manipulation, try to solve for the $Q_i$’s, so that they are
expressed as $Q_i = Q_i(q,p,t)$. This is possible because the transformations (8.5) are
supposed to be invertible, at least locally. So we obtain the new coordinates as functions
of the old coordinates, old momenta and time, that is, half of the required equations for the
CT are constructed.

(b) Take the second group of $n$ independent equations for the new momenta in Eq.
(8.10), given by

$$P_i = -\frac{\partial F_1}{\partial Q_i} = P_i(q,Q,t), \qquad i = 1, \ldots, n$$

and substitute $Q_i = Q_i(q,p,t)$, obtained from step (a) above, in each of
these $P_i$ equations in order to obtain $P_i = P_i(q,p,t)$. So we get all the new momenta as
functions of the old coordinates, old momenta and time.

(c) Find the inverse transformations $p_i = p_i(Q,P,t)$ and $q_i = q_i(Q,P,t)$.

(d) Now find $K(Q,P,t)$ from $H(p,q,t) + \partial F_1/\partial t$ by substituting all the $p$’s and $q$’s as
functions of $Q$, $P$, $t$ only, in both these terms.

An example: Consider the harmonic oscillator problem with $H = p^2/2m + kq^2/2$ and
a generating function $F_1 = \tfrac{1}{2}\sqrt{km}\; q^2 \cot Q$. The above procedure as in part (a) yields

$$Q = \tan^{-1}\!\big(\sqrt{km}\; q/p\big)$$

part (b) gives

$$P = \sqrt{m/4k}\; \big(kq^2 + p^2/m\big)$$

part (c) prescribes the inverse transformations,

$$p = (km)^{1/4} \sqrt{2P}\, \cos Q \quad\text{and}\quad q = (km)^{-1/4} \sqrt{2P}\, \sin Q$$

and finally part (d) gives the transformed Hamiltonian

$$K = \sqrt{k/m}\; P$$

One may note that the original Hamiltonian, which was quadratic in both $q$ and $p$, has
been transformed to a form which is linear in $P$ with no dependence on $Q$. With such a
simple form of the Hamiltonian, it is very easy to solve the equations of motion in $P$, $Q$
and then, if necessary, one can substitute back for $P$ and $Q$ and get the solutions for $q$ and
$p$, which would be the same as those obtained by solving the original Hamilton’s equations
in $q$ and $p$ directly.
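
The four steps for the oscillator example can be checked numerically. The following is a minimal sketch, not part of the original text: the function names, the sample point $(q, p) = (0.7, 0.4)$ and the values $k = 2$, $m = 0.5$ are arbitrary choices made for illustration, and the partial derivatives of $F_1$ are approximated by central differences.

```python
import math

def F1(q, Q, k=2.0, m=0.5):
    """Generating function F1 = (1/2) sqrt(km) q^2 cot Q."""
    return 0.5 * math.sqrt(k * m) * q * q / math.tan(Q)

def transform(q, p, k=2.0, m=0.5):
    """Steps (a) and (b): new variables (Q, P) from the old (q, p)."""
    Q = math.atan2(math.sqrt(k * m) * q, p)
    P = 0.5 * math.sqrt(m / k) * (k * q * q + p * p / m)
    return Q, P

def inverse(Q, P, k=2.0, m=0.5):
    """Step (c): old variables (q, p) from the new (Q, P)."""
    a = (k * m) ** 0.25
    return math.sqrt(2 * P) * math.sin(Q) / a, a * math.sqrt(2 * P) * math.cos(Q)

def check(q=0.7, p=0.4, k=2.0, m=0.5, h=1e-6):
    """Return the residuals of p = dF1/dq, P = -dF1/dQ and K = H."""
    Q, P = transform(q, p, k, m)
    # Central differences approximate the partial derivatives of F1.
    p_from_F1 = (F1(q + h, Q, k, m) - F1(q - h, Q, k, m)) / (2 * h)
    P_from_F1 = -(F1(q, Q + h, k, m) - F1(q, Q - h, k, m)) / (2 * h)
    H = p * p / (2 * m) + 0.5 * k * q * q
    K = math.sqrt(k / m) * P  # step (d): F1 has no explicit time dependence
    return p_from_F1 - p, P_from_F1 - P, K - H

print(check())               # three residuals, all close to zero
print(inverse(*transform(0.7, 0.4)))  # recovers the starting point
```

All three residuals should vanish up to the finite-difference error, and the round trip through `transform` and `inverse` should return the starting point.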

8.2.2 How to Obtain the Generating Function $F_1$ when the CT is Given

In this case $F_1(q,Q,t)$ is not given but the transformation Eqs (8.5) are given. The following
procedure may be adopted in order to find $F_1(q,Q,t)$.

(a) Invert Eqs (8.5) in order to obtain $p_i = p_i(Q,P,t)$ and $q_i = q_i(Q,P,t)$.

(b) Again take Eqs (8.5) and eliminate all the $p_i$’s so that after rearranging one obtains
$P_i = P_i(q,Q,t)$. Similarly, again take Eqs (8.5) and eliminate all the $P_i$’s so that this time, on
rearrangement, one obtains $p_i = p_i(q,Q,t)$.

(c) Write these newly obtained expressions for $p_i = p_i(q,Q,t)$ and $P_i = P_i(q,Q,t)$ in
the partial differential equations

$$p_i(q,Q,t) = \frac{\partial F_1}{\partial q_i} \quad\text{and}\quad P_i(q,Q,t) = -\frac{\partial F_1}{\partial Q_i}$$

respectively, and integrate both these partial differential equations separately. The first one
will result in a solution for $F_1(q,Q,t)$ except for the constant of integration, which would be
a function of the $Q$’s and $t$ only. Similarly, the second one will also result in another solution
for $F_1(q,Q,t)$ except for its constant of integration, which will now be a function of the $q$’s and
$t$ only. These two expressions for $F_1$ can be suitably combined to find the correct solution
for $F_1(q,Q,t)$.

(d) Once $F_1(q,Q,t)$ is obtained, find $K$ from $H(p,q,t) + \partial F_1/\partial t$ by substituting for the
$p$’s and $q$’s from the inverse transformations to Eqs (8.5).

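An example:

Given $Q = \log[(1/q) \sin p]$, $P = q \cot p$ and $H = p^2/2m + kq^2/2$, find $F_1(q,Q,t)$
and $K(Q,P,t)$.

Following the above procedure one first obtains the inverse transformations as
$p = \cos^{-1}(P \exp Q)$ and $q = [\exp(-2Q) - P^2]^{1/2}$, which can be used to finally obtain

$$F_1(q,Q) = q \cos^{-1}\!\big[(1 - q^2 \exp 2Q)^{1/2}\big] + \big[\exp(-2Q) - q^2\big]^{1/2}$$

Since this $F_1$ has no explicit time dependence, $K$ is simply $H$ expressed in the new variables,
$K(Q,P) = [\cos^{-1}(P \exp Q)]^2/2m + k[\exp(-2Q) - P^2]/2$.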
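
The quoted generating function can be tested against the defining relations $p = \partial F_1/\partial q$ and $P = -\partial F_1/\partial Q$. The sketch below is illustrative only: the function names and the sample point $(q, p) = (0.5, 0.6)$ are assumptions, chosen so that $0 < p < \pi/2$ and $q > 0$, the branch on which the inverse cosine in $F_1$ reproduces $p$.

```python
import math

def F1(q, Q):
    """F1(q, Q) = q arccos(sqrt(1 - q^2 e^{2Q})) + sqrt(e^{-2Q} - q^2)."""
    s = q * math.exp(Q)  # equals sin p on the branch used here
    return q * math.acos(math.sqrt(1 - s * s)) + math.sqrt(math.exp(-2 * Q) - q * q)

def new_variables(q, p):
    """The given transformation Q = log(sin p / q), P = q cot p."""
    return math.log(math.sin(p) / q), q / math.tan(p)

def residuals(q=0.5, p=0.6, h=1e-6):
    """Residuals of p = dF1/dq and P = -dF1/dQ, using central differences."""
    Q, P = new_variables(q, p)
    dF_dq = (F1(q + h, Q) - F1(q - h, Q)) / (2 * h)
    dF_dQ = (F1(q, Q + h) - F1(q, Q - h)) / (2 * h)
    return dF_dq - p, -dF_dQ - P

print(residuals())  # both entries should be close to zero
```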

8.2.3 Other Principal Forms of the Generating Function 

The generating function need not always be an explicit function of $q$, $Q$, $t$ only. In fact
one can apply Legendre’s transformations to $F_1(q,Q,t)$ and obtain generating functions
as explicit functions of other variables. Apart from the passive coordinate $t$, the other two
independent coordinates in a generating function can be any two (one old and the other
new) out of $q$, $Q$, $p$ and $P$, but never more than two because the third one can always be
expressed in terms of the first two.

Sometimes a generating function of the form $F_2 = F_2(q,P,t)$ is more useful than
$F_1$. In order to go from $F_1$ to $F_2$ using Legendre’s transformation, we must note that
the new variables $P_i = -\partial F_1/\partial Q_i$ are as independent as the $Q_i$’s, hence Legendre’s dual
transformation of $F_1$ will be

$$F_2(q,P,t) = F_1(q,Q,t) + P_i Q_i$$

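Therefore,

$$\begin{aligned}
p_i\, dq_i - H\, dt &= P_i\, dQ_i - K\, dt + dF_1 \\
&= P_i\, dQ_i - K\, dt + dF_2 - d(P_i Q_i) \\
&= -K\, dt + \frac{\partial F_2}{\partial t}\, dt + \frac{\partial F_2}{\partial q_i}\, dq_i + \frac{\partial F_2}{\partial P_i}\, dP_i - Q_i\, dP_i
\end{aligned}$$

Since in this case $q$, $P$ and $t$ are all regarded as independent, one obtains,

$$p_i = \frac{\partial F_2}{\partial q_i}, \qquad Q_i = \frac{\partial F_2}{\partial P_i}, \qquad i = 1, \ldots, n, \qquad K = H + \frac{\partial F_2}{\partial t} \tag{8.12}$$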

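Similarly one can form a third generating function $F_3(Q,p,t)$ from $F_1$ by replacing the $q_i$’s
by the $p_i$’s given by $p_i = \partial F_1/\partial q_i$, in which case Legendre’s dual transformation becomes

$$F_3(Q,p,t) = F_1(q,Q,t) - p_i q_i$$

Following the same procedure as in the case of $F_2$, we obtain

$$q_i = -\frac{\partial F_3}{\partial p_i}, \qquad P_i = -\frac{\partial F_3}{\partial Q_i}, \qquad i = 1, \ldots, n, \qquad K = H + \frac{\partial F_3}{\partial t} \tag{8.13}$$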


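Similarly, a fourth generating function $F_4(p,P,t)$ can be constructed either from $F_2$ or
$F_3$ or even from $F_1$, where

$$\begin{aligned}
F_4(p,P,t) &= F_1(q,Q,t) - q_i p_i + P_i Q_i \\
&= F_3(Q,p,t) + P_i Q_i \\
&= F_2(q,P,t) - p_i q_i
\end{aligned}$$

giving finally,

$$q_i = -\frac{\partial F_4}{\partial p_i}, \qquad Q_i = \frac{\partial F_4}{\partial P_i}, \qquad i = 1, \ldots, n, \qquad K = H + \frac{\partial F_4}{\partial t} \tag{8.14}$$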


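These four generating functions have striking similarities with the four thermodynamic
potentials $U'$, $H'$, $F'$ and $G'$ described earlier in connection with Legendre’s
transformations in section 5.1. These four generating functions also satisfy Maxwell-like relations
which can be obtained by differentiating once more the first two relations of each of the sets
of Eqs (8.10), (8.12), (8.13) and (8.14). We quote the final results

$$\left(\frac{\partial p_i}{\partial Q_j}\right)_{q,t} = -\left(\frac{\partial P_j}{\partial q_i}\right)_{Q,t}, \qquad
\left(\frac{\partial p_i}{\partial P_j}\right)_{q,t} = \left(\frac{\partial Q_j}{\partial q_i}\right)_{P,t},$$
$$\left(\frac{\partial q_i}{\partial Q_j}\right)_{p,t} = \left(\frac{\partial P_j}{\partial p_i}\right)_{Q,t}, \qquad
\left(\frac{\partial q_i}{\partial P_j}\right)_{p,t} = -\left(\frac{\partial Q_j}{\partial p_i}\right)_{P,t} \tag{8.15}$$

each one obtained in sequence from the four distinct forms of the generating functions, namely
$F_1(q,Q,t)$, $F_2(q,P,t)$, $F_3(p,Q,t)$ and $F_4(p,P,t)$.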

Equations (8.15) are to be satisfied by all univalent canonical transformations. The 
beauty of these equations lies in their symmetrical forms, which are totally independent 
of the Hamiltonian or the generating functions. Given a set of canonical transformations 
either in the form of Eqs (8.1) or in the form of Eqs (8.5), they must satisfy all the four sets 
of equations given in Eqs (8.15). 
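
As a short worked check (added here for illustration, using the oscillator generating function $F_1 = \tfrac{1}{2}\sqrt{km}\, q^2 \cot Q$ of section 8.2.1), the Maxwell-like relation that follows from $F_1$, namely $(\partial p/\partial Q)_q = -(\partial P/\partial q)_Q$, can be verified directly from $p = \partial F_1/\partial q$ and $P = -\partial F_1/\partial Q$:

```latex
\begin{aligned}
p(q,Q) &= \frac{\partial F_1}{\partial q} = \sqrt{km}\; q \cot Q,
&\qquad
P(q,Q) &= -\frac{\partial F_1}{\partial Q} = \tfrac{1}{2}\sqrt{km}\; q^2 \csc^2 Q, \\[4pt]
\left(\frac{\partial p}{\partial Q}\right)_{q} &= -\sqrt{km}\; q \csc^2 Q,
&\qquad
\left(\frac{\partial P}{\partial q}\right)_{Q} &= \sqrt{km}\; q \csc^2 Q,
\end{aligned}
\qquad\Longrightarrow\qquad
\left(\frac{\partial p}{\partial Q}\right)_{q} = -\left(\frac{\partial P}{\partial q}\right)_{Q}.
```

Neither the Hamiltonian nor the explicit form of $K$ enters this check, in line with the remark that these relations are independent of the Hamiltonian.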

An interesting point to note is that in all four cases the value of the transformed
Hamiltonian $K$ is related to the value of the old Hamiltonian by a similar relation: the value
differs by the partial time derivative of the corresponding generating function. If there is an
explicit time dependence in the transformations themselves, the generating functions must
also have explicit time dependence, because $Q = Q(t)$ and $P = P(t)$ imply moving
frames of reference and hence the energy or Hamiltonian cannot be the same in the two frames,
even though with respect to either frame it may be a constant of motion.


8.2.4 Conditions for Canonicality 

The property of the canonical transformations, namely the occurrence of the same time
parameter $t$ in both the phase spaces spanned by $(q,p)$ and $(Q,P)$, should in principle
allow us to test the condition for canonical transformation in terms of $q$, $p$, $Q$, $P$ and their
differentials only. Time can be treated as an independent parameter. In fact this is quite
obvious in all the four sets of Eqs (8.15) that can be used for testing the canonicality of
a given phase space transformation. All these conditions are also seen to be evaluated at
constant $t$, which signifies the purely geometrical nature of the canonical transformations
that take place in terms of the phase space coordinates alone. Although the conditions for
canonicality by definition must be such as to preserve the form of Hamilton’s equations of
motion, the conditions themselves in the form of Eqs (8.15) are intrinsically independent of
the Hamiltonian. In order to appreciate this point further, let us recall the condition for
canonicality in terms of the generating function $F_1(q,Q,t)$ from Eq. (8.9) given by

$$p_i\, dq_i - H(p,q,t)\, dt = P_i\, dQ_i - K(P,Q,t)\, dt + dF_1(q,Q,t) \tag{8.16}$$

Here $dq_i$ and $dQ_i$ are the differential (perfect) changes in the coordinates in real time $dt$.
Since the transformations in the phase space between two sets of coordinates $q_i$’s and $Q_i$’s
are essentially geometrical (that is, without involving $t \to t'$), we can look upon $dq_i$ and
$dQ_i$ as the elements of static geometrical curves rather than evolving paths, so far as the
testing of the canonical condition is concerned. This leads to setting $dt = 0$ in Eq. (8.16)
which gives

$$p_i\, \delta q_i - P_i\, \delta Q_i = \text{an exact differential} \tag{8.17}$$

where $t$ in $P_i(q,p,t)$, $Q_i(q,p,t)$ and $F_1(q,Q,t)$ is held constant while finding the differentials
$\delta q_i$ and $\delta Q_i$. This is analogous to the testing of a geometrical condition on rheonomic
constraints, which can proceed with the contemporaneous variations of the coordinates.
Here in the above condition $P_i = P_i(q,p,t)$ and $Q_i = Q_i(q,p,t)$ are already used to
evaluate $P_i\, \delta Q_i$ in terms of $(p,q,t)$ only, so that the exactness can be tested in either the
$(q,p)$ space or the $(Q,P)$ space but not in any mixed space. Also note that the process
of setting $dt = 0$ automatically removes all the Hamiltonian dependent terms from Eq.
(8.16).

Now such a set of equations in terms of differentials is said to have the property of contact 
transformations, which are defined as follows. 

If the equations connecting two sets of variables $(q_1, \ldots, q_n, p_1, \ldots, p_n)$ and
$(Q_1, \ldots, Q_n, P_1, \ldots, P_n)$ are such that the differential form

$$P_1\, dQ_1 + \ldots + P_n\, dQ_n - \{p_1\, dq_1 + \ldots + p_n\, dq_n\}$$

is, when expressed in terms of $(q_1, \ldots, q_n, p_1, \ldots, p_n)$ and their differentials, the perfect
differential of a function of $(q_1, \ldots, q_n, p_1, \ldots, p_n)$, then the change from the set of variables
$(q_1, \ldots, q_n, p_1, \ldots, p_n)$ to $(Q_1, \ldots, Q_n, P_1, \ldots, P_n)$ is called a contact transformation.
Its essential geometrical property is that if one draws two curves which meet tangentially
at some point then in the transformed phase space the corresponding curves are also bound
to meet tangentially at the corresponding phase space point, thus emphasizing the meaning
of the adjective ‘contact’.

Now the condition (8.17) does satisfy this property, and hence all canonical
transformations are nothing but contact transformations, so long as the time parameter itself is not
transformed. The condition (8.17) can also be derived from the first of Maxwell’s relations,
that is,

$$\left(\frac{\partial p_i}{\partial Q_j}\right)_{q,t} = -\left(\frac{\partial P_j}{\partial q_i}\right)_{Q,t}$$

because we know that if

$$\left(\frac{\partial M}{\partial y}\right)_x = \left(\frac{\partial N}{\partial x}\right)_y$$

then $M\, dx + N\, dy$ is a perfect differential, which gives the condition (8.17).

Similar conditions can be derived from the other three equations of Maxwell, and they
are

$$p_i\, \delta q_i + Q_i\, \delta P_i = \text{a perfect differential}$$
$$q_i\, \delta p_i + P_i\, \delta Q_i = \text{a perfect differential}$$

and

$$q_i\, \delta p_i - Q_i\, \delta P_i = \text{a perfect differential}$$

Some more tests of canonicality will be given in the next chapter and in problem 8.6.
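
The exactness condition (8.17) can also be probed numerically: if $p\,\delta q - P\,\delta Q$ is a perfect differential of a single-valued function, its integral around any closed loop in the $(q,p)$ plane vanishes. The following is a rough sketch under stated assumptions, not a general test: the loop (a small circle), the step count and the function names are choices made here, and the canonical map is the one from the example of section 8.2.2. A simple scaling $Q = 2q$, $P = p$ is included as a non-canonical contrast.

```python
import math

def loop_integral(transform, center=(1.0, 0.8), radius=0.2, n=4000):
    """Approximate the closed-loop integral of p dq - P dQ around a circle."""
    pts = []
    for j in range(n + 1):
        a = 2 * math.pi * j / n
        q = center[0] + radius * math.cos(a)
        p = center[1] + radius * math.sin(a)
        pts.append((q, p) + transform(q, p))
    total = 0.0
    for (q0, p0, Q0, P0), (q1, p1, Q1, P1) in zip(pts, pts[1:]):
        # Trapezoidal rule on each small segment of the loop.
        total += 0.5 * (p0 + p1) * (q1 - q0) - 0.5 * (P0 + P1) * (Q1 - Q0)
    return total

def canonical(q, p):
    """Q = log(sin p / q), P = q cot p (example of section 8.2.2)."""
    return math.log(math.sin(p) / q), q / math.tan(p)

def scaled(q, p):
    """Q = 2q, P = p: not a univalent canonical transformation."""
    return 2 * q, p

print(loop_integral(canonical))  # close to zero
print(loop_integral(scaled))     # close to the loop area, pi * radius**2
```

For the scaled map the loop integral equals the enclosed area rather than zero, so the differential form is not exact and the map fails the test.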


8.3 PROPERTIES OF CANONICAL TRANSFORMATIONS 


1. The Jacobian determinant of the univalent canonical transformations is unity, with 
the result that any finite volume of the phase space before and after the transformation 
remains the same. (Canonical transformations preserve volume in the phase space.) 

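Let

$$Q_i = Q_i(q,p,t) \quad\text{and}\quad P_i = P_i(q,p,t), \qquad i = 1, \ldots, n$$

be a univalent canonical transformation. The Jacobian of the transformation is defined to
be

$$J = \frac{\partial(Q_1, \ldots, Q_n; P_1, \ldots, P_n)}{\partial(q_1, \ldots, q_n; p_1, \ldots, p_n)}
= \det \begin{bmatrix}
\dfrac{\partial Q_1}{\partial q_1} & \cdots & \dfrac{\partial Q_1}{\partial p_n} \\
\vdots & & \vdots \\
\dfrac{\partial P_n}{\partial q_1} & \cdots & \dfrac{\partial P_n}{\partial p_n}
\end{bmatrix} \tag{8.18}$$

Introducing an intermediate pair of sets of independent coordinates $(q, Q)$ we can write

$$J = \frac{\partial(Q_1, \ldots, Q_n; P_1, \ldots, P_n)}{\partial(q_1, \ldots, q_n; Q_1, \ldots, Q_n)} \cdot \frac{\partial(q_1, \ldots, q_n; Q_1, \ldots, Q_n)}{\partial(q_1, \ldots, q_n; p_1, \ldots, p_n)}$$

We now interchange the two halves ($Q$’s and $P$’s) in the first factor, which introduces a factor
$(-1)^n$, and write the second factor in terms of the corresponding inverse transformation. We get

$$J = (-1)^n \left[\frac{\partial(P_1, \ldots, P_n; Q_1, \ldots, Q_n)}{\partial(q_1, \ldots, q_n; Q_1, \ldots, Q_n)}\right] \Big/ \left[\frac{\partial(q_1, \ldots, q_n; p_1, \ldots, p_n)}{\partial(q_1, \ldots, q_n; Q_1, \ldots, Q_n)}\right]$$

These two determinants have the following general forms respectively:

$$\det \begin{bmatrix} \left(\dfrac{\partial P_i}{\partial q_j}\right)_Q & \left(\dfrac{\partial P_i}{\partial Q_j}\right)_q \\ 0 & 1 \end{bmatrix}
\quad\text{and}\quad
\det \begin{bmatrix} 1 & 0 \\ \left(\dfrac{\partial p_i}{\partial q_j}\right)_Q & \left(\dfrac{\partial p_i}{\partial Q_j}\right)_q \end{bmatrix}$$

From these general forms, it is easy to deduce that

$$J = (-1)^n \left[\frac{\partial(P_1, \ldots, P_n)}{\partial(q_1, \ldots, q_n)}\right]_Q \Big/ \left[\frac{\partial(p_1, \ldots, p_n)}{\partial(Q_1, \ldots, Q_n)}\right]_q
= \left[\frac{\partial(\partial F_1/\partial Q_1, \ldots, \partial F_1/\partial Q_n)}{\partial(q_1, \ldots, q_n)}\right]_Q \Big/ \left[\frac{\partial(\partial F_1/\partial q_1, \ldots, \partial F_1/\partial q_n)}{\partial(Q_1, \ldots, Q_n)}\right]_q$$

where $P_i = -\partial F_1/\partial Q_i$ and $p_i = \partial F_1/\partial q_i$ from Eq. (8.10) have been used, the $n$ minus signs
cancelling the factor $(-1)^n$. It is now obvious that these two determinants are identical, since both
are determinants of the matrix of mixed second derivatives $\partial^2 F_1/\partial q_i\, \partial Q_j$ (or its transpose), and
therefore $J = 1$. Hence,

$$dQ_1 \ldots dQ_n\, dP_1 \ldots dP_n = J\, dq_1 \ldots dq_n\, dp_1 \ldots dp_n = dq_1 \ldots dq_n\, dp_1 \ldots dp_n$$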

In other words, the volume element of the phase space remains unchanged for any univalent
canonical transformation. This is generally valid when the phase space is constructed in a
Cartesian fashion. Note that a change from the Cartesian to the spherical polar coordinates
has the Jacobian

$$J_{(x,y,z) \to (r,\theta,\phi)} = r^2 \sin\theta \neq 1 \tag{8.19}$$

So this extended point transformation $(x, y, z, p_x, p_y, p_z) \to (r, \theta, \phi, p_x, p_y, p_z)$ is not
canonical. Try to solve problem number 8.5 for further clarification.
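
As a one-degree-of-freedom illustration of the unit Jacobian (a worked check added here, using the oscillator transformation of section 8.2.1 with the shorthand $a = (km)^{1/4}$, which is notation introduced only for this example), take $q = a^{-1}\sqrt{2P}\,\sin Q$ and $p = a\sqrt{2P}\,\cos Q$:

```latex
\frac{\partial(q,p)}{\partial(Q,P)}
= \det\begin{bmatrix}
\dfrac{\partial q}{\partial Q} & \dfrac{\partial q}{\partial P} \\[8pt]
\dfrac{\partial p}{\partial Q} & \dfrac{\partial p}{\partial P}
\end{bmatrix}
= \det\begin{bmatrix}
a^{-1}\sqrt{2P}\,\cos Q & \dfrac{a^{-1}\sin Q}{\sqrt{2P}} \\[8pt]
-a\sqrt{2P}\,\sin Q & \dfrac{a\cos Q}{\sqrt{2P}}
\end{bmatrix}
= \cos^2 Q + \sin^2 Q = 1 .
```

The Jacobian of the inverse map is unity, and hence so is that of the forward map $(q,p) \to (Q,P)$.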


2. All univalent canonical transformations form a group. For this we need to show that 

(a) The identity transformation (q,p) —► ( Q,P ) where q = Q and p = P is a 
canonical transformation. 

(b) Two canonical transformations performed in sequence correspond to a single canonical 
transformation (closure condition). 

(c) The inverse of a given canonical transformation exists and is itself a canonical trans¬ 
formation. 

(d) The CTs performed in the orders $(C_1 C_2) C_3$ and $C_1 (C_2 C_3)$ are identical, that is, the
composition of CTs is associative.


To show (a) we choose a generating function $F_2(q,P) = q_i P_i$. Therefore,

$$p_i = \frac{\partial F_2}{\partial q_i} = P_i \quad\text{and}\quad Q_i = \frac{\partial F_2}{\partial P_i} = q_i$$

which demonstrates the existence of an identity transformation which is canonical.

To show (b), let

$$\bar{Q} = \bar{Q}(Q,P,t), \qquad \bar{P} = \bar{P}(Q,P,t)$$

and

$$Q = Q(q,p,t), \qquad P = P(q,p,t)$$

be the canonical transformations. Therefore at any fixed time $t_0$ we can write, using Eq.
(8.17),

$$d\bar{F}(Q,\bar{Q},t_0) = P_i\, \delta Q_i - \bar{P}_i\, \delta \bar{Q}_i \quad\text{and}\quad dF(q,Q,t_0) = p_i\, \delta q_i - P_i\, \delta Q_i$$

Adding these equations we get

$$d(F + \bar{F}) = p_i\, \delta q_i - \bar{P}_i\, \delta \bar{Q}_i \tag{8.20}$$

which proves the assertion.

To prove (c) let

$$Q = Q(q,p,t) \quad\text{and}\quad P = P(q,p,t) \tag{8.21}$$

be a canonical transformation. We want to prove that the inverse transformation

$$q = q(Q,P,t) \quad\text{and}\quad p = p(Q,P,t) \tag{8.22}$$

exists and is canonical.

For this we assume that the derivatives of $Q$ and $P$ are continuous. Then a well-known
theorem in calculus, the inverse mapping theorem, states that Eq. (8.22) exists at every
point in the range of Eq. (8.21) provided the Jacobian determinant of Eq. (8.21) (Eq. 8.18)
is nonzero at every point in the domain of Eq. (8.21). We have already proved that the
Jacobian determinant of Eq. (8.21) is unity over the domain of Eq. (8.21). This proves the
existence of Eq. (8.22).

Since the transformation (8.21) is canonical, by Eqs (8.16) and (8.17) we get, for every $t_0$,

$$dF(q,p,t_0) = p_i\, \delta q_i - P_i\, \delta Q_i \tag{8.23}$$

If we substitute the inverse transformations (8.22) in Eq. (8.23) we find that the transformed
function $-F(Q,P,t_0)$ satisfies the relation

$$d[-F(Q,P,t_0)] = P_i\, \delta Q_i - p_i\, \delta q_i$$

which shows that the transformation (8.22) is canonical.

To prove (d) consider the following canonical transformations

$$C: (q^{(0)}, p^{(0)}) \xrightarrow{\;F^{(0)}\;} (q^{(1)}, p^{(1)}), \qquad B: (q^{(1)}, p^{(1)}) \xrightarrow{\;F^{(1)}\;} (q^{(2)}, p^{(2)})$$

and

$$A: (q^{(2)}, p^{(2)}) \xrightarrow{\;F^{(2)}\;} (q^{(3)}, p^{(3)})$$

Using the procedure leading to Eq. (8.20) we see that the generating function for both the
compositions, $A(BC)$ and $(AB)C$, is $F^{(0)} + F^{(1)} + F^{(2)}$, showing that $(AB)C = A(BC)$.

Therefore, all univalent CTs form a group, or in other words, the rules of group algebra
are uniformly applicable to all the possible examples of CTs.

8.4 SOME EXAMPLES OF CANONICAL TRANSFORMATIONS 


The examples given below are picked in order to demonstrate how versatile the CTs are and 
how the hidden symmetries become quite transparent in many cases. 

1. Let us study the canonical transformation generated by $F_1(q,Q,t) = q_i Q_i$.
It immediately gives

$$p_i = \frac{\partial F_1}{\partial q_i} = Q_i \quad\text{and}\quad P_i = -\frac{\partial F_1}{\partial Q_i} = -q_i \tag{8.24}$$

Equations (8.24) show that momenta and coordinates are canonically equivalent. They can
be exchanged except for a sign. This fact can be appreciated by noting that Hamilton’s
equations of motion are symmetric in the $p_i$’s and $q_i$’s except for a change of sign. Thus
generalised momenta and coordinates are completely equivalent in the description of the phase
trajectories of a system obeying Hamilton’s equations of motion. The coordinate sector of
the phase space can interchangeably be read as the momentum sector except for the flip of
sign to be introduced.


2. The extended point transformations are in general not canonical transformations. It
should be cautioned that most textbooks claim the opposite.

Usually the generating function is chosen to be

$$F_2 = f_i(q_1, \ldots, q_n, t)\, P_i$$

giving

$$Q_i = \frac{\partial F_2}{\partial P_i} = f_i(q_1, \ldots, q_n, t)$$

This justifies only a partial requirement of a point transformation. Now the above form of
$F_2$ also gives

$$p_i = \frac{\partial F_2}{\partial q_i} = P_j \frac{\partial f_j}{\partial q_i}$$

which means that $p_i \neq P_i$. Hence the above transformation is in general not an extended
point transformation.


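3. The gauge transformations for the electromagnetic potentials are merely canonical
transformations in the phase space.

It is well known that Maxwell’s equations of electromagnetism do not specify the
electromagnetic potentials $\mathbf{A}(\mathbf{r},t)$ and $\phi(\mathbf{r},t)$ uniquely. If the potentials are changed as

$$\mathbf{A}'(\mathbf{r},t) = \mathbf{A}(\mathbf{r},t) + \nabla f(\mathbf{r},t) \quad\text{and}\quad \phi'(\mathbf{r},t) = \phi(\mathbf{r},t) - \frac{\partial f}{\partial t} \tag{8.25}$$

where $f(\mathbf{r},t)$ is any arbitrarily chosen scalar point function, both the sets $(\mathbf{A}', \phi')$ and
$(\mathbf{A}, \phi)$ satisfy the same Maxwell’s equations. The transformations (8.25) are called
electromagnetic gauge transformations, under which Maxwell’s equations remain invariant.

We now show that the gauge transformations (8.25) can be effected through a canonical
transformation of coordinates $\mathbf{r}$ and momenta $\mathbf{p}$ to $(\mathbf{Q}, \mathbf{P})$ as defined below.

Choose a generating function

$$F_2(\mathbf{r},\mathbf{P}) = \mathbf{r} \cdot \mathbf{P} - e f(\mathbf{r},t)$$

This gives

$$\mathbf{p} = \frac{\partial F_2}{\partial \mathbf{r}} = \mathbf{P} - e \nabla f \quad\text{and}\quad \mathbf{Q} = \frac{\partial F_2}{\partial \mathbf{P}} = \mathbf{r} \tag{8.26}$$

which means that $\mathbf{Q} = \mathbf{r}$ and $\dot{\mathbf{Q}} = \dot{\mathbf{r}}$. The new Hamiltonian is

$$K(\mathbf{P},\mathbf{Q}) = H + \frac{\partial F_2}{\partial t} = H - e \frac{\partial f}{\partial t} \tag{8.27}$$

Now we know that the canonical momentum is

$$\mathbf{p} = m\mathbf{v} + e\mathbf{A}$$

and if the gauge transformations (8.25) correspond to the canonical transformations (8.26)
then the canonically new momentum $\mathbf{P}$, which is equal to $\mathbf{p} + e \nabla f$, must correspond to
$m\dot{\mathbf{Q}} + e\mathbf{A}'$, that is, its gauge equivalent. This is shown as follows:

$$\mathbf{P} = \mathbf{p} + e \nabla f = m\dot{\mathbf{r}} + e\mathbf{A} + e \nabla f = m\dot{\mathbf{Q}} + e(\mathbf{A} + \nabla f) = m\dot{\mathbf{Q}} + e\mathbf{A}'$$

giving the first of the Eqs (8.25).

Similarly from Eq. (8.27) we get

$$K = H - e \frac{\partial f}{\partial t} = \tfrac{1}{2} m \dot{\mathbf{r}}^2 + e\phi - e \frac{\partial f}{\partial t} = \tfrac{1}{2} m \dot{\mathbf{Q}}^2 + e\Big[\phi - \frac{\partial f}{\partial t}\Big] = \tfrac{1}{2} m \dot{\mathbf{Q}}^2 + e\phi'$$

giving the second of Eqs (8.25). Thus the new canonical momentum and energy are the
gauge transforms of the old canonical momentum and energy respectively, showing that the
electromagnetic gauge transformations can be viewed as a univalent canonical
transformation (8.26).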


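4. The Lagrangian gauge transformation given by

$$L'(q,\dot{q},t) = L(q,\dot{q},t) + \frac{df(q,t)}{dt}$$

can also be viewed as a canonical transformation effected by the generating function,

$$F_2(q,P) = q_i P_i - f(q,t)$$

giving

$$p_i = \frac{\partial F_2}{\partial q_i} = P_i - \frac{\partial f}{\partial q_i} \quad\text{and}\quad Q_i = \frac{\partial F_2}{\partial P_i} = q_i \tag{8.28}$$

and

$$K = H + \frac{\partial F_2}{\partial t} = H - \frac{\partial f}{\partial t} \tag{8.29}$$

Note that $q_i = Q_i$ and hence $\dot{q}_i = \dot{Q}_i$.

Now one has to see whether the new momenta $P_i$ and Hamiltonian $K$ obtained via
canonical transformation are the same as those expected from the new Lagrangian $L'$, that
is, whether

$$P_i = \frac{\partial L'}{\partial \dot{Q}_i} \quad\text{and}\quad K = \dot{Q}_i \frac{\partial L'}{\partial \dot{Q}_i} - L'$$

In order to prove this, consider

$$\frac{\partial L'}{\partial \dot{Q}_i} = \frac{\partial L'}{\partial \dot{q}_i} = \frac{\partial L}{\partial \dot{q}_i} + \frac{\partial}{\partial \dot{q}_i}\Big(\frac{df}{dt}\Big)
= p_i + \frac{\partial}{\partial \dot{q}_i}\Big[\frac{\partial f}{\partial q_j} \dot{q}_j + \frac{\partial f}{\partial t}\Big] = p_i + \frac{\partial f}{\partial q_i} = P_i$$

Further,

$$\dot{Q}_i \frac{\partial L'}{\partial \dot{Q}_i} - L' = \dot{q}_i \Big(p_i + \frac{\partial f}{\partial q_i}\Big) - L - \frac{\partial f}{\partial q_i} \dot{q}_i - \frac{\partial f}{\partial t}
= (\dot{q}_i p_i - L) - \frac{\partial f}{\partial t} = H - \frac{\partial f}{\partial t} = K$$

which proves the assertion.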

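5. Any infinitesimal coordinate transformation is a canonical transformation.

Choose the generating function to be

$$F_2(\mathbf{r},\mathbf{P}) = \mathbf{r} \cdot \mathbf{P} + \Delta\mathbf{a} \cdot \mathbf{P}$$

where $\Delta\mathbf{a}$ is a constant vector representing an infinitesimal coordinate transformation,

$$\mathbf{Q} \to \mathbf{r} + \Delta\mathbf{a} \tag{8.30}$$

or equivalently, a shift in the origin of the coordinate system by $-\Delta\mathbf{a}$. The above generating
function gives

$$\mathbf{Q} = \frac{\partial F_2}{\partial \mathbf{P}} = \mathbf{r} + \Delta\mathbf{a}, \qquad \mathbf{p} = \frac{\partial F_2}{\partial \mathbf{r}} = \mathbf{P} \quad\text{and}\quad K = H \tag{8.31}$$

leaving momentum and Hamiltonian unchanged as expected. Equations (8.31) obviously
correspond to the infinitesimal coordinate transformation (8.30), which is now proved to be
a CT.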

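6. Any infinitesimal rotation by $\Delta\phi$ of the coordinate frame about an axis passing
through the origin and pointing in the direction of the unit vector $\hat{\mathbf{n}}$ corresponds to a
canonical transformation.

Take as the generating function,

$$F_2(\mathbf{r},\mathbf{P}) = \mathbf{r} \cdot \mathbf{P} + \Delta\phi\; \hat{\mathbf{n}} \cdot (\mathbf{r} \times \mathbf{P})$$

Hence

$$\mathbf{Q} = \frac{\partial F_2}{\partial \mathbf{P}} = \mathbf{r} + \Delta\phi\, (\hat{\mathbf{n}} \times \mathbf{r}), \qquad
\mathbf{p} = \frac{\partial F_2}{\partial \mathbf{r}} = \mathbf{P} - \Delta\phi\, (\hat{\mathbf{n}} \times \mathbf{P}) \tag{8.32}$$

Equations (8.32) correspond to an infinitesimal rotation $\Delta\phi$ of the system around any
arbitrary direction ($\hat{\mathbf{n}}$) or equivalently to the rotation of the coordinate frame by $-\Delta\phi$
around the same direction.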


7. Any infinitesimal transformation involving a shift only in time by $\Delta t$ is also a canonical
transformation that connects the evolution of the coordinates and momenta in time.

Since the time evolution of a conservative system over any interval of time is governed by
the Hamiltonian, we choose as the generating function

$$F_2(q,P) = qP + \Delta t\, H(q,P) \tag{8.33}$$

This gives

$$Q(t) = \frac{\partial F_2}{\partial P} = q + \Delta t \frac{\partial H}{\partial P} = q + \Delta t\, \dot{Q} \simeq q(t) + \Delta t\, \dot{q} \simeq q(t + \Delta t)$$

correct to the first order of smallness of $\Delta t$. The above approximations are possible because
of the smallness of $\Delta t$. Again,

$$p(t) = \frac{\partial F_2}{\partial q} = P + \Delta t \frac{\partial H}{\partial q} = P(t) - \dot{p}\, \Delta t$$

Thus we get

$$Q(t) = q(t + \Delta t) \quad\text{and}\quad P(t) = p(t) + \dot{p}\, \Delta t = p(t + \Delta t) \tag{8.34}$$

and also

$$K = H(q(t + \Delta t), p(t + \Delta t)) \tag{8.35}$$

Equations (8.34) and (8.35) show that an infinitesimal change in time by $\Delta t$ leads to
the evolution of the conservative system from $q(t)$ to $Q(t) = q(t + \Delta t)$, $p(t)$ to
$P(t) = p(t + \Delta t)$ and $H(q(t),p(t))$ to $K(Q,P) = H(q(t + \Delta t), p(t + \Delta t))$. This
transformation can be viewed as an infinitesimal canonical transformation generated by the
generating function (8.33), which involves the Hamiltonian in the infinitesimal term. This is
the reason why the Hamiltonian is called the generator of evolution.

Since canonical transformations are associative, a large number of infinitesimal
transformations of the above kind can be applied in succession. Thus, any general solution
$q = q(q_0,p_0,t)$ and $p = p(q_0,p_0,t)$ from the initial values $(q_0,p_0)$ can be regarded as a
canonical transformation between the initial and final values, that is, directly from $(q_0,p_0)$
to $(q, p)$.
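
The generating function (8.33) can be turned into an explicit update rule when $H$ is simple. The sketch below is an illustration under stated assumptions, not part of the original text: it takes $H(q,P) = P^2/2m + kq^2/2$, so that $p = \partial F_2/\partial q$ and $Q = \partial F_2/\partial P$ can be solved in closed form (the resulting rule is the scheme usually called symplectic Euler), and the function names and parameter values are arbitrary. It checks that one step has unit Jacobian and that many small steps approach the exact oscillator solution.

```python
import math

def step(q, p, dt, k=1.0, m=1.0):
    """One canonical step generated by F2 = q P + dt * H(q, P)."""
    P = p - dt * k * q   # from p = dF2/dq = P + dt * k * q
    Q = q + dt * P / m   # from Q = dF2/dP = q + dt * P / m
    return Q, P

def jacobian(q, p, dt, h=1e-6, k=1.0, m=1.0):
    """Finite-difference Jacobian determinant d(Q, P)/d(q, p) of one step."""
    Qa, Pa = step(q + h, p, dt, k, m)
    Qb, Pb = step(q - h, p, dt, k, m)
    Qc, Pc = step(q, p + h, dt, k, m)
    Qd, Pd = step(q, p - h, dt, k, m)
    dQdq, dPdq = (Qa - Qb) / (2 * h), (Pa - Pb) / (2 * h)
    dQdp, dPdp = (Qc - Qd) / (2 * h), (Pc - Pd) / (2 * h)
    return dQdq * dPdp - dQdp * dPdq

def evolve(q, p, t, n, k=1.0, m=1.0):
    """Compose n infinitesimal steps to cover the finite time t."""
    dt = t / n
    for _ in range(n):
        q, p = step(q, p, dt, k, m)
    return q, p

print(jacobian(0.3, -0.2, 0.05))            # unity, up to rounding
print(evolve(1.0, 0.0, 1.0, 10000))         # composed steps
print((math.cos(1.0), -math.sin(1.0)))      # exact solution for k = m = 1
```

The step is exactly area preserving for any $\Delta t$, while agreement with the true trajectory holds only to first order in $\Delta t$, as in Eq. (8.34).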

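8. The canonical transformation generated by

$$F_2(\mathbf{r},\mathbf{P}) = \mathbf{r} \cdot \mathbf{P} + (\mathbf{V} \cdot \mathbf{P})\, t - m(\mathbf{r} \cdot \mathbf{V})$$

also represents a physical situation.

In this case, we have

$$\mathbf{p} = \frac{\partial F_2}{\partial \mathbf{r}} = \mathbf{P} - m\mathbf{V} \quad\text{or}\quad \mathbf{P} = \mathbf{p} + m\mathbf{V}$$

$$\mathbf{Q} = \frac{\partial F_2}{\partial \mathbf{P}} = \mathbf{r} + \mathbf{V} t \tag{8.36}$$

and

$$K = H + \frac{\partial F_2}{\partial t} = H + \mathbf{P} \cdot \mathbf{V} \tag{8.37}$$

We know that Eqs (8.36) and (8.37) represent a Galilean transformation from $(\mathbf{r},\mathbf{p})$ to
$(\mathbf{Q},\mathbf{P})$, where the origin of the $(\mathbf{r},\mathbf{p})$ system is moving in real space with a constant velocity
$\mathbf{V}$ with respect to the origin of the $(\mathbf{Q},\mathbf{P})$ system.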

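9. A changeover to a uniformly rotating frame of reference from a stationary frame can
also be shown to be canonical using the following generating function,

$$F_2(\mathbf{r},\mathbf{P}) = \mathbf{r} \cdot \mathbf{P} - t\, \boldsymbol{\omega} \cdot (\mathbf{r} \times \mathbf{P})$$

because this generating function gives

$$\mathbf{p} = \frac{\partial F_2}{\partial \mathbf{r}} = \mathbf{P} + (\boldsymbol{\omega} \times \mathbf{P})\, t, \qquad
\mathbf{Q} = \frac{\partial F_2}{\partial \mathbf{P}} = \mathbf{r} + (\mathbf{r} \times \boldsymbol{\omega})\, t \tag{8.38}$$

and

$$K = H + \frac{\partial F_2}{\partial t} = H - \boldsymbol{\omega} \cdot (\mathbf{r} \times \mathbf{P}) = H - \boldsymbol{\omega} \cdot (\mathbf{Q} \times \mathbf{P}) = H - \boldsymbol{\omega} \cdot \mathbf{L} \tag{8.39}$$

where $\mathbf{L}$ is the angular momentum measured in the rotating frame. These results can be
verified from the ones given in chapter 3, namely Eq. (3.8).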


8.5 CANONICAL TRANSFORMATION TO THE FREE PARTICLE HAMILTONIAN

The principal utility of the canonical transformation is to transform the physical
Hamiltonian $H(q,p,t)$ to a new Hamiltonian $K(Q,P,t)$ such that the corresponding equations of
motion become easier to solve and also display all possible cyclic coordinates and conserved
quantities. As an illustration we shall present here examples in which the transformed
Hamiltonian of a non-free system may look like that of a free particle.

Let us consider the motion of an electrically charged particle having charge $e$ and mass
$m$ in a uniform electric field $E$, with $E$ pointing along the one-dimensional positive $q$-axis.
The Hamiltonian is

$$H(q,p) = \frac{p^2}{2m} - eEq$$

giving the complete solutions of Hamilton’s equations of motion as

$$p = eEt + p_0, \qquad q = \Big(\frac{eE}{2m}\Big) t^2 + \Big(\frac{p_0}{m}\Big) t + q_0$$

We want to canonically transform $(q,p)$ to $(Q,P)$ such that the transformed Hamiltonian
$K(Q,P)$ assumes the form of a free particle Hamiltonian for mass $m$, that is,

$$K(Q,P) = \frac{P^2}{2m}$$

with a complete solution,

$$P = P_0, \qquad Q = \Big(\frac{P}{m}\Big) t + Q_0$$

Now eliminating $t$ between these two sets of solutions and setting $q_0 = Q_0 = 0$ and
$p_0 = P_0 = P$ we get,

$$q = \frac{meE}{2} \Big(\frac{Q}{P}\Big)^2 + Q \quad\text{and}\quad p = eE \Big(\frac{mQ}{P}\Big) + P \tag{8.40}$$

This is the required canonical transformation.

Similarly, the Hamiltonian for a simple 1-D harmonic oscillator in $(q,p)$ is given by

$$H(q,p) = \frac{p^2}{2m} + \frac{1}{2} m \omega^2 q^2$$

with a complete solution

$$p = p_0 \cos(\omega t) - m\omega q_0 \sin(\omega t)$$

and

$$q = \frac{p_0}{m\omega} \sin(\omega t) + q_0 \cos(\omega t)$$

On the elimination of $t$ between these solutions and the required free particle solution in
$(Q,P)$ with $Q_0 = q_0 = 0$ and $p_0 = P_0 = P$, one obtains

$$p = P \cos\Big(\frac{m\omega Q}{P}\Big) \quad\text{and}\quad q = \Big(\frac{P}{m\omega}\Big) \sin\Big(\frac{m\omega Q}{P}\Big) \tag{8.41}$$
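
Both transformations can be sanity-checked numerically. The following is a small sketch, with function names, parameter values and the sample point chosen here purely for illustration: it confirms that substituting Eqs (8.40) and (8.41) into the respective Hamiltonians gives $P^2/2m$, and that the Jacobian $\partial(q,p)/\partial(Q,P)$, estimated by central differences, is unity as required for a univalent CT.

```python
import math

def field_map(Q, P, e=1.0, E=0.5, m=2.0):
    """Eq. (8.40): uniform-field (q, p) in terms of free-particle (Q, P)."""
    return 0.5 * m * e * E * (Q / P) ** 2 + Q, e * E * m * Q / P + P

def oscillator_map(Q, P, m=2.0, w=1.5):
    """Eq. (8.41): oscillator (q, p) in terms of free-particle (Q, P)."""
    th = m * w * Q / P
    return P * math.sin(th) / (m * w), P * math.cos(th)

def jacobian(f, Q, P, h=1e-5):
    """Central-difference estimate of d(q, p)/d(Q, P)."""
    qa, pa = f(Q + h, P)
    qb, pb = f(Q - h, P)
    qc, pc = f(Q, P + h)
    qd, pd = f(Q, P - h)
    return ((qa - qb) * (pc - pd) - (qc - qd) * (pa - pb)) / (4 * h * h)

Q, P = 0.3, 1.2
q, p = field_map(Q, P)
print(p * p / (2 * 2.0) - 1.0 * 0.5 * q, P * P / (2 * 2.0))  # H and K agree
q, p = oscillator_map(Q, P)
print(p * p / (2 * 2.0) + 0.5 * 2.0 * 1.5 ** 2 * q * q)       # also P^2/2m
print(jacobian(field_map, Q, P), jacobian(oscillator_map, Q, P))
```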

In some extreme cases it is even possible to make the transformed Hamiltonian totally
vanish, that is, $K(Q,P,t) = 0$, irrespective of the given form of $H(q,p,t)$. When we use
Hamilton’s principal function $W(q_1, \ldots, q_n; t)$ (see Eq. 6.38b) as a generating function, the
transformed Hamiltonian vanishes automatically because of the identities

$$p_i = \frac{\partial W}{\partial q_i} \quad\text{and}\quad H + \frac{\partial W}{\partial t} = 0$$

Such a class of solutions and the corresponding CTs will be of utmost importance, and in
fact forms the basis of chapter 10.

It may not be out of place to make a comment on Eq. (6.38a). It is obvious that the
normal to the equi-action surface for $W$ in the $n$-dimensional configuration space defines
the canonical momentum as a vector in the same configuration space. The Hamiltonian,
being a surface in the phase space, finds a place also in the configuration space through the
function $W$, through the relation Eq. (6.38b). The partial time derivative of $W$ at a given
point in the configuration space defines the form of the Hamiltonian.


8.6 LIOUVILLE’S THEOREM 

We know that every possible state of a system is represented by a unique point in its 
phase space. Let us call this point in the phase space to be the image point of the system. 
Since the state of the system at any given time (through Hamilton’s equations of motion) 
determines uniquely its state at any other time, the motion of the image point in the phase 
space (determining the changes of state the system with time) is uniquely determined by 
its initial position. The path traced by the image point in the phase space as it moves in 
accordance with the time evolution of the system is called a trajectory. Obviously, one and 
only one trajectory passes through each point of the phase space as Hamilton’s equations 
of motion fix the local gradients uniquely. 

Suppose at some time t_0, the image point of the system in its phase space is at M_0 and 
at some other time t (succeeding or preceding t_0) at some other point M. We know that 
the points M_0 and M determine each other uniquely. So we can say that, during the time 
interval from t_0 to t, the point M_0 of the phase space goes over to M. During the same interval 
of time, every other point of the phase space goes over to a definite new position. In other 
words, the entire phase space is transformed into itself. Moreover, this transformation is 
one to one because the position of a point at time t also determines its position at the time 
t_0. If we keep t_0 fixed and vary t arbitrarily then by example 7 of section 8.4 we see that 
the set of all possible changes of state (during the interval from t_0 to t) of the given system is represented as 
an infinite (continuous) sequence of canonical transformations of the phase space into itself, 
such that each transformation in the sequence involves an infinitesimal increment in time. 
We call the above described motion of the phase space in itself its natural motion. 

Let M' denote a set of points in phase space with volume V(0) ≠ 0 at time t = 0. 
During the natural motion of phase space this set goes over to a set M'(t) in a time interval 
t with volume V(t). Liouville’s theorem then asserts that 

    V(t) = V(0) 

Proof 

The transformation from M' to M'(t) is a canonical transformation which is a composition 
of an infinite sequence of infinitesimal canonical transformations. We have already proved 
that every canonical transformation preserves volume in phase space by virtue of the fact 
that its Jacobian is unity. This completes the proof. 

However, one should remember that Liouville’s theorem does not apply to the dissipative 
systems for which the constant energy surface containing the trajectories in the phase space 




keeps on shrinking in volume due to dissipation. 
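
The contrast between a Hamiltonian flow and a dissipative one can be checked numerically for one degree of freedom. The sketch below is only an illustration under stated assumptions: a unit-mass harmonic oscillator, a linear damping term gamma*p (not taken from the text) as the dissipative model, a small triangle as the set M', and a standard RK4 integrator. Because the flow is linear, the triangle maps to a triangle and its area can be followed exactly with the shoelace formula.

```python
import math

def shoelace_area(points):
    """Signed area of a polygon given as a list of (q, p) vertices."""
    total = 0.0
    n = len(points)
    for i in range(n):
        q1, p1 = points[i]
        q2, p2 = points[(i + 1) % n]
        total += q1 * p2 - q2 * p1
    return 0.5 * total

def rk4_step(state, dt, omega, gamma):
    """One RK4 step for dq/dt = p, dp/dt = -omega^2 q - gamma p (unit mass)."""
    def f(s):
        q, p = s
        return (p, -omega**2 * q - gamma * p)
    k1 = f(state)
    k2 = f((state[0] + 0.5 * dt * k1[0], state[1] + 0.5 * dt * k1[1]))
    k3 = f((state[0] + 0.5 * dt * k2[0], state[1] + 0.5 * dt * k2[1]))
    k4 = f((state[0] + dt * k3[0], state[1] + dt * k3[1]))
    return (state[0] + dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6,
            state[1] + dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6)

def evolve(points, t, omega, gamma, steps=2000):
    """Carry every vertex along the flow for a time t."""
    dt = t / steps
    moved = []
    for s in points:
        for _ in range(steps):
            s = rk4_step(s, dt, omega, gamma)
        moved.append(s)
    return moved

# Illustrative initial set M': a small triangle in the (q, p) plane.
triangle = [(1.0, 0.0), (1.2, 0.0), (1.0, 0.3)]
a0 = shoelace_area(triangle)
a_cons = shoelace_area(evolve(triangle, 3.0, omega=2.0, gamma=0.0))  # Hamiltonian
a_diss = shoelace_area(evolve(triangle, 3.0, omega=2.0, gamma=0.5))  # damped
print(a0, a_cons, a_diss, a0 * math.exp(-0.5 * 3.0))
```

With gamma = 0 the area is unchanged; with damping it contracts as exp(-gamma t), because the divergence of this particular flow is -gamma.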


8.7 AREA CONSERVATION PROPERTY OF HAMILTONIAN FLOWS 


We shall now prove a fundamental property of Hamiltonian systems. Let (q,p) denote some 
convenient coordinates in the phase space of the system. Consider a closed curve (loop) C 
in the phase space. Then the area in the closed loop C is defined as 

    A = \oint_C \sum_k p_k \, dq_k = \oint_C \langle p, dq \rangle        (8.42) 


where \langle p, dq \rangle denotes the scalar product \sum_k p_k dq_k. This means that the area A in Eq. 
(8.42) is the algebraic sum of all the projected partial areas on the coordinate planes defined 
by the conjugate pairs q_k, p_k (see Fig. 8.1). Every point on C moves in accordance with 
Hamilton’s equations of motion, with the Hamiltonian H(q,p,t). In other words, the loop 
C moves according to the natural motion of the phase space. The change in its area A is 
given by 

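    \frac{dA}{dt} = \frac{d}{dt} \oint_C \langle p, dq \rangle = \oint_C \langle \dot p, dq \rangle + \oint_C \langle p, d\dot q \rangle 

Integrating the second term on the RHS by parts, we get, 

    \frac{dA}{dt} = \oint_C \langle \dot p, dq \rangle - \oint_C \langle \dot q, dp \rangle + [\langle p, \dot q \rangle]_C        (8.43) 

The last term vanishes because C is a closed loop and \langle p, \dot q \rangle is single valued. Using 
Hamilton’s equations, we get 

    \frac{dA}{dt} = -\oint_C \langle \partial H/\partial q, dq \rangle - \oint_C \langle \partial H/\partial p, dp \rangle = -\oint_C dH = 0 

as H is single valued. 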


Thus the natural motion of the phase space (governed by Hamilton’s equations) leaves 
invariant the area enclosed by an arbitrary closed loop in the phase space. 
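
As a quick illustration of this result for one degree of freedom, consider a free particle. The rectangle and its side lengths a and b below are chosen here only for illustration and do not come from the text.

```latex
H = \frac{p^2}{2m}, \qquad
T_t : (q_0, p_0) \mapsto \left( q_0 + \frac{p_0 t}{m},\; p_0 \right)

% Jacobian of the flow map
\frac{\partial(q_t, p_t)}{\partial(q_0, p_0)}
  = \det \begin{pmatrix} 1 & t/m \\ 0 & 1 \end{pmatrix} = 1

% The rectangle 0 \le q_0 \le a,\ 0 \le p_0 \le b is sheared into the
% parallelogram with vertices (0,0),\ (a,0),\ (a + bt/m,\, b),\ (bt/m,\, b):
A(t) = \text{base} \times \text{height} = a\,b = A(0)
```

The loop is deformed more and more as t grows, but the enclosed area stays equal to ab.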

The converse of the above result is also true. Thus if a continuous transformation of the 
phase space onto itself conserves areas for all closed loops for all time then that transformation 
must be generated by a Hamiltonian H(p,q,t). This can be proved as follows. 

Since we are now given that dA/dt = 0, using Eq. (8.43) we can write 

    \oint_C \langle \dot p, dq \rangle - \oint_C \langle \dot q, dp \rangle + [\langle p, \dot q \rangle]_C = 0        (8.44) 

Since Eq. (8.44) must hold for any closed loop C in the phase space we must have, 

    -\langle \dot p, dq \rangle + \langle \dot q, dp \rangle = 0        (8.45) 

Equation (8.45) is an equation to some hypersurface in the 2n-dimensional phase space. The 
vector X = (-\dot p, \dot q) is normal to this surface at every point on it. Now at any given instant 
t, let H(p,q,t) denote a single valued scalar function which is constant on the hypersurface 
defined by Eq. (8.45). Then it is well known that 

    X = (-\dot p, \dot q) = \nabla H = \left( \frac{\partial H}{\partial q}, \frac{\partial H}{\partial p} \right)        (8.46) 

where 

    \frac{\partial}{\partial q} = \left( \frac{\partial}{\partial q_1}, \frac{\partial}{\partial q_2}, \ldots, \frac{\partial}{\partial q_n} \right), \qquad \frac{\partial}{\partial p} = \left( \frac{\partial}{\partial p_1}, \frac{\partial}{\partial p_2}, \ldots, \frac{\partial}{\partial p_n} \right) 

Equations (8.46) are nothing but Hamilton’s equations of motion, which proves what we 
wanted. 

Fig. 8.1 Projections of the bounding surface of a given phase space volume 
on to the elementary phase planes 

Thus we have proved the important result that a system is Hamiltonian if and only if the 
corresponding continuous transformation of the phase space onto itself conserves areas in 
the sense described above. 

Any continuous (usually one parameter) transformation of the phase space onto itself is 
called a flow. If a flow conserves areas then it is called a Hamiltonian flow. 

Note that the above proof did not depend in any way on whether the Hamiltonian of the 
system is time dependent or time independent, that is whether the energy of the system is 
conserved or not. Thus the area preserving property holds good for both time dependent 
and time independent Hamiltonian systems. Therefore, the net result is that if (q(t), p(t)) 
represents the state of the system at time t and 

    T_t : (q(0), p(0)) \to (q(t), p(t))        (8.47) 

denotes the phase space transformation generated by the Hamiltonian, the transformation 
T_t conserves areas in phase space. 

We shall now show that any continuous phase space transformation is canonical if and 
only if it conserves areas in the phase space. 

We know that a transformation 

    Q = Q(q,p,t)   and   P = P(q,p,t)        (8.48) 

is canonical if and only if there exists a function F(q,p,t) such that for every instant of 
time, 

    dF(q,p,t) = \langle p, dq \rangle - \langle P(q,p,t), dQ(q,p,t) \rangle        (8.49) 

Integrate both sides around any closed loop C in the phase space, say counter-clockwise. 
Since F is single valued and C is a closed curve, the integral of the LHS gives zero so that 

    \oint_C \langle p, dq \rangle = \oint_C \langle P(q,p,t), dQ(q,p,t) \rangle = \oint_{C'} \langle P, dQ \rangle        (8.50) 

where the loop C gets transformed into C' under the transformation (8.48). Thus we have 
proved that if the transformation (8.48) satisfies Eq. (8.49) then it conserves areas in the 
phase space. Now suppose we are given that (8.48) conserves areas in the phase space then 
it is trivial to show that it must satisfy Eq. (8.49). We leave this as an exercise. 

Thus, in fact we can define a canonical transformation as a phase space transformation 
(8.48) which conserves areas in the phase space. It is also well known that, in a two-dimensional 
phase space (n = 1), a transformation is area preserving if and only if its Jacobian is unity. Thus 
for n = 1 a transformation is canonical if and only if its Jacobian is unity; for n > 1 a unit 
Jacobian is necessary but not by itself sufficient. For n = 1 this statement is stronger than the 
one proved in section 8.3 where only the necessity was proved. 

The area preserving property of the Hamiltonian flows is more fundamental than its volume 
preserving property (Liouville’s theorem). Thus Liouville’s theorem can be proved using 
the fact that the transformation T_t (see Eq. (8.47)) is area preserving (that is, canonical) 
but the area preserving property of T_t cannot be proved using Liouville’s theorem. 


8.8 SUMMARY 

This is an interesting chapter. It gives us a handle to transform the Hamiltonian to have any 
desired form so that solving Hamilton’s equations in the transformed set of the generalised 
coordinates and generalised momenta becomes quite trivial. So the basic requirement for this 
has been to allow only those phase space transformations that leave the form of Hamilton’s 
equations of motion invariant. The class of such transformations is called canonical, or 
equivalently, contact transformations. These two connotations differ only in the style of 
their definitions. They are called canonical to signify that the canonical equations of motion 
remain by condition invariant in form. The connotation contact signifies continuation of the 
property of touching any two curves in both the original and transformed phase spaces. The 
definition of the second type is in fact independent of the knowledge of any Hamiltonian, 
which it should be, because the transformation equations themselves do not have anything 
to do with any Hamiltonian. However, one can easily check that the use of a Hamiltonian in 
testing canonicality is like the use of litmus paper in testing the acidity of a given solution, 
which is not supposed to influence the nature of the solution it tests. In fact one can 
use any Hamiltonian, that is, any arbitrary function of momenta and coordinates to test 




the canonicality. The contact behaviour does not involve time, and this property being 
geometrical must be satisfied at any given instant. 

All canonical transformations form a group, which means that for any given canonical 
transformation an inverse CT exists, that an identity transformation is a CT, that any two 
successive CTs can be regarded as a single CT, and that three consecutive CTs also satisfy 
the associative property. The group properties of the CTs essentially give all CTs a feeling 
of belonging to a family. 

All the various examples of CTs in different contexts of physics show their versatility. 
Obviously from the examples given in section 8.4 it is apparent that all these CTs define 
some kind of momenta and Hamiltonian, which may or may not have any connection with our 
regular notions of momentum and energy, yet they all are seen to satisfy a set of first order 
differential equations called Hamilton’s equations of motion. So these merely constitute a 
unified viewpoint that the CTs have brought into physics. Once again, we must remember 
that all CTs form a group, and therefore, so many isolated events of physics given in the 
form of examples of CTs in section 8.4 are assembled into a family. 

The seventh of the above examples uses a CT that can transform the initial coordinates 
and momenta to the coordinates and momenta at a later instant infinitesimally next to it. 
So the whole course of evolution of a dynamically conservative system can be regarded as 
the gradual unfolding of a contact transformation. The Hamiltonian quietly changes from 
H(q(t), p(t)) to H(q(t + \Delta t), p(t + \Delta t)), as is expected in such a case. 

In section 8.5, we have illustrated with an example how the motion of a charged particle 
in a uniform electric field could be viewed as a motion of a free particle in a new system of 
phase space coordinates effected by a suitable CT. 

The last two sections deal with general theorems regarding the behaviour of phase space 
under CTs. Certain properties do remain invariant. Liouville’s theorem guarantees the 
invariance of the volume of phase space spanned by a conservative dynamical system. The 
other theorem, namely the area theorem, is more general, and it asserts that the sum of 
all the projected areas on the individual conjugate planes in a phase space under any given 
closed curve remains unchanged as the system evolves with time. 

PROBLEMS 

8.1 Show that the identity transformations can exist only in the form of F_2(P, q) = P_i q_i, 
and F_3(p, Q) = -p_i Q_i, not for F_1 = F_1(q, Q) and F_4 = F_4(p, P). 

8.2 If two CTs 

    (q, p) \to (Q, P) 

and 

    (Q, P) \to (Q', P') 

are generated by F_2(q, P) and G_2(Q, P'), show that 

    (q, p) \to (Q', P') 

is also a CT generated by H_2(q, P') = F_2(q, P) + G_2(Q, P') - \sum_i P_i Q_i. 




8.3 The reflection about the X_2-X_3 plane passing through the origin, that is, x_1 = 
-X_1, x_2 = X_2 and x_3 = X_3, is canonical and is generated by F_2(x, P) = 
-x_1 P_1 + x_2 P_2 + x_3 P_3. How does the momentum vector transform? 

8.4 A general rotation about the origin, x_i \to x'_i = R_{ij} x_j, is described by the 
rotation matrix R = (R_{ij}), i, j = 1, 2, 3. Find the generating function 
F_2(x_i, p'_j) and the transformation equations for the momentum components. 

8.5 Show that the Cartesian to polar coordinate transformation in phase space, if introduced 
only for the coordinate sector given by (x_1, x_2, x_3, p_1, p_2, p_3) \to (r, \theta, \phi, p_1, 
p_2, p_3), is not canonical, but the transformation (x_1, x_2, x_3, p_1, p_2, p_3) \to 
(r, \theta, \phi, p_r, p_\theta, p_\phi) is, being effected through a generating function F_3 = 
F_3(p_1, p_2, p_3, r, \theta, \phi) = -(p_1 r \sin\theta \cos\phi + p_2 r \sin\theta \sin\phi + p_3 r \cos\theta). Find 
p_r, p_\theta, p_\phi and evaluate p^2 = p_1^2 + p_2^2 + p_3^2 in terms of p_r, p_\theta, p_\phi. Also 
check whether the Jacobian for the second transformation is unity. 

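8.6 Apart from the conditions of canonicality stated already in section 8.2.4, prove that 
the following conditions are also valid for testing canonicality of a phase space transformation 
given by Q_i = Q_i(q,p,t) and P_i = P_i(q,p,t): 

(i) 

    \sum_k \left( \frac{\partial Q_i}{\partial q_k} \frac{\partial P_j}{\partial p_k} - \frac{\partial Q_i}{\partial p_k} \frac{\partial P_j}{\partial q_k} \right) = \delta_{ij} 

    \sum_k \left( \frac{\partial Q_i}{\partial q_k} \frac{\partial Q_j}{\partial p_k} - \frac{\partial Q_i}{\partial p_k} \frac{\partial Q_j}{\partial q_k} \right) = 0 

    \sum_k \left( \frac{\partial P_i}{\partial q_k} \frac{\partial P_j}{\partial p_k} - \frac{\partial P_i}{\partial p_k} \frac{\partial P_j}{\partial q_k} \right) = 0 

The meaning of these expressions in terms of the Poisson brackets will be clear when we 
move on to the next chapter. 

(ii) If \delta be a symbol for an independent set of small increments (of coordinates and 
momenta) and d be a symbol for total increments in the sense of the total differentials (of 
coordinates and momenta), then show that 

    \sum_k (\delta P_k \, dQ_k - dP_k \, \delta Q_k) = \sum_k (\delta p_k \, dq_k - dp_k \, \delta q_k) 

This is called the bilinear covariance of the Pfaffians \sum_k P_k dQ_k and \sum_k p_k dq_k, where P_k = 
P_k(Q_1,...,Q_n) and p_k = p_k(q_1,...,q_n). 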

8.7 Prove that the following transformations are canonical: 

(i) Q = p^{-1},    P = q p^2 

(ii) Q = q^\alpha \cos(\beta p),    P = q^\alpha \sin(\beta p)    only if \alpha = 1/2 and \beta = 2 

(iii) Q = \tan^{-1}(\alpha q/p),    P = \frac{1}{2} \alpha q^2 (1 + p^2/\alpha^2 q^2)    for any constant \alpha 

(iv) Q = \ln(1 + \sqrt{q} \cos p),    P = 2(1 + \sqrt{q} \cos p) \sqrt{q} \sin p 

(v) Q = \sqrt{2q} \, e^{t} \cos p,    P = \sqrt{2q} \, e^{-t} \sin p 

(vi) Q = aq + bp,    P = cq + dp    only if ad - bc = 1 




(vii) Q = q \tan p,    P = \ln \sin p 
(viii) Q_i = p_i \tan t,    P_i = q_i \tan t 

(ix) q = P^2 + Q^2,    p = \frac{1}{2} \tan^{-1}(P/Q) 

(x) P_i = p_i / \sum_j p_j^2,    Q_i = q_i \sum_j p_j^2 - 2 p_i \sum_j p_j q_j 

8.8 Determine the canonical transformations defined by the following generating func¬ 
tions: 

(i) F_1(q,Q,t) = \frac{1}{2} m \omega(t) q^2 \cot Q 

(ii) F_1(q,Q,t) = \frac{1}{2} m \omega [q - F(t)/m\omega^2]^2 \cot Q 

(iii) F_1(q,Q) = qQ - \frac{1}{2} m \omega q^2 - Q^2/4m\omega 

(iv) F_3(Q,p) = -(e^Q - 1)^2 \tan p 

What happens to the Hamiltonian of a simple harmonic oscillator when transformed 
from p, q coordinates to P , Q coordinates for each of these transformations? 

8.9 (a) Show that the transformation defined by the equations, 

    Q_1 = q_1^2 + \lambda^2 p_1^2 

    Q_2 = \frac{1}{2}(q_1^2 + q_2^2 + \lambda^2 p_1^2 + \lambda^2 p_2^2) 

    2\lambda P_1 = \tan^{-1}(q_1/\lambda p_1) - \tan^{-1}(q_2/\lambda p_2) 

    P_2 = \lambda \tan^{-1}(q_2/\lambda p_2) 

is a contact transformation, and that it reduces the original Hamiltonian 

    H = \frac{1}{2}(p_1^2 + p_2^2 + q_1^2/\lambda^2 + q_2^2/\lambda^2) 

to its transformed form 

    K = Q_2 

(b) Similarly show that the transformation defined by 

    q_1 = \lambda_1^{-1/2} (2Q_1)^{1/2} \cos P_1 + \lambda_2^{-1/2} (2Q_2)^{1/2} \cos P_2 

    q_2 = -\lambda_1^{-1/2} (2Q_1)^{1/2} \cos P_1 + \lambda_2^{-1/2} (2Q_2)^{1/2} \cos P_2 
    p_1 = \frac{1}{2}(2\lambda_1 Q_1)^{1/2} \sin P_1 + \frac{1}{2}(2\lambda_2 Q_2)^{1/2} \sin P_2 
    p_2 = -\frac{1}{2}(2\lambda_1 Q_1)^{1/2} \sin P_1 + \frac{1}{2}(2\lambda_2 Q_2)^{1/2} \sin P_2 
changes the Hamiltonian 

    H = p_1^2 + p_2^2 + \lambda_1^2 (q_1 - q_2)^2/8 + \lambda_2^2 (q_1 + q_2)^2/8 
to K = \lambda_1 Q_1 + \lambda_2 Q_2. 

Integrate the equations of motion and express the solution in terms of the original equations. 

8.10 Show that the generating function F_1(q,Q) = -\frac{1}{2} m g \tau (q + Q) - \frac{1}{2} m (q - Q)^2/\tau 
produces a canonical transformation which consists in changing the coordinate q(t) 
and momentum p(t) to Q(t) = q(t + \tau) and P(t) = p(t + \tau), where g and 
\tau are both constants, for the 1-D motion of a particle of mass m moving in a field 
of uniform gravity having the potential energy V(q) = -mgq. 

8.11 Consider the Hamiltonian for small oscillations of an anharmonic oscillator of unit 
mass to be H(q,p) = p^2/2 + \omega^2 q^2/2 + \alpha q^3 + \beta q p^2 under the assumption that 
\alpha q \ll \omega^2 and \beta q \ll 1. Find the parameters a, b for the canonical transformation 
produced by the generating function F_2(q,P) = qP + a q^2 P + b P^3 such that 
the new Hamiltonian does not contain any anharmonic terms up to the first order in 
\alpha Q/\omega^2 and \beta Q. Determine the solution for q(t). 

8.12 If an infinitesimal segment of a curve (dy, dx) in a 2-D space, having slope p is 
transformed to (dY, dX) with a new slope P, find the transformation and justify 
why such a transformation should be called a contact transformation. 





9 

The Poisson Bracket 


9.0 INTRODUCTION 

Usually the Poisson bracket relations are included in the chapter dealing with the canonical 
transformations. In fact one of the test criteria for the canonicality of a given phase space 
transformation is a set of fundamental Poisson bracket relations. So we would recommend 
reading both these chapters before solving some of the problems suggested at the end of the 
last chapter. Again, since the Poisson bracket relations have been used in toto in defining 
the commutator relations in quantum mechanics, one should really read this chapter with 
care. At the end of chapter 10, we have included a section that deals exclusively with 
some of the classical-quantum analogies, where we shall show the link between the two. 
The important aspect of the Poisson bracket relations is that they are invariant under the 
canonical transformations. 

Simeon Denis Poisson (1781 - 1840) is one of the few people who in the early nineteenth 
century contributed to dynamics not just as a mathematician but also as a physicist. When 
he was 17, his genius was quickly recognised by Lagrange, whose course in analysis was 
attended by Poisson. He is celebrated for his theoretical contributions to electricity and 
magnetism, elasticity, calculus of variation, differential geometry, theory of probability, sur¬ 
face tension, diffusion of heat, celestial mechanics, etc. He joined as a demonstrator at the 
École Polytechnique in 1800, became an assistant to Fourier and immediately succeeded 
the latter to his professorial chair. In 1808, he was appointed astronomer at the Bureau 
des Longitudes and succeeded Laplace as its chief mathematician in 1827. Among other 
contributors to the field of Poisson’s bracket, the notable ones are Jacobi and Lagrange, but 
only in the present century did the topic enjoy extreme popularity, because of its quantum 
correspondence, mainly through the works of Heisenberg, Ehrenfest and Dirac. 


9.1 DEFINITION 

A Poisson bracket (PB in short) is a special kind of relation between a pair of dynamical 
variables (that is, measurable physical attributes) of any holonomic system, which is found 
to remain invariant under any canonical transformation. Their main utility lies in the fact 
that they can be used to construct new integrals of motion from the known ones. Poisson 
brackets are the classical analogues of commutation relations between operators in quantum 




mechanics. Historically the commutator relations in quantum mechanics were defined in 
analogy with the already existing classically defined Poisson brackets. The concept of the 
Poisson bracket was introduced by S. D. Poisson as early as 1809 and is defined as follows: 

Given any two dynamical variables u(p,q,t) and v(p,q,t), the Poisson bracket of u and 
v is defined as 

    [u, v]_{(p,q)} = \sum_{i=1}^{n} \left( \frac{\partial u}{\partial q_i} \frac{\partial v}{\partial p_i} - \frac{\partial u}{\partial p_i} \frac{\partial v}{\partial q_i} \right)        (9.1) 

where the suffix (p,q) refers to the set (p_i, q_i) of independent variables pertaining to a 
holonomic system with the number of DOF = n, with respect to which the PB is evaluated. 
The suffix (p,q) can be dropped, provided no ambiguity arises from doing so. (Some books 
define PB as a negative of the definition given in Eq. (9.1), that is, with each term having 
the opposite sign.) 
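
The definition (9.1) can be turned into a small numerical routine. The following is a minimal sketch, assuming the dynamical variables are supplied as Python functions of two lists (q, p); the function name poisson_bracket, the step h and the sample oscillator are illustrative choices, not anything prescribed by the text. Partial derivatives are estimated by central differences.

```python
def poisson_bracket(u, v, q, p, h=1e-5):
    """Estimate [u, v] = sum_i (du/dq_i dv/dp_i - du/dp_i dv/dq_i) at (q, p)."""
    def partial(f, which, i):
        # Central difference of f with respect to q_i or p_i.
        q_plus, q_minus = list(q), list(q)
        p_plus, p_minus = list(p), list(p)
        if which == "q":
            q_plus[i] += h
            q_minus[i] -= h
        else:
            p_plus[i] += h
            p_minus[i] -= h
        return (f(q_plus, p_plus) - f(q_minus, p_minus)) / (2 * h)

    total = 0.0
    for i in range(len(q)):
        total += (partial(u, "q", i) * partial(v, "p", i)
                  - partial(u, "p", i) * partial(v, "q", i))
    return total

# One degree of freedom: coordinate, momentum and an oscillator Hamiltonian.
coord = lambda q, p: q[0]
mom = lambda q, p: p[0]
ham = lambda q, p: 0.5 * p[0] ** 2 + 0.5 * q[0] ** 2

point_q, point_p = [0.3], [0.7]
qp_bracket = poisson_bracket(coord, mom, point_q, point_p)   # expect 1
pq_bracket = poisson_bracket(mom, coord, point_q, point_p)   # expect -1
qh_bracket = poisson_bracket(coord, ham, point_q, point_p)   # expect p = 0.7
print(qp_bracket, pq_bracket, qh_bracket)
```

With the sign convention of Eq. (9.1) the routine gives [q, p] = 1 and [q, H] = \partial H/\partial p.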


9.2 SOME USEFUL IDENTITIES 

Throughout this chapter u(p,q,t), v(p,q, t) and w(p,q,t) are assumed to be any three 
dynamical variables pertaining to a holonomic system with the number of DOF = n whose 
generalised coordinates and momenta are denoted by the set (qi,Pi). The following identities 
can be easily proved. 

(i) The PB of any two dynamical variables is anticommutative. 

    [u, v] = -[v, u] 

As a corollary we have 

    [u, u] = -[u, u] = 0        (9.2) 

(ii) If c is a constant, that is, not a function of (p, q, t) then, 

    [cu, v] = [u, cv] = c[u, v]        (9.3) 

(iii) The PBs also satisfy the distributive property 

    [u + v, w] = [u, w] + [v, w]   and   [u, vw] = [u, v]w + v[u, w]        (9.4) 

(iv) The partial derivative of any PB relation can be shown to satisfy 

    \frac{\partial}{\partial t}[u, v] = \left[ \frac{\partial u}{\partial t}, v \right] + \left[ u, \frac{\partial v}{\partial t} \right]        (9.5) 

(v) A famous identity called Jacobi’s identity is given by 

    [u, [v, w]] + [v, [w, u]] + [w, [u, v]] = 0        (9.6) 

(vi) Let w_1, w_2,...,w_n be a set of dynamical quantities (all functions of p,q,t) and let 
F(w_1,...,w_n) be a differentiable function of w_1, w_2,...,w_n. Then, 

    [u, F(w_1,...,w_n)] = \frac{\partial F}{\partial w_1}[u, w_1] + \frac{\partial F}{\partial w_2}[u, w_2] + \cdots + \frac{\partial F}{\partial w_n}[u, w_n]        (9.7) 
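
As an example of how such identities follow directly from the definition (9.1), the product rule for the Poisson bracket can be verified in one line; the steps below use only the ordinary product rule of differentiation.

```latex
[u, vw]
  = \sum_i \left( \frac{\partial u}{\partial q_i}\frac{\partial (vw)}{\partial p_i}
                - \frac{\partial u}{\partial p_i}\frac{\partial (vw)}{\partial q_i} \right)
  = \sum_i \left( \frac{\partial u}{\partial q_i}\frac{\partial v}{\partial p_i}
                - \frac{\partial u}{\partial p_i}\frac{\partial v}{\partial q_i} \right) w
  + v \sum_i \left( \frac{\partial u}{\partial q_i}\frac{\partial w}{\partial p_i}
                - \frac{\partial u}{\partial p_i}\frac{\partial w}{\partial q_i} \right)
  = [u, v]\,w + v\,[u, w]
```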


9.3 ELEMENTARY PBs 


The PBs constructed out of the canonical coordinates and momenta themselves are called 
elementary PBs. It is trivial to show that 


    [q_i, q_j] = 0 = [p_i, p_j]   and   [q_i, p_j] = -[p_j, q_i] = \delta_{ij}        (9.8) 

We also have 

    [u, q_i] = -\frac{\partial u}{\partial p_i}   and   [u, p_i] = \frac{\partial u}{\partial q_i}        (9.9) 

Equations (9.9) imply, for Cartesian coordinates 


    [u, r] = -\nabla_p u   and   [u, p] = \nabla_r u        (9.10) 


Thus, by replacing u(q,p,t) in Eq. (9.9) by the Hamiltonian function H(q,p,t) one obtains 
Hamilton’s equations of motion in terms of the PBs: 


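    \dot q_i = -[H, q_i] = [q_i, H]   and   \dot p_i = -[H, p_i] = [p_i, H]        (9.11) 

For a single particle, and in terms of Cartesian coordinates, Eqs (9.11) take the following 
form 

    \dot r = -[H, r] = [r, H]   and   \dot p = -[H, p] = [p, H]        (9.12) 

Now, since p_i, q_i are explicit functions of time it is possible to invert these relations, namely 
p_i = p_i(t) and q_i = q_i(t), to get t as a function of p_i and q_i. Thus for t as a dynamical 
variable expressed as t(p_i, q_i), we can write 

    [t, H] = \frac{\partial t}{\partial q_i}\frac{\partial H}{\partial p_i} - \frac{\partial t}{\partial p_i}\frac{\partial H}{\partial q_i} = \frac{\partial t}{\partial q_i}\frac{dq_i}{dt} + \frac{\partial t}{\partial p_i}\frac{dp_i}{dt} = \frac{dt}{dt} = 1        (9.13) 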


One can easily identify all the quantum analogues of the PBs expressed in Eqs (9.8) - (9.13). 
We shall now prove two important theorems. 


9.4 POISSON’S THEOREM 

The total time rate of evolution of any dynamical variable u(p,q,t) is given by 

    \frac{du}{dt} = \frac{\partial u}{\partial t} + [u, H]        (9.14) 

Proof: Starting with the left-hand side, 

    \frac{du}{dt} = \frac{\partial u}{\partial t} + \frac{\partial u}{\partial q_i}\dot q_i + \frac{\partial u}{\partial p_i}\dot p_i = \frac{\partial u}{\partial t} + \frac{\partial u}{\partial q_i}\frac{\partial H}{\partial p_i} - \frac{\partial u}{\partial p_i}\frac{\partial H}{\partial q_i} = \frac{\partial u}{\partial t} + [u, H] 

Thus if u is a constant of motion so that du/dt = 0, then by Poisson’s theorem, 

    \frac{\partial u}{\partial t} + [u, H] = 0 

Furthermore, if u does not contain time explicitly, that is, \partial u/\partial t = 0, then [u, H] = 0 
is the required condition for u to be a constant of motion. 

Now it is easy to check for any given Hamiltonian of the form H = H(q,p,t) that one has 

    \frac{dH}{dt} = \frac{\partial H}{\partial t} 

apart from Hamilton’s equations of motion as obtained from Eq. (9.11). 
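
A short worked case shows how the two terms of Poisson’s theorem can cancel for a constant of motion that depends explicitly on time. The free particle and the variable u chosen below are an illustration added here, not an example from the text.

```latex
H = \frac{p^2}{2m}, \qquad u(q, p, t) = q - \frac{p\,t}{m}

\frac{\partial u}{\partial t} = -\frac{p}{m}, \qquad
[u, H] = \frac{\partial u}{\partial q}\frac{\partial H}{\partial p}
       - \frac{\partial u}{\partial p}\frac{\partial H}{\partial q}
       = 1 \cdot \frac{p}{m} - \left(-\frac{t}{m}\right) \cdot 0 = \frac{p}{m}

\frac{du}{dt} = \frac{\partial u}{\partial t} + [u, H] = -\frac{p}{m} + \frac{p}{m} = 0
```

Here u is simply the initial position q(0) of the particle, which is conserved even though [u, H] does not vanish.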


9.5 JACOBI-POISSON THEOREM (OR POISSON’S SECOND THEOREM) 
ON PBs 

If u and v are any two constants of motion of any given holonomic dynamical system, their 
PB is also a constant of motion. This is called Jacobi-Poisson’s theorem or Poisson’s 
second theorem on the PB relations. 

Proof : Consider 

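    \frac{d}{dt}[u, v] = \frac{\partial}{\partial t}[u, v] + [[u, v], H] 

Using Eqs (9.5) and (9.6), we get 

    \frac{d}{dt}[u, v] = \left[ \frac{\partial u}{\partial t}, v \right] + \left[ u, \frac{\partial v}{\partial t} \right] - [[v, H], u] - [[H, u], v] 

                       = \left[ \frac{\partial u}{\partial t} + [u, H], v \right] + \left[ u, \frac{\partial v}{\partial t} + [v, H] \right] 

                       = \left[ \frac{du}{dt}, v \right] + \left[ u, \frac{dv}{dt} \right] 

                       = 0 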

because both du/dt and dv/dt vanish by the requirement that u and v are both constants 
of motion. 

This theorem has profound significance for determining new constants of motion. To 
start with if we have got any two independent constants of motion, then a third one can be 
constructed from the PB of these two, which may result in either a new (that is, independent) 
constant of motion, or trivially either of the first two. If the former is true, we may make 
another pair of new PBs and if we are lucky, we can, in this way generate all the hidden 
constants of the motion. It should be remembered that a dynamical system having n 
degrees of freedom can have at the most 2n-1 independent constants of motion, which are 
functions of p_i’s and q_i’s only, and one constant of motion that must involve time explicitly. 

Examples 

1. Consider an isotropic 2-D harmonic oscillator that has the Hamiltonian of the form 

    H(x, y, p_x, p_y) = \frac{1}{2m}(p_x^2 + p_y^2) + \frac{1}{2} k (x^2 + y^2) 

Being two dimensional it has only three independent integrals of motion. One is of course 
the energy integral E, since H does not contain time explicitly. Again, since the force field 




here is a central one, the angular momentum perpendicular to the x - y plane 

    L = x p_y - y p_x 

is also conserved. To construct a third one, note that H can be decomposed into two parts 
each corresponding to a 1-D harmonic oscillator and conserving its own energy component 
so that the difference of energies of these two components is also a constant of motion. 
Therefore, 

    \frac{1}{2m}(p_x^2 - p_y^2) + \frac{1}{2} k (x^2 - y^2) = B  (say)        (9.16) 

is a constant of motion. 

Now let us construct a new integral of motion, say C, from L and B, using the Jacobi-Poisson 
theorem, given by 

    C = [L, B] = [x p_y - y p_x, B] = \frac{2}{m}(p_x p_y + m k x y)        (9.17) 

C is obviously independent of L and B, but not simultaneously of L, B and H, as 

    H^2 = E^2 = B^2 + \frac{1}{4} C^2 + \omega^2 L^2        (9.18) 

where \omega = \sqrt{k/m}. One can also check that 

    [C, L] = 4B   and   [C, B] = -4kL/m 

We get back B and L respectively, and hence no further independent constants of motion 
can be generated in this process. 

In order to find the nature of C one has to write down the complete solution for x and 
y, given by 

    x = a \sin[\omega(t - t_0)],    y = b \sin[\omega(t - t_0) + \theta]        (9.19) 

where \theta is the constant phase difference between the two. The four constants of motion 
here are a, b, \theta and t_0. One should remember that only three of these can have no explicit 
time dependence in their expressions. Now from Eqs (9.17) and (9.19), we get 

    C = 2 a b m \omega^2 \cos\theta 
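
The bracket relations of this example are easy to confirm numerically. The sketch below is an illustration only: the values of m, k and the phase-space point are arbitrary assumptions, and the gradients of L, B, C and H are written out by hand so that the bracket of Eq. (9.1) can be evaluated exactly (no finite differences).

```python
def bracket(grad_u, grad_v):
    """[u, v] from gradients ordered as (d/dx, d/dy, d/dpx, d/dpy)."""
    ux, uy, upx, upy = grad_u
    vx, vy, vpx, vpy = grad_v
    return ux * vpx + uy * vpy - upx * vx - upy * vy

# Arbitrary illustrative parameters and phase-space point.
m, k = 1.5, 2.0
x, y, px, py = 0.4, -0.7, 0.9, 0.3

H = (px**2 + py**2) / (2 * m) + 0.5 * k * (x**2 + y**2)
L = x * py - y * px
B = (px**2 - py**2) / (2 * m) + 0.5 * k * (x**2 - y**2)
C = (2.0 / m) * (px * py + m * k * x * y)

# Hand-computed gradients of the four quantities.
grad_H = (k * x, k * y, px / m, py / m)
grad_L = (py, -px, -y, x)
grad_B = (k * x, -k * y, px / m, -py / m)
grad_C = (2 * k * y, 2 * k * x, 2 * py / m, 2 * px / m)

LB = bracket(grad_L, grad_B)   # should reproduce C
CL = bracket(grad_C, grad_L)   # should give 4B
CB = bracket(grad_C, grad_B)   # should give -4kL/m
print(LB - C, CL - 4 * B, CB + 4 * k * L / m)
print(H**2 - (B**2 + C**2 / 4 + (k / m) * L**2))
```

The same helper also shows that L, B and C each have a vanishing bracket with H, as required of constants of motion with no explicit time dependence.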

2. The constants of motion of a 2-D isotropic harmonic oscillator are, however, best 
studied by the Runge-Lenz tensor of second rank defined by 

    A_{ij} = \frac{1}{2m} p_i p_j + \frac{1}{2} k x_i x_j        (9.20) 

Since we know that dp_i/dt = -k x_i are the equations of motion of this system, they 
would immediately satisfy dA_{ij}/dt = 0, which means that the A_{ij}’s are all constants of motion. 
For a 2-D isotropic harmonic oscillator as expected, A_{11}, A_{12} (= A_{21}) and A_{22} are the 
only three independent constants of motion. Therefore, 

    the trace of the A matrix = A_{11} + A_{22} = H = constant 

and 

    the determinant of the A matrix = A_{11} A_{22} - A_{12}^2 = \frac{\omega^2 L^2}{4} = const. 

are just two other dependent constants of motion. 


9.6 INVARIANCE OF PB UNDER CANONICAL TRANSFORMATIONS 


Consider a canonical transformation (8.5) and two dynamical variables u(p,q,t) and v(p,q,t), 
so that they transform into \tilde u(P,Q,t) and \tilde v(P,Q,t), given by 

    u(p,q,t) = u(p(P,Q,t), q(P,Q,t), t) = \tilde u(P,Q,t) 

and 

    v(p,q,t) = v(p(P,Q,t), q(P,Q,t), t) = \tilde v(P,Q,t) 

We want to show that 

    [u, v]_{(p,q)} = [\tilde u, \tilde v]_{(P,Q)}        (9.22) 


Proof 


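In what follows we use Maxwell’s relations (8.15) to obtain 

    \frac{\partial u}{\partial q_i} = \frac{\partial \tilde u}{\partial Q_j}\frac{\partial Q_j}{\partial q_i} + \frac{\partial \tilde u}{\partial P_j}\frac{\partial P_j}{\partial q_i} = \frac{\partial \tilde u}{\partial Q_j}\frac{\partial p_i}{\partial P_j} - \frac{\partial \tilde u}{\partial P_j}\frac{\partial p_i}{\partial Q_j} 

and 

    \frac{\partial u}{\partial p_i} = \frac{\partial \tilde u}{\partial Q_j}\frac{\partial Q_j}{\partial p_i} + \frac{\partial \tilde u}{\partial P_j}\frac{\partial P_j}{\partial p_i} = -\frac{\partial \tilde u}{\partial Q_j}\frac{\partial q_i}{\partial P_j} + \frac{\partial \tilde u}{\partial P_j}\frac{\partial q_i}{\partial Q_j} 

Therefore, 

    [u, v]_{(p,q)} = \frac{\partial u}{\partial q_i}\frac{\partial v}{\partial p_i} - \frac{\partial u}{\partial p_i}\frac{\partial v}{\partial q_i} 

      = \frac{\partial \tilde u}{\partial Q_j}\frac{\partial p_i}{\partial P_j}\frac{\partial v}{\partial p_i} - \frac{\partial \tilde u}{\partial P_j}\frac{\partial p_i}{\partial Q_j}\frac{\partial v}{\partial p_i} + \frac{\partial \tilde u}{\partial Q_j}\frac{\partial q_i}{\partial P_j}\frac{\partial v}{\partial q_i} - \frac{\partial \tilde u}{\partial P_j}\frac{\partial q_i}{\partial Q_j}\frac{\partial v}{\partial q_i} 

      = \frac{\partial \tilde u}{\partial Q_j}\left[ \frac{\partial v}{\partial p_i}\frac{\partial p_i}{\partial P_j} + \frac{\partial v}{\partial q_i}\frac{\partial q_i}{\partial P_j} \right] - \frac{\partial \tilde u}{\partial P_j}\left[ \frac{\partial v}{\partial p_i}\frac{\partial p_i}{\partial Q_j} + \frac{\partial v}{\partial q_i}\frac{\partial q_i}{\partial Q_j} \right] 

      = \frac{\partial \tilde u}{\partial Q_j}\frac{\partial \tilde v}{\partial P_j} - \frac{\partial \tilde u}{\partial P_j}\frac{\partial \tilde v}{\partial Q_j} 

      = [\tilde u, \tilde v]_{(P,Q)} 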

which is what we wanted to prove. 
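
The invariance can also be checked numerically for a simple case. The sketch below assumes one degree of freedom and the linear transformation Q = aq + bp, P = cq + dp with ad - bc = 1 (the canonical map of Problem 8.7 (vi)); the coefficients, the test functions u and v, and the finite-difference helper pb are illustrative choices made here.

```python
import math

def pb(u, v, q, p, h=1e-5):
    """Central-difference Poisson bracket for one degree of freedom."""
    du_dq = (u(q + h, p) - u(q - h, p)) / (2 * h)
    du_dp = (u(q, p + h) - u(q, p - h)) / (2 * h)
    dv_dq = (v(q + h, p) - v(q - h, p)) / (2 * h)
    dv_dp = (v(q, p + h) - v(q, p - h)) / (2 * h)
    return du_dq * dv_dp - du_dp * dv_dq

# Linear canonical transformation with ad - bc = 1.
a, b, c, d = 2.0, 1.0, 3.0, 2.0

def to_new(q, p):
    return a * q + b * p, c * q + d * p

def to_old(Q, P):
    # Inverse of the linear map (valid because ad - bc = 1).
    return d * Q - b * P, -c * Q + a * P

# Two arbitrary dynamical variables in the old coordinates...
u = lambda q, p: q**2 * p
v = lambda q, p: math.sin(q) + p**3
# ...and the same variables expressed through the new coordinates.
u_new = lambda Q, P: u(*to_old(Q, P))
v_new = lambda Q, P: v(*to_old(Q, P))

q0, p0 = 0.3, -0.4
Q0, P0 = to_new(q0, p0)
old_value = pb(u, v, q0, p0)           # bracket evaluated with respect to (q, p)
new_value = pb(u_new, v_new, Q0, P0)   # bracket evaluated with respect to (Q, P)
exact = 6 * q0 * p0**3 - q0**2 * math.cos(q0)
print(old_value, new_value, exact)
```

Both evaluations agree with the analytic value 6qp^3 - q^2 cos q to within the finite-difference error.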

This result has also got far reaching significance. We know that the electromagnetic 
gauge transformation, Lorentz and Galilean transformations are all CTs. Hence any PB 
defined for a pair of dynamical variables must remain unchanged for all the above general 
transformations. So any vector or scalar quantity that is expressible as a PB must remain 
invariant under rotation, translation, GT, Lorentz transformation and so on. 




9.7 PBs INVOLVING ANGULAR MOMENTUM 


We know that the canonically conjugate momentum corresponding to any angle variable 
represents a component of angular momentum. Nevertheless, the total angular momentum 
of a system is best defined in terms of the Cartesian coordinates and momenta of the 
individual particles. Let us consider a one-particle (or at most an effectively one-particle) 
system with the ith Cartesian component of its total angular momentum vector, given by 
L_i = \epsilon_{ijk} x_j p_k. Now PBs formed between any pair of the components, say L_i and L_j, are 
found to satisfy the relation 

    [L_i, L_j] = \epsilon_{ijk} L_k        (9.22) 

When expressed in terms of x, y, z components, the relation (9.22) actually stands for 

    [L_x, L_y] = L_z,   [L_y, L_z] = L_x,   [L_z, L_x] = L_y   and 

    [L_x, L_x] = [L_y, L_y] = [L_z, L_z] = 0 

The PBs between a Cartesian component of L and any of the Cartesian components of r 
and p also satisfy the following additional relations: 

    [L_i, x_j] = \epsilon_{ijk} x_k   and   [L_i, p_j] = \epsilon_{ijk} p_k        (9.23) 


All the PBs that are expressed in Eqs (9.22) and (9.23) are proved as follows. For example, 

    [L_i, x_j] = [\epsilon_{ikl} x_k p_l, x_j] = -\epsilon_{ikl} x_k \delta_{jl}    (using Eq. (9.7)) 
               = -\epsilon_{ikj} x_k = \epsilon_{ijk} x_k 

and 

    [L_i, L_j] = [\epsilon_{ink} x_n p_k, \epsilon_{jlm} x_l p_m] 
               = \epsilon_{ink} \epsilon_{jlm} [x_n p_k, x_l p_m] 
               = \epsilon_{ink} \epsilon_{jlm} \{ x_l p_k [x_n, p_m] + x_n p_m [p_k, x_l] \} 
               = \epsilon_{ink} \epsilon_{jlm} \{ \delta_{nm} x_l p_k - \delta_{kl} x_n p_m \} 
               = \epsilon_{ink} \epsilon_{jln} x_l p_k - \epsilon_{ink} \epsilon_{jkm} x_n p_m 
               = \epsilon_{inm} \epsilon_{jln} x_l p_m - \epsilon_{ilk} \epsilon_{jkm} x_l p_m 
               = (\epsilon_{nmi} \epsilon_{njl} - \epsilon_{kil} \epsilon_{kmj}) x_l p_m 
               = (\delta_{mj}\delta_{il} - \delta_{ml}\delta_{ij} - \delta_{im}\delta_{lj} + \delta_{ij}\delta_{lm}) x_l p_m 
               = (\delta_{il}\delta_{jm} - \delta_{im}\delta_{jl}) x_l p_m 
               = \epsilon_{kij} \epsilon_{klm} x_l p_m 
               = \epsilon_{ijk} L_k 
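
The index manipulations above can be cross-checked by brute force. The following sketch evaluates the bracket of Eq. (9.1) for all nine pairs (L_i, L_j) at one arbitrarily chosen point; the point, the helper names and the 0-based index convention are assumptions made for this illustration.

```python
def levi_civita(i, j, k):
    """Permutation symbol for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) / 2

def ang_mom(i, x, p):
    """L_i = eps_ijk x_j p_k."""
    return sum(levi_civita(i, j, k) * x[j] * p[k]
               for j in range(3) for k in range(3))

def grad_ang_mom(i, x, p):
    """Gradient of L_i ordered as (d/dx_1, d/dx_2, d/dx_3, d/dp_1, d/dp_2, d/dp_3)."""
    d_dx = [sum(levi_civita(i, j, k) * p[k] for k in range(3)) for j in range(3)]
    d_dp = [sum(levi_civita(i, j, k) * x[j] for j in range(3)) for k in range(3)]
    return d_dx + d_dp

def bracket(grad_u, grad_v):
    """[u, v] = sum_i (du/dx_i dv/dp_i - du/dp_i dv/dx_i)."""
    return sum(grad_u[i] * grad_v[i + 3] - grad_u[i + 3] * grad_v[i]
               for i in range(3))

# Arbitrary illustrative phase-space point.
x = [0.2, -1.1, 0.7]
p = [1.3, 0.4, -0.5]

max_error = 0.0
for i in range(3):
    for j in range(3):
        lhs = bracket(grad_ang_mom(i, x, p), grad_ang_mom(j, x, p))
        rhs = sum(levi_civita(i, j, k) * ang_mom(k, x, p) for k in range(3))
        max_error = max(max_error, abs(lhs - rhs))
print(max_error)
```

The largest deviation between [L_i, L_j] and \epsilon_{ijk} L_k is at the level of rounding error.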


One simple meaning of this last result is that if we have any two Cartesian components 
of L as constants of motion, then the third component and hence L as a whole must be 
constants of motion. This result can further be generalised because of the following PB 
relation, given by 

    [a \cdot L, b \cdot L] = (a \times b) \cdot L        (9.24) 

where a and b are any two constant vectors. So if a and b represent any two unit 
vectors along which the components of L are separately conserved, the component of L 
perpendicular to the plane formed by a and b is also conserved. 

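Next we would like to show the connection between rotation and angular momentum 
through the PB relations. Consider any scalar function \phi = \phi(r, p) and an infinitesimal 
rotation of the coordinate system about any arbitrary direction \hat n passing through the 
origin, by an angle \delta\theta. This results in 

    r \to r + \delta r = r - \delta\theta \, \hat n \times r 

    p \to p + \delta p = p - \delta\theta \, \hat n \times p 

and 

    \phi \to \phi(r + \delta r, p + \delta p) = \phi(r, p) 

as the scalar function \phi by definition should remain unchanged under rotation of coordinates. 
Now, if the scalar function \phi does not contain any other vectors than r and p, 

    0 = \delta\phi = \frac{\partial \phi}{\partial x_i}\delta x_i + \frac{\partial \phi}{\partial p_i}\delta p_i 

      = -\delta\theta \left( \frac{\partial \phi}{\partial x_i}\epsilon_{ijk} n_j x_k + \frac{\partial \phi}{\partial p_i}\epsilon_{ijk} n_j p_k \right) 

      = -\delta\theta \, n_j \left( \frac{\partial \phi}{\partial x_i}\frac{\partial (\epsilon_{jkl} x_k p_l)}{\partial p_i} - \frac{\partial \phi}{\partial p_i}\frac{\partial (\epsilon_{jlk} x_l p_k)}{\partial x_i} \right) 

      = -\delta\theta \, [\phi, L_j n_j] 

Since \delta\theta is arbitrary, for any systemic scalar function \phi = \phi(r, p), 

    [\phi, L \cdot \hat n] = 0        (9.25) 

Similarly, for any systemic vector function f = f(r, p), it can be shown that 

    [f, L \cdot \hat n] = \hat n \times f        (9.26) 

as \delta f = -\delta\theta \, \hat n \times f. In terms of the i, j, k notation, Eqs (9.25) and (9.26) may be 
written as 

    [\phi, L_i] = 0        (9.27) 

    [f_i, L_j] = \epsilon_{ijk} f_k        (9.28) 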


Since these results are valid for any arbitrary small rotation of the coordinate system, as 
well as for any arbitrary choice of the functions <f>(r, p) and /(r, p), the PB relations 
given by Eqs (9.27) and (9.28) do in fact reflect the fundamental structural properties of 
the 3-D Euclidean space. Furthermore, the PBs expressed in Eqs (9.22) - (9.24) are merely 
some special cases of the relation (9.28). Some more examples of the use of the relation 
(9.27) would be 

    [r \cdot p, L_i] = 0        (9.29) 

and 

[Li,H] = 0 (9.30) 

provided H is a scalar function of only vectors r and p, which is obviously satisfied by a 
central force problem, otherwise see Prob no. 9.1 (iii) for general solution. 




9.8 DIRAC’S FORMULATION OF THE GENERALISED HAMILTONIAN 


For a holonomic system having the number of DOF = n, we have 2n linearly independent 
generalised coordinates and velocities q_1, q_2,...,q_n; \dot q_1, \dot q_2,...,\dot q_n. In order to pass from 
the Lagrangian to the Hamiltonian formulation one requires that all the conjugate momenta 
defined by 

    p_i = \frac{\partial L}{\partial \dot q_i},    i = 1, 2,...,n 

be linearly independent of one another. The above requirement is, however, not always 
fulfilled. For example, in the covariant formulation of the (special) relativistic dynamics, 
the four coordinates x_\mu (\mu = 1, 2, 3, 4) are linearly independent but the corresponding 
four momenta p_\mu (\mu = 1, 2, 3, 4) are not, simply because of the constraint relation, 
p_\mu p_\mu = -m_0^2 c^2. Dirac in 1950 gave a general formulation for the Hamiltonian dynamics 
of such systems. 

Suppose that there are m independent constraint relations involving momenta and coordinates 
only, to be given by 

    g_i(q_1,...,q_n; p_1, p_2,...,p_n) = 0,    i = 1,...,m < n        (9.31) 

and 

    p_j = \frac{\partial L(q_1, q_2,...,q_n; \dot q_1, \dot q_2,...,\dot q_n)}{\partial \dot q_j},    j = 1,...,n        (9.32) 

For small arbitrary variations \delta q_j and \delta \dot q_j in the coordinates and velocities, the relations 
(9.31) will impose m restrictions on the variations \delta p_j: 

    \frac{\partial g_i}{\partial q_j}\delta q_j + \frac{\partial g_i}{\partial p_j}\delta p_j = 0,    i = 1,...,m        (9.33) 


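Now since the Hamiltonian $H = H(q_1, \ldots, q_n; p_1, \ldots, p_n) = p_j \dot q_j - L$, we have

$$\delta H = \frac{\partial H}{\partial q_j}\,\delta q_j + \frac{\partial H}{\partial p_j}\,\delta p_j = p_j\,\delta \dot q_j + \dot q_j\,\delta p_j - \frac{\partial L}{\partial q_j}\,\delta q_j - \frac{\partial L}{\partial \dot q_j}\,\delta \dot q_j = \dot q_j\,\delta p_j - \frac{\partial L}{\partial q_j}\,\delta q_j \tag{9.34}$$

where not all the $\delta p_j$ are arbitrary. In order to make all $\delta p_j$ and $\delta q_j$ arbitrary one may combine Eqs (9.33) and (9.34) with the introduction of m Lagrange multipliers, say $C_1, \ldots, C_m$, such that

$$\dot q_j = \frac{\partial H}{\partial p_j} + C_i \frac{\partial g_i}{\partial p_j} = [q_j, H] + C_i [q_j, g_i]$$

and

$$-\dot p_j = -\frac{\partial L}{\partial q_j} = \frac{\partial H}{\partial q_j} + C_i \frac{\partial g_i}{\partial q_j} = [H, p_j] + C_i [g_i, p_j] \tag{9.35}$$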




Equations (9.35) are called Dirac’s generalised form of Hamilton’s equations of motion, in which the Hamiltonian $H = p_i \dot q_i - L$ is effectively replaced by $H + C_i g_i = p_i \dot q_i - L + C_i g_i$.

Now for any dynamical variable $f = f(q_1, q_2, \ldots, q_n; p_1, \ldots, p_n; t)$,

$$\frac{df}{dt} = \frac{\partial f}{\partial t} + \frac{\partial f}{\partial q_j}\,\dot q_j + \frac{\partial f}{\partial p_j}\,\dot p_j = \frac{\partial f}{\partial t} + [f, H] + C_i [f, g_i] \tag{9.36}$$

Finally one may note that in order to quantise any classical system, a general formulation in terms of PBs and the Hamiltonian is quite essential, as it becomes very easy to write Schrödinger’s equation once the explicit form of the Hamiltonian is known, or the commutation relations can be formulated, provided the classical PB relations are known.


9.9 LAGRANGE BRACKET (LB) 


Another class of useful relations between dynamical variables are the so-called Lagrange brackets, introduced by Lagrange in 1808. For any two independent dynamical variables $u(p_i, q_i, t)$ and $v(p_i, q_i, t)$ pertaining to a dynamical system with the number of DOF = n, their Lagrange bracket is defined as

$$(u, v) = \sum_{i=1}^{n} \left( \frac{\partial q_i}{\partial u}\frac{\partial p_i}{\partial v} - \frac{\partial q_i}{\partial v}\frac{\partial p_i}{\partial u} \right) = \sum_{i=1}^{n} \frac{\partial(q_i, p_i)}{\partial(u, v)} \tag{9.37}$$

Clearly the Lagrange bracket is antisymmetric:

$$(u, v) = -(v, u) \tag{9.38}$$


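Let us suppose that there are 2n independent functions $u_1, u_2, \ldots, u_{2n}$ of the variables $(q_i, p_i)$. Conversely, $q_1, \ldots, q_n; p_1, \ldots, p_n$ may be regarded as functions of $u_1, u_2, \ldots, u_{2n}$. From their definitions one may suspect that there is some relation between Poisson brackets $[u_r, u_s]$ and Lagrange brackets $(u_r, u_s)$. Indeed so; for let us consider

$$\sum_{r=1}^{2n} [u_r, u_i](u_r, u_j) = \sum_{r=1}^{2n}\sum_{k=1}^{n}\sum_{l=1}^{n} \left( \frac{\partial u_r}{\partial q_k}\frac{\partial u_i}{\partial p_k} - \frac{\partial u_r}{\partial p_k}\frac{\partial u_i}{\partial q_k} \right) \left( \frac{\partial q_l}{\partial u_r}\frac{\partial p_l}{\partial u_j} - \frac{\partial q_l}{\partial u_j}\frac{\partial p_l}{\partial u_r} \right) \tag{9.39}$$

But

$$\sum_{r=1}^{2n} \frac{\partial u_r}{\partial q_k}\frac{\partial q_l}{\partial u_r} = \delta_{kl} = \sum_{r=1}^{2n} \frac{\partial u_r}{\partial p_k}\frac{\partial p_l}{\partial u_r} \tag{9.40}$$

where $\delta_{kl}$ is the Kronecker delta. Moreover,

$$\sum_{r=1}^{2n} \frac{\partial u_r}{\partial q_k}\frac{\partial p_l}{\partial u_r} = 0 = \sum_{r=1}^{2n} \frac{\partial u_r}{\partial p_k}\frac{\partial q_l}{\partial u_r} \tag{9.41}$$

Hence, Eq. (9.39) reduces to

$$\sum_{r=1}^{2n} [u_r, u_i](u_r, u_j) = \sum_{k=1}^{n} \left( \frac{\partial u_i}{\partial p_k}\frac{\partial p_k}{\partial u_j} + \frac{\partial u_i}{\partial q_k}\frac{\partial q_k}{\partial u_j} \right) = \delta_{ij} \tag{9.42}$$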


If we regard the PB $[u_r, u_i]$ as the element $P_{ri}$ of the 2n x 2n matrix $[P]$ and the LB $(u_r, u_j)$ as the element $L_{rj}$ of the 2n x 2n matrix $[L]$, then Eq. (9.42) can be written in the matrix form

$$[P]^T [L] = [I_{2n}] \tag{9.43}$$

where $[I_{2n}]$ is the unit matrix of order 2n and $[P]^T$ means the transpose of $[P]$. Since the determinant of the product of two matrices is the product of their determinants, it follows from Eq. (9.43) that the determinants of the matrices $[P]$ and $[L]$ are reciprocals of each other. It is also clear from Eq. (9.43) that one type of bracket determines the other, so that if the Poisson bracket is invariant under canonical transformation the Lagrange bracket is also invariant.
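The matrix relation between the two kinds of bracket can be illustrated for n = 1. The sketch below is not from the original text: it assumes the hypothetical, non-canonical pair $u_1 = q^2$, $u_2 = qp$ (with $q > 0$), whose inverse is $q = \sqrt{u_1}$, $p = u_2/\sqrt{u_1}$, and checks numerically that the transpose of the Poisson matrix times the Lagrange matrix is the unit matrix.

```python
import math

def partial(f, args, k, h=1e-6):
    """Central-difference partial derivative of f with respect to its k-th argument."""
    up, dn = list(args), list(args)
    up[k] += h
    dn[k] -= h
    return (f(*up) - f(*dn)) / (2 * h)

# Illustrative (assumed) functions u_r(q, p) and their inverses q(u), p(u), valid for q > 0.
u = [lambda q, p: q * q, lambda q, p: q * p]
q_of_u = lambda u1, u2: math.sqrt(u1)
p_of_u = lambda u1, u2: u2 / math.sqrt(u1)

def poisson_matrix(q, p):
    """P[r][s] = [u_r, u_s] evaluated at (q, p)."""
    return [[partial(u[r], (q, p), 0) * partial(u[s], (q, p), 1)
             - partial(u[r], (q, p), 1) * partial(u[s], (q, p), 0)
             for s in range(2)] for r in range(2)]

def lagrange_matrix(u1, u2):
    """L[r][s] = (u_r, u_s) evaluated at (u1, u2)."""
    return [[partial(q_of_u, (u1, u2), r) * partial(p_of_u, (u1, u2), s)
             - partial(q_of_u, (u1, u2), s) * partial(p_of_u, (u1, u2), r)
             for s in range(2)] for r in range(2)]

def pt_times_l(q, p):
    """Matrix product [P]^T [L] at the phase-space point (q, p)."""
    P = poisson_matrix(q, p)
    Lm = lagrange_matrix(u[0](q, p), u[1](q, p))
    return [[sum(P[r][i] * Lm[r][j] for r in range(2)) for j in range(2)]
            for i in range(2)]

for row in pt_times_l(1.5, 0.7):
    print([round(x, 6) for x in row])
```

For this choice $[u_1, u_2] = 2u_1$ and $(u_1, u_2) = 1/(2u_1)$, so neither bracket equals unity on its own, yet their product does.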


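The Lagrange or Poisson brackets can be used to test whether a given transformation is canonical. If the transformation $Q_i = Q_i(q, p, t)$, $P_i = P_i(q, p, t)$, $i = 1, 2, \ldots, n$ is to be canonical, the new variables $Q_i$ and $P_i$ must satisfy Eq. (8.17). Since the old variables $(q_i, p_i)$ may be regarded as functions of the new variables, the necessary and sufficient condition for a transformation to be canonical is that, for a fixed value of time, the expression

$$\sum_i (p_i\,dq_i - P_i\,dQ_i) = \sum_r \left( \sum_i p_i \frac{\partial q_i}{\partial Q_r} - P_r \right) dQ_r + \sum_r \left( \sum_i p_i \frac{\partial q_i}{\partial P_r} \right) dP_r \tag{9.44}$$

be a perfect differential. The conditions for this are

$$\frac{\partial}{\partial Q_s}\left( \sum_i p_i \frac{\partial q_i}{\partial Q_r} - P_r \right) = \frac{\partial}{\partial Q_r}\left( \sum_i p_i \frac{\partial q_i}{\partial Q_s} - P_s \right),$$

$$\frac{\partial}{\partial P_s}\left( \sum_i p_i \frac{\partial q_i}{\partial P_r} \right) = \frac{\partial}{\partial P_r}\left( \sum_i p_i \frac{\partial q_i}{\partial P_s} \right)$$

and

$$\frac{\partial}{\partial P_s}\left( \sum_i p_i \frac{\partial q_i}{\partial Q_r} - P_r \right) = \frac{\partial}{\partial Q_r}\left( \sum_i p_i \frac{\partial q_i}{\partial P_s} \right) \tag{9.45}$$

However $Q_i$ and $P_i$ are independent variables, so that conditions (9.45) can easily be shown to reduce to

$$(Q_r, Q_s) = 0, \qquad (P_r, P_s) = 0 \qquad \text{and} \qquad (Q_r, P_s) = \delta_{rs} \tag{9.46}$$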

The necessary and sufficient condition that a transformation be canonical can also be reduced to the requirement that for a fixed value of time the expression

$$\sum_i (P_i\,dQ_i - p_i\,dq_i) = \sum_r \left( \sum_i P_i \frac{\partial Q_i}{\partial q_r} - p_r \right) dq_r + \sum_r \left( \sum_i P_i \frac{\partial Q_i}{\partial p_r} \right) dp_r \tag{9.47}$$

be a perfect differential. Proceeding in an exactly similar fashion as we did to obtain conditions (9.46) we can see that the requirement that expression (9.47) be a perfect differential is equivalent to the conditions

$$[Q_r, Q_s] = 0, \qquad [P_r, P_s] = 0 \qquad \text{and} \qquad [Q_r, P_s] = \delta_{rs} \tag{9.48}$$


We see that the conditions (9.48) for the canonicality of a given transformation are truly 
independent of the knowledge of any Hamiltonian or of any specific property of the dynamical 
system, except for its being holonomic with specified degrees of freedom. 
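As an illustration of this test for n = 1, where only $[Q, P] = 1$ is non-trivial, the following minimal sketch evaluates the bracket by central differences. The two transformations are assumed examples, not taken from the text: $Q = \tan^{-1}(q/p)$, $P = (q^2 + p^2)/2$ (canonical) and the scaling $Q = 2q$, $P = p$ (not canonical).

```python
import math

def pb(u, v, q, p, h=1e-6):
    """Poisson bracket [u, v] for one degree of freedom, by central differences."""
    du_dq = (u(q + h, p) - u(q - h, p)) / (2 * h)
    du_dp = (u(q, p + h) - u(q, p - h)) / (2 * h)
    dv_dq = (v(q + h, p) - v(q - h, p)) / (2 * h)
    dv_dp = (v(q, p + h) - v(q, p - h)) / (2 * h)
    return du_dq * dv_dp - du_dp * dv_dq

def is_canonical(Q, P, q, p, tol=1e-6):
    """For n = 1, [Q, Q] = [P, P] = 0 hold trivially; test [Q, P] = 1 at (q, p)."""
    return abs(pb(Q, P, q, p) - 1.0) < tol

# Assumed example 1: action-angle-like variables of the unit oscillator.
Q_polar = lambda q, p: math.atan2(q, p)          # equals arctan(q/p) for p > 0
P_polar = lambda q, p: 0.5 * (q * q + p * p)

# Assumed example 2: a pure rescaling of the coordinate.
Q_scaled = lambda q, p: 2.0 * q
P_scaled = lambda q, p: p

print(is_canonical(Q_polar, P_polar, 0.8, 1.3))    # bracket equals 1
print(is_canonical(Q_scaled, P_scaled, 0.8, 1.3))  # bracket equals 2
```

Note that no Hamiltonian enters anywhere in the check, in line with the remark above. A single sample point can of course only refute canonicality, not prove it.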


9.10 SUMMARY 

Poisson bracket relations and some of their properties (1809) were known about a quarter of 
a century before Hamilton’s equations of motion were formulated (1835) or the properties 
of the contact transformations were proved by Jacobi (1837). The idea of the Lagrange 
brackets was introduced by Lagrange in 1808. In fact some books introduce them before 
the Poisson brackets. 

It should be noted that some textbooks define the Poisson bracket relation in the reverse order of the partial derivatives with respect to $q_i$ and $p_i$. As a result there is an interchange of signs between the two sets of terms.

The fundamental Poisson brackets, such as $[P_i, P_j] = [Q_i, Q_j] = 0$, are extremely useful for testing the canonicality of any given phase space transformation.

Poisson brackets can be a useful tool for finding out some of the hidden constants of motion. They are also useful in describing the infinitesimal contact transformations for rotation, translation or even in terms of time evolution of a system.

An important application of the concept of PBs is illustrated in Dirac’s formulation of generalised Hamiltonian systems, which is actually the starting point of the modern study of constrained dynamics. Dirac has written a book on this fascinating subject.


PROBLEMS 

9.1 Show that operationally the vector product of any two vectors, the PB of any two dynamical variables and the multiplication of any two n x n matrices share all the rules in common without exception. Evaluate

$$[[[A, B], C], D] + [[[C, D], A], B] + [[[D, A], B], C]$$

and

$$[[[A, B], C], D] + [[[B, C], D], A] + [[[C, D], A], B] + [[[D, A], B], C]$$




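9.2 Evaluate the following Poisson brackets,

(i) $[\mathbf{a}\cdot\mathbf{r}, \mathbf{b}\cdot\mathbf{p}]$

(ii) $[(\mathbf{a}\cdot\mathbf{r})^2, \mathbf{p}]$

(iii) $[\mathbf{L}, H]$ and

(iv) $[\mathbf{f}\cdot\mathbf{L}, \mathbf{g}\cdot\mathbf{L}]$

where $\mathbf{a}$ and $\mathbf{b}$ are constant vectors, and $\mathbf{f} = \mathbf{f}(\mathbf{r}, \mathbf{p})$, $\mathbf{g} = \mathbf{g}(\mathbf{r}, \mathbf{p})$ and $H = p^2/2m + V(\mathbf{r})$.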

9.3 Evaluate [£,-, Ajk) and [Aj k , 

where $\mathbf{L} = \mathbf{r}\times\mathbf{p}$ and $A_{ij} = x_i x_j + p_i p_j$.

9.4 A charged particle carrying an electric charge e is moving in an inhomogeneous field of magnetic induction $\mathbf{B}$. Show that any two Cartesian components of its velocity satisfy the following PB relation

$$[v_i, v_j] = \frac{e}{m^2}\,\epsilon_{ijk} B_k$$


9.5 Using PB relations show that the modified Runge-Lenz vector

$$\mathbf{A} = -\left( K\mu^2\,\mathbf{r}/r + \mathbf{L}\times\mathbf{p} \right)$$

for planetary motion is a constant of motion, where $K = G(M + m)$, $\mathbf{p} = \mu\mathbf{v}$, $\mathbf{L} = \mathbf{r}\times\mathbf{p}$, and $\mu$ = reduced mass of the system = $mM/(M + m)$. Evaluate $[A_i, A_j]$, $[L_i, A_j]$ and $[A_i, H]$.

9.6 Prove that the value of any function $f(p(t), q(t))$ of coordinates and momenta of a system at time t can be expressed in terms of the values of p and q at t = 0 as follows:

$$f(p(t), q(t)) = f_0 + \frac{t}{1!}[f_0, H] + \frac{t^2}{2!}[[f_0, H], H] + \frac{t^3}{3!}[[[f_0, H], H], H] + \ldots$$

where $f_0 = f(p(t = 0), q(t = 0))$ and $H = H(p(t = 0), q(t = 0))$, the latter being the Hamiltonian at t = 0.

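9.7 For an infinitesimal transformation $P_i = p_i + \Delta p_i$, $Q_i = q_i + \Delta q_i$, governed by $F_2 = F_2(P, q) = \sum P_i q_i + u(P, q)$, where $p_i = p_i(\theta)$ and $q_i = q_i(\theta)$ are functions of a continuous time-like parameter $\theta$, show that

$$\frac{dp_i}{d\theta} = -\frac{\partial w}{\partial q_i} \qquad \text{and} \qquad \frac{dq_i}{d\theta} = \frac{\partial w}{\partial p_i}$$

where w is defined through $u(p_i(\theta + \delta\theta), q_i(\theta)) = w(p_i(\theta), q_i(\theta))\,\delta\theta$, and is called the generator for the infinitesimal transformation. Using the above relation and the properties of PBs show that for any $f = f(q, p)$

$$\frac{df}{d\theta} = [f, w] \qquad \text{and} \qquad \frac{d^2 f}{d\theta^2} = [[f, w], w]$$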




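9.8 Given the canonical transformations $Q_i = Q_i(q, p, t)$, $P_i = P_i(q, p, t)$, and their inverse transformations effected through the generating function $F_2 = F_2(q, P, t)$, show that

$$\left[ Q_i, \frac{\partial F_2}{\partial t} \right] = \frac{\partial Q_i}{\partial t} \qquad \text{and} \qquad \left[ P_i, \frac{\partial F_2}{\partial t} \right] = \frac{\partial P_i}{\partial t}$$

Using these results show further that

$$\dot Q_i = [Q_i, K] \qquad \text{and} \qquad \dot P_i = [P_i, K]$$

where $K = H + \partial F_2/\partial t$.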


9.9 Show that the time reversal transformation given by $Q = q$, $P = -p$ and $T = -t$ is canonical, in the sense that the form of Hamilton’s equations of motion is preserved, but does not satisfy the invariance of the fundamental Poisson bracket relations. Hence these two criteria are not equal in all respects.





10 

Hamilton-Jacobi Theory 


10.0 INTRODUCTION 

The Hamilton-Jacobi theory is usually considered to be the most intricate part of classical dynamics. So we have tried to explain the difficult parts at as much length as possible, with a sufficient number of worked-out examples. We would like to urge you to acquaint yourself with this very important and most powerful analytical method of solving dynamical problems. No less a mathematical intellect than Jacobi has contributed significantly to simplifying the Hamiltonian approach to solve for the required characteristic function. About Jacobi it is said, ‘for sheer manipulative ability in tangled algebra Euler and Jacobi have had no rival, unless it be the Indian mathematical genius, Srinivasa Ramanujan, in our century’, quoted from E. T. Bell’s Men of Mathematics.

Carl Gustav Jacob Jacobi (1804 - 1851) mainly worked on the theory of elliptic functions, elliptic integrals, determinants, numbers, etc. His investigations on first order partial differential equations were published posthumously (he contracted smallpox and died suddenly) in his treatise on Dynamics. Other great minds who have contributed substantially to applying the HJ theory to conservative periodic systems were C. E. Delaunay, P. Stäckel, T. Levi-Civita, J. M. Burgers, P. Epstein and K. Schwarzschild.


10.1 SOLUTION TO THE TIME DEPENDENT HAMILTON-JACOBI EQUATION AND JACOBI’S THEOREM


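We write the time dependent Hamilton-Jacobi (HJ) equation for a holonomic system having n DOF as

$$H\left( q_1, \ldots, q_n; \frac{\partial W}{\partial q_1}, \ldots, \frac{\partial W}{\partial q_n}; t \right) + \frac{\partial W}{\partial t} = 0$$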


This is a partial differential equation of the first order in the unknown function W. It can have a complete solution (that is, W as a function of $q_1, \ldots, q_n, t$ and, of course, the constants of integration) known as the complete integral, which will contain as many arbitrary constants (that is, constants of integration) as there are independent variables.

For a system with n DOF we have n independent coordinate variables and the time variable t, so the complete integral W must contain n + 1 arbitrary constants, of which one would be simply an additive constant, since the equation contains only the derivatives in the form of $\partial W/\partial t$ and $\{\partial W/\partial q_i\}$ and not W itself. Thus the remaining n arbitrary constants must appear as arguments of W so that the solution (complete integral) has the form

$$W = W(q_1, \ldots, q_n; \alpha_1, \ldots, \alpha_n; t) + A \tag{10.1}$$

where $\alpha_1, \ldots, \alpha_n$ and A are the constants of integration.

In 1845, Jacobi proved a theorem, now known as Jacobi’s theorem, which was published posthumously and which asserts that the system would dynamically evolve in such a way that the derivatives of W with respect to the $\alpha$’s remain constant in time and the equations of motion would simply read as

$$\frac{\partial W}{\partial \alpha_i} = \beta_i, \qquad i = 1, \ldots, n \tag{10.2}$$

where the $\beta_i$’s are n constants of motion.

Usually the $\alpha$’s are called the first integrals of motion and the $\beta$’s are called the second integrals of motion. Now given the complete solution W, Eqs (10.2) and those from the definition of (initial) momenta, that is,

$$\left. \frac{\partial W}{\partial q_i} \right|_{q_i = q_{i0},\ t = 0} = p_{i0}, \qquad i = 1, \ldots, n \tag{10.3}$$

make up two systems, each of n inhomogeneous simultaneous algebraic equations in n unknowns, respectively the $q_i$s and the $\alpha_i$s.

Assuming that the Jacobian determinant for the transformation does not vanish, we can solve Eqs (10.3) for the $\alpha_i$s and, substituting these $\alpha_i$s in Eqs (10.2), we finally solve for the $q_i$s giving

$$q_i = q_i(q_{10}, \ldots, q_{n0}; p_{10}, \ldots, p_{n0}; \beta_1, \ldots, \beta_n; t) = q_i(q_{10}, \ldots, q_{n0}; p_{10}, \ldots, p_{n0}; t)$$

because

$$\beta_i = \left. \frac{\partial W}{\partial \alpha_i} \right|_{q_i = q_{i0},\ t = 0} \tag{10.4}$$

Substituting these expressions for the $q_i$s in the equations

$$\frac{\partial W}{\partial q_i} = p_i \tag{10.5}$$

we get

$$p_i = p_i(q_{10}, \ldots, q_{n0}; p_{10}, \ldots, p_{n0}; \beta_1, \ldots, \beta_n; t) = p_i(q_{10}, \ldots, q_{n0}; p_{10}, \ldots, p_{n0}; t)$$

thus completely solving the problem in terms of the initial coordinates and momenta.
Proof of Jacobi’s theorem 


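Given the complete integral solution for W given by Eq. (10.1), we wish to prove Eq. (10.2). Consider,

$$\begin{aligned}
\frac{d}{dt}\left( \frac{\partial W}{\partial \alpha_i} \right) &= \frac{\partial}{\partial t}\left( \frac{\partial W}{\partial \alpha_i} \right) + \frac{\partial^2 W}{\partial q_j\,\partial \alpha_i}\,\dot q_j \\
&= \frac{\partial}{\partial \alpha_i}\left( \frac{\partial W}{\partial t} \right) + \frac{\partial}{\partial \alpha_i}\left( \frac{\partial W}{\partial q_j} \right) \dot q_j \\
&= \frac{\partial}{\partial \alpha_i}\left[ -H\left( q_j, \frac{\partial W}{\partial q_j}, t \right) \right] + \frac{\partial^2 W}{\partial \alpha_i\,\partial q_j}\,\dot q_j \\
&= -\frac{\partial H}{\partial q_j}\frac{\partial q_j}{\partial \alpha_i} - \frac{\partial H}{\partial(\partial W/\partial q_j)}\frac{\partial^2 W}{\partial \alpha_i\,\partial q_j} + \frac{\partial^2 W}{\partial \alpha_i\,\partial q_j}\,\dot q_j
\end{aligned}$$

Note that since the $q_j$s and $\alpha_i$s are independent, $\partial q_j/\partial \alpha_i = 0$, so that we can finally write

$$\frac{d}{dt}\left( \frac{\partial W}{\partial \alpha_i} \right) = \left( -\frac{\partial H}{\partial p_j} + \dot q_j \right) \frac{\partial^2 W}{\partial \alpha_i\,\partial q_j} = 0$$

by Hamilton’s equations of motion, and prove the theorem.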


The method of solving dynamical problems involving holonomic systems via HJ equations 
is considered to be the most powerful method, provided one can indeed integrate the HJ 
equation to get a complete integral. It is worthwhile to state a formal stepwise procedure 
for tackling a dynamical problem using the HJ method. 

1. Construct the Hamiltonian H(p,q,t) of the system. 

2. Set up the HJ equation by substituting all $p_i = \partial W/\partial q_i$ in the expression for the Hamiltonian.

3. Find the general solution of the HJ equation in the form of a complete integral (10.1). Usually one follows the method of separation of variables to solve the HJ equation.

4. Apply Jacobi’s theorem to this solution, that is, set $\partial W/\partial \alpha_i = \beta_i$, where $\{\beta_i\}$ is another set of n arbitrary constants.

5. Now $\partial W/\partial \alpha_i = \beta_i$ are a set of n inhomogeneous simultaneous algebraic equations in n unknowns $\{q_i\}$ involving 2n constants ($\{\alpha_i\}$, $\{\beta_i\}$) and time t. Solve these for $\{q_i\}$ to get

$$q_i = q_i(\alpha_1, \ldots, \alpha_n; \beta_1, \ldots, \beta_n; t) \tag{10.6}$$

6. Now $\{\alpha_i\}$ and $\{\beta_i\}$ can be expressed in terms of initial coordinates and momenta by inverting the Eqs (10.3) and using Eqs (10.4) respectively, in that order. Substituting these expressions for $\{\alpha_i\}$ and $\{\beta_i\}$ into Eqs (10.6) gives $\{q_i\}$ as functions of initial coordinates $\{q_{i0}\}$, momenta $\{p_{i0}\}$ and time t:

$$q_i = q_i(q_{10}, \ldots, q_{n0}; p_{10}, \ldots, p_{n0}; t) \tag{10.7}$$

7. Now substitute the functions (10.7) into Eqs (10.5) to get

$$p_i = p_i(q_{10}, \ldots, q_{n0}; p_{10}, \ldots, p_{n0}; t) \tag{10.8}$$

Equations (10.7) and (10.8) constitute the complete solution of the given dynamical problem.

8. Now if one wishes, one can obtain an expression for the total energy of the system, which is in general not a constant of motion, from

$$H = -\frac{\partial W}{\partial t}$$

For conservative systems, one can in fact have two choices. One can either start with the time dependent HJ equation and solve by the method described above, or one can write down the time independent HJ equation in terms of Hamilton’s characteristic function $S(q_1, \ldots, q_n; E)$, namely

$$H\left( q_i, \frac{\partial S}{\partial q_i} \right) = E \tag{10.9}$$

and follow a somewhat ‘similar’ procedure. This is left as an exercise. However, there would be a corresponding Jacobi’s theorem, and the organisation of the constants of integration for the complete integral would be somewhat different.
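The stepwise procedure of this section can be walked through for the simplest possible case. The sketch below is an illustration added here, not an example from the text: it assumes a free particle with $H = p^2/2m$, for which $W(q, \alpha, t) = \alpha q - \alpha^2 t/2m$ is a complete integral, and uses central differences in place of analytic derivatives. The mass value and function names are hypothetical.

```python
m = 2.0  # illustrative mass (assumption)

def W(q, alpha, t):
    """Assumed complete integral for the free particle H = p^2 / 2m."""
    return alpha * q - alpha**2 * t / (2 * m)

def d(f, x, h=1e-6):
    """Central-difference derivative of a one-argument function."""
    return (f(x + h) - f(x - h)) / (2 * h)

def hj_residual(q, alpha, t):
    """dW/dt + (dW/dq)^2 / 2m, which should vanish if W solves the HJ equation."""
    dW_dt = d(lambda s: W(q, alpha, s), t)
    dW_dq = d(lambda x: W(x, alpha, t), q)
    return dW_dt + dW_dq**2 / (2 * m)

def trajectory(q0, p0, t):
    """Steps 4 to 7: fix alpha and beta from initial data, then solve for q(t), p(t)."""
    alpha = p0                                  # p0 = dW/dq at t = 0 gives alpha
    beta = d(lambda a: W(q0, a, 0.0), alpha)    # beta = dW/dalpha at t = 0, equal to q0
    q = beta + alpha * t / m                    # solve dW/dalpha = q - alpha t / m = beta
    p = d(lambda x: W(x, alpha, t), q)          # p = dW/dq
    return q, p

print("HJ residual:", hj_residual(0.4, 1.7, 0.9))
print("q(t), p(t) :", trajectory(1.0, 3.0, 2.0))
```

The result is the familiar uniform motion $q = q_0 + p_0 t/m$, $p = p_0$, obtained here purely from algebraic manipulation of $W$ without integrating Hamilton's equations.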


10.2 CONNECTION WITH CANONICAL TRANSFORMATION 


Let us now make a canonical transformation from (q, p) to (Q, P) via the generating function

$$F_2(q, P, t) = W(q_1, \ldots, q_n; \alpha_1, \ldots, \alpha_n; t)$$

Here we treat the new momenta $\{P_i = \alpha_i\}$. In order that the transformation be canonical we must have

$$p_i = \frac{\partial F_2}{\partial q_i} = \frac{\partial W}{\partial q_i}, \qquad Q_i = \frac{\partial F_2}{\partial P_i} = \frac{\partial W}{\partial \alpha_i} \qquad \text{and} \qquad K = H + \frac{\partial F_2}{\partial t} = H + \frac{\partial W}{\partial t} \tag{10.10}$$

The first and the third equations in (10.10) clearly identify W to be Hamilton’s principal function on one hand and the complete integral of the HJ equation on the other. The first set of Eqs (10.10) does indeed satisfy the definition of momenta in terms of Hamilton’s principal function. The second set of Eqs (10.10) defines a set of new coordinates $\{Q_i\}$ in terms of $\{q_i\}$, $\{\alpha_i\}$ and t. Note that the equalities in the last of the Eqs (10.10) are not only dictated by the requirement of canonicality but are also the consequence of the requirement that W be Hamilton’s principal function.

Now since the transformed Hamiltonian $K(P, Q, t) = 0$, the corresponding Hamilton’s equations of motion,

$$\dot Q_i = \frac{\partial K}{\partial P_i} \qquad \text{and} \qquad \dot P_i = -\frac{\partial K}{\partial Q_i}$$

lead to solutions

$$P_i = \text{const.} \qquad \text{and} \qquad Q_i = \text{const.}$$

However, we already know that

$$P_i = \text{const.} = \alpha_i$$

where the $\alpha_i$ are the constants of integration in the HJ equation. But $Q_i = \text{const.} = \partial W/\partial \alpha_i$ are the new results of the identification of W with the generating function $F_2$ of the above canonical transformation, that is, through $F_2 = W$. Therefore,

$$\frac{\partial W}{\partial \alpha_i} = \text{const.} = \beta_i \tag{10.11}$$

where $\beta_i = Q_i$.

This is simply Jacobi’s theorem. It is now further clear why $\{\beta_i\}$ are called the second integrals of motion. Using Eqs (10.10) and (10.11) the dynamical problem can be solved following the steps described in the last section. So the important result that we derive from this section is that the set $(\alpha_i, \beta_i)$ is canonically related to the set $(p_i, q_i)$ through a generating function the same as Hamilton’s principal function. Thus the transformed Hamiltonian simply vanishes. However, if we had started with the time independent HJ equation and tried to prove the corresponding Jacobi’s theorem following the above method, we would have got back the first set of Eqs (10.10) as they are, but the last equation of (10.10) would not lead to the vanishing of the transformed Hamiltonian; rather we would have obtained $K = H = \text{constant} = E$. Jacobi’s theorem follows from Hamilton’s equations of motion in terms of the transformed quantities, with the fact that $K = \text{constant} = E$.

We can now see the role of the Hamilton-Jacobi differential equation in a proper perspective. Our original problem was to solve the 2n Hamilton’s first order differential equations for a system with n DOF:

$$\dot p_i = -\frac{\partial H}{\partial q_i} \qquad \text{and} \qquad \dot q_i = \frac{\partial H}{\partial p_i}, \qquad i = 1, \ldots, n$$

To do this we seek a canonical transformation such that the new Hamiltonian is either zero, or more generally, depends exclusively on either the new momenta or the new coordinates (see section 10.5). Once this is done, the integration of Hamilton’s equations becomes quite trivial. The generating function for this canonical transformation is given by the solution of the HJ equation. Thus the problem of solving the system of 2n ordinary differential equations (Hamilton’s equations) is now reduced to finding a complete integral of a single partial differential equation in (n + 1) variables, namely the time dependent or time independent HJ equation. It is indeed surprising that this ‘reduction’ from the simple to the complicated provides an effective method for solving concrete problems. However, it turns out that this is the most powerful method known for exact integration, and many problems which were solved by Jacobi cannot be solved by other means.


10.3 HOW TO FIND THE COMPLETE INTEGRAL OF THE HJ EQUATION 


Usually the generalised coordinates are so chosen that the HJ equation can be solved by the method of separation of variables. First, if the Hamiltonian does not explicitly contain time one can linearly decouple time from the rest of the variables in W and write

$$W(q_1, \ldots, q_n, t) = W_0(q_1, \ldots, q_n) + W'(t) \tag{10.12}$$

so that the HJ equation becomes

$$-\frac{dW'(t)}{dt} = H\left( q_1, \ldots, q_n; \frac{\partial W_0}{\partial q_1}, \ldots, \frac{\partial W_0}{\partial q_n} \right) \tag{10.13}$$

Since the RHS and the LHS are now functions of totally different variables, they can be equal for all values of these variables only if the LHS and the RHS are separately equal to a constant independent of all the above variables. This is the reason why the method of separation of variables can be so powerful in solving partial differential equations. Sometimes the decoupling is done, not as a sum of two functions, but as a product of two functions. However in the present case we can write

$$\frac{dW'}{dt} = -E \qquad \text{and} \qquad H\left( q_1, \ldots, q_n; \frac{\partial W_0}{\partial q_1}, \ldots, \frac{\partial W_0}{\partial q_n} \right) = E \tag{10.14}$$

where E is the constant of separation. Equation (10.14) can be viewed as the time independent HJ equation with $W_0$ to be identified as Hamilton’s characteristic function. In any case the solution to the time dependent HJ equation for conservative dynamical systems thus has the form, apart from a trivial additive constant term,

$$W = W_0(q_1, \ldots, q_n, \alpha_1, \ldots, \alpha_{n-1}, E) - Et$$

We can now identify $W_0$ with Hamilton’s characteristic function and formally write

$$W_0 = S(q_1, \ldots, q_n, \alpha_1, \ldots, \alpha_{n-1}, E)$$

One should, of course, remember that the function S is usually expressed explicitly in terms of the terminal coordinates, one set being $\{q_{i0}\}$ and the other say $\{q_i\}$, instead of the present set of arguments of $W_0$. Also note that the nth (or the first!) constant of integration, $\alpha_n$, has been replaced by the energy integral E. If we regard $\{\alpha_i\}$ to be some kind of momenta, then energy seems to behave like a kind of momentum, a forerunner of a well accepted result in special relativistic mechanics.


10.3.1 Conditions for Separability of Coordinates 

In 1887, O. Staude for systems with two degrees of freedom and later P. Stäckel (during 1891-1895) for systems with an arbitrary but finite number of DOF, showed that the motion of a system whose HJ equation can be integrated by separation of variables is multiply periodic, or in astronomer’s language conditionally periodic. The converse of this theorem will be proved in section 10.5. Stäckel also showed that for systems with a separable HJ differential equation, the integral of Jacobi’s action $2\int T\,dt$ can be separated into a sum

$$\sum_{k=1}^{N} \int \sqrt{f_k(q_k)}\;dq_k$$

where N is the number of DOF, and the $f_k$s are functions of only the $q_k$’s, and these oscillate between two fixed limits (libration, defined in section 10.5), or are such that their increase by a certain constant value leaves the configuration of the system unchanged (rotation).

However, the necessary and sufficient conditions for separability were first obtained by T. Levi-Civita in 1904. These are a set of N(N - 1)/2 partial differential equations, given by

$$\frac{\partial H}{\partial p_k}\frac{\partial H}{\partial p_s}\frac{\partial^2 H}{\partial q_k\,\partial q_s} - \frac{\partial H}{\partial p_k}\frac{\partial H}{\partial q_s}\frac{\partial^2 H}{\partial q_k\,\partial p_s} - \frac{\partial H}{\partial q_k}\frac{\partial H}{\partial p_s}\frac{\partial^2 H}{\partial p_k\,\partial q_s} + \frac{\partial H}{\partial q_k}\frac{\partial H}{\partial q_s}\frac{\partial^2 H}{\partial p_k\,\partial p_s} = 0 \tag{10.15}$$

for k = 1, ..., N and s = 1, ..., k - 1, k + 1, ..., N (no summation over k or s).

If any set of conjugate variables $(q_k, p_k)$ satisfies the conditions of separability then any other set connected by a canonical transformation is also separable. A dynamical system which in addition to this class of variables is separable in still another set, was called by Schwarzschild a degenerate system.

Further studies on separability were carried out by Paul Epstein (1916), a student of Arnold Sommerfeld, and independently by Karl Schwarzschild (1916).
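The Levi-Civita conditions lend themselves to a quick numerical probe. The following minimal sketch, added for illustration, assumes the standard form of the condition with the sign pattern (+, -, -, +) and no summation over the pair (k, s), and evaluates its left-hand side by central differences. Both Hamiltonians are hypothetical examples: one of Liouville type, which separates in the given coordinates, and one with a coupling term $q_1 q_2$, which does not separate in these coordinates.

```python
def levi_civita_residual(H, z, k, s, h=1e-4):
    """Left-hand side of the separability condition for the coordinate pair (k, s).

    z = [q_1, ..., q_n, p_1, ..., p_n]; all derivatives are central differences.
    """
    n = len(z) // 2

    def shifted(pairs):
        w = list(z)
        for idx, step in pairs:
            w[idx] += step
        return H(w)

    def d1(i):
        return (shifted([(i, h)]) - shifted([(i, -h)])) / (2 * h)

    def d2(i, j):
        return (shifted([(i, h), (j, h)]) - shifted([(i, h), (j, -h)])
                - shifted([(i, -h), (j, h)]) + shifted([(i, -h), (j, -h)])) / (4 * h * h)

    qk, qs, pk, ps = k, s, n + k, n + s
    return (d1(pk) * d1(ps) * d2(qk, qs) - d1(pk) * d1(qs) * d2(qk, ps)
            - d1(qk) * d1(ps) * d2(pk, qs) + d1(qk) * d1(qs) * d2(pk, ps))

def H_liouville(z):
    """Assumed separable example: (kinetic + q1^4 + q2^4) / (q1^2 + q2^2)."""
    q1, q2, p1, p2 = z
    return (0.5 * (p1 * p1 + p2 * p2) + q1**4 + q2**4) / (q1 * q1 + q2 * q2)

def H_coupled(z):
    """Assumed example that does not separate in (q1, q2): coupling term q1 * q2."""
    q1, q2, p1, p2 = z
    return 0.5 * (p1 * p1 + p2 * p2) + q1 * q2

z0 = [1.0, 0.7, 0.3, -0.5]  # arbitrary phase-space point (assumption)
print("Liouville type:", levi_civita_residual(H_liouville, z0, 0, 1))
print("coupled       :", levi_civita_residual(H_coupled, z0, 0, 1))
```

For the coupled case the residual equals $p_1 p_2$ analytically, so it vanishes only on special surfaces of phase space. A check at a single point can rule separability out but cannot establish it; that requires the condition to hold identically.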

10.3.2 Separability of Coordinates for Systems under Central or Axisymmetric 
Forces 


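The dynamical problems that deal with central or axisymmetric forces can possibly be best handled in terms of spherical polar coordinates, in which case the Hamiltonian would assume the following general form:

$$H = H(r, \theta, \phi, p_r, p_\theta, p_\phi) = \frac{1}{2\mu}\left( p_r^2 + \frac{p_\theta^2}{r^2} + \frac{p_\phi^2}{r^2\sin^2\theta} \right) + V(r, \theta, \phi) \tag{10.16}$$

If V has an azimuthal symmetry, so that

$$V(r, \theta) = a(r) + \frac{b(\theta)}{r^2} \tag{10.17}$$

we can separate variables in the HJ equation. Equation (10.14) takes the form

$$\frac{1}{2\mu}\left[ \left( \frac{\partial W_0}{\partial r} \right)^2 + \frac{1}{r^2}\left( \frac{\partial W_0}{\partial \theta} \right)^2 + \frac{1}{r^2\sin^2\theta}\left( \frac{\partial W_0}{\partial \phi} \right)^2 \right] + a(r) + \frac{b(\theta)}{r^2} = E$$

Since the coordinate $\phi$ is cyclic, we seek a solution of the form

$$W_0 = p_\phi \phi + W_r(r) + W_\theta(\theta)$$

so that

$$\left( \frac{dW_r}{dr} \right)^2 + 2\mu\,a(r) + \frac{\beta}{r^2} = 2\mu E$$

and

$$\left( \frac{dW_\theta}{d\theta} \right)^2 + 2\mu\,b(\theta) + \frac{p_\phi^2}{\sin^2\theta} = \beta$$

where $\beta$, $p_\phi$ and E are the constants of variable separations, for $\theta$, $\phi$ and t respectively. Finally, integrating these ordinary differential equations we get

$$W = -Et + p_\phi \phi + \int \sqrt{\beta - 2\mu\,b(\theta) - \frac{p_\phi^2}{\sin^2\theta}}\;d\theta + \int \sqrt{2\mu[E - a(r)] - \frac{\beta}{r^2}}\;dr \tag{10.18}$$

Similarly, depending on the symmetry in the potential one may need to write the Hamiltonian in parabolic, elliptical or cylindrical polar coordinates and try for the separability of the variables.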


10.4 WORKED-OUT EXAMPLES 


10.4.1 The Case of 1-D Simple Harmonic Oscillator 
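The form of the Hamiltonian given by

$$H(q, p) = \frac{p^2}{2m} + \frac{1}{2}kq^2$$

leads to the HJ equation in the form

$$\frac{\partial W}{\partial t} + \frac{1}{2m}\left( \frac{\partial W}{\partial q} \right)^2 + \frac{1}{2}kq^2 = 0 \tag{10.19}$$

Now let

$$W(q, t) = W_1(t) + W_2(q)$$

Putting this solution into Eq. (10.19) we get

$$\frac{dW_1}{dt} + \frac{1}{2m}\left( \frac{dW_2}{dq} \right)^2 + \frac{1}{2}kq^2 = 0$$

Since $W_1$ depends only on t and $W_2$ depends only on q, Eq. (10.19) holds good provided

$$\frac{dW_1(t)}{dt} = -E \qquad \text{and} \qquad \frac{1}{2m}\left( \frac{dW_2}{dq} \right)^2 + \frac{1}{2}kq^2 = E \tag{10.20}$$

The first equation has a solution

$$W_1 = -Et + \text{const.}$$

and the second has

$$W_2 = \int \sqrt{m(2E - kq^2)}\;dq$$

Therefore, the complete integral is

$$W(q, t) = -Et + W_2(q, E) + \text{const.} = -Et + \frac{\sqrt{mk}}{2}\left[ q\sqrt{\frac{2E}{k} - q^2} + \frac{2E}{k}\sin^{-1}\left( q\sqrt{\frac{k}{2E}} \right) \right] + \text{const.}$$

Apart from an additive constant, a new constant E has appeared in the solution and since this is a one-variable problem, we expected only one constant in the class of $\alpha_i$s. Here $\alpha_1 = E$. Now by Jacobi’s theorem

$$\beta_1 = \frac{\partial W}{\partial E} = -t + \sqrt{\frac{m}{k}}\,\sin^{-1}\left( q\sqrt{\frac{k}{2E}} \right) \tag{10.21}$$

where $\beta_1$ is a constant. Its value can be obtained by putting t = 0 and q = q(0) in Eq. (10.21). Let us denote it by $t_0$. Thus we get an algebraic relation involving the constants $t_0$ and E and the variable q:

$$t_0 = -t + \sqrt{\frac{m}{k}}\,\sin^{-1}\sqrt{\frac{kq^2}{2E}} \tag{10.22}$$

Inverting Eq. (10.22) we get

$$q = \sqrt{\frac{2E}{k}}\,\sin\left[ \sqrt{\frac{k}{m}}\,(t + t_0) \right] \tag{10.23}$$

The momentum is given by

$$p = \frac{\partial W}{\partial q} = \frac{dW_2}{dq} = \sqrt{m(2E - kq^2)}$$

which can be expressed as a function of t if we substitute for q = q(t) from Eq. (10.23). Finally for energy, we get

$$H = -\frac{\partial W}{\partial t} = E$$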




10.4.2 The Case of Planetary Orbits in Two Dimensions ( r,0 ) 

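The detailed solution for the Keplerian problem in the HJ formulation was first obtained by Jacobi. He also solved for the motion under two fixed centres of Newtonian attraction (or Coulombian repulsion), a case which, according to Jacobi, is separable in elliptical coordinates (that is, in the parameters of families of confocal ellipses and hyperbolas whose foci are at the centres of force).

The HJ equation for planetary orbits in plane polar coordinates described in the plane of the orbit has the following form (the potential energy being $-k/r$):

$$\frac{1}{2\mu}\left( \frac{\partial W_0}{\partial r} \right)^2 - \frac{k}{r} + \frac{1}{2\mu r^2}\left( \frac{\partial W_0}{\partial \theta} \right)^2 = E$$

Since this differential equation is cyclic in $\theta$, the solution can be written as

$$W_0(r, \theta) = p_\theta \theta + W_1(r)$$

so that

$$\left( \frac{dW_1}{dr} \right)^2 = 2\mu\left( E + \frac{k}{r} \right) - \frac{p_\theta^2}{r^2}$$

or

$$W_1(r) = \int \left[ 2\mu\left( E + \frac{k}{r} \right) - \frac{p_\theta^2}{r^2} \right]^{1/2} dr$$

and therefore,

$$W(r, \theta, p_\theta, E, t) = -Et + p_\theta \theta + \int \sqrt{2\mu\left( E + \frac{k}{r} \right) - \frac{p_\theta^2}{r^2}}\;dr \tag{10.24}$$

Now by Jacobi’s theorem

$$\frac{\partial W}{\partial p_\theta} = \text{const.} = \beta_\theta = \theta + \frac{\partial W_1}{\partial p_\theta} \tag{10.25}$$

This gives the equation of the orbit in $\theta$. And finally,

$$\frac{\partial W}{\partial E} = \text{const.} = \beta_E = -t + \frac{\partial W_1}{\partial E} \tag{10.26}$$

giving the equation of orbit in t.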
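The orbit equation obtained from Jacobi's theorem can be compared with the familiar conic section. The sketch below is an added illustration under stated assumptions: the potential energy is taken as $-k/r$, the parameter values are arbitrary, and the conic $r = \ell/(1 + e\cos\theta)$ with $\ell = p_\theta^2/\mu k$ and $e = \sqrt{1 + 2Ep_\theta^2/\mu k^2}$ is the standard Kepler result rather than something derived in this section. Differentiating $\theta + \partial W_1/\partial p_\theta = \text{const.}$ with respect to r gives $d\theta/dr = (p_\theta/r^2)/\sqrt{2\mu(E + k/r) - p_\theta^2/r^2}$ on the outgoing branch, and the code checks that the conic satisfies it.

```python
import math

mu, k, E, p_theta = 1.0, 1.0, -0.3, 0.9   # illustrative bound-orbit values (assumptions)

ell = p_theta**2 / (mu * k)                               # semi-latus rectum
ecc = math.sqrt(1 + 2 * E * p_theta**2 / (mu * k**2))     # eccentricity

def r_orbit(theta):
    """Conic section with perihelion at theta = 0."""
    return ell / (1 + ecc * math.cos(theta))

def dtheta_dr(r):
    """d(theta)/dr implied by theta + dW_1/dp_theta = const. (branch with dr/dtheta > 0)."""
    return (p_theta / r**2) / math.sqrt(2 * mu * (E + k / r) - p_theta**2 / r**2)

def dr_dtheta(theta, h=1e-6):
    """Central-difference slope of the conic."""
    return (r_orbit(theta + h) - r_orbit(theta - h)) / (2 * h)

for theta in (0.5, 1.0, 2.0):   # all on the outgoing half of the orbit, 0 < theta < pi
    product = dr_dtheta(theta) * dtheta_dr(r_orbit(theta))
    print(f"theta = {theta:.1f}  (dr/dtheta)(dtheta/dr) = {product:.8f}")
```

A product of unity at each sample angle means the conic is consistent with the HJ orbit equation; on the incoming half of the orbit the square root takes the opposite sign.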

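A general prescription for finding the integral $W_1(r)$ is in section numbers 2.267, 2.261 and 2.266 of the Table of Integrals, Series, and Products by I. S. Gradshteyn and I. M. Ryzhik:

Given $R = a + bx + cx^2$ and $\Delta = 4ac - b^2$,

$$\int \frac{dx}{\sqrt{R}} = \begin{cases}
\dfrac{1}{\sqrt{c}}\ln\left( 2\sqrt{cR} + 2cx + b \right) & c > 0 \\[2ex]
\dfrac{1}{\sqrt{c}}\sinh^{-1}\left( \dfrac{2cx + b}{\sqrt{\Delta}} \right) & c > 0,\ \Delta > 0 \\[2ex]
\dfrac{-1}{\sqrt{-c}}\sin^{-1}\left( \dfrac{2cx + b}{\sqrt{-\Delta}} \right) & c < 0,\ \Delta < 0 \\[2ex]
\dfrac{1}{\sqrt{c}}\ln(2cx + b) & c > 0,\ \Delta = 0
\end{cases} \tag{10.27}$$

$$\int \frac{dx}{x\sqrt{R}} = \begin{cases}
-\dfrac{1}{\sqrt{a}}\ln\left( \dfrac{2a + bx + 2\sqrt{aR}}{x} \right) & a > 0 \\[2ex]
\dfrac{1}{\sqrt{-a}}\sin^{-1}\left( \dfrac{2a + bx}{x\sqrt{-\Delta}} \right) & a < 0,\ \Delta < 0 \\[2ex]
\dfrac{1}{\sqrt{-a}}\tan^{-1}\left( \dfrac{2a + bx}{2\sqrt{-a}\sqrt{R}} \right) & a < 0 \\[2ex]
-\dfrac{1}{\sqrt{a}}\sinh^{-1}\left( \dfrac{2a + bx}{x\sqrt{\Delta}} \right) & a > 0,\ \Delta > 0 \\[2ex]
-\dfrac{1}{\sqrt{a}}\tanh^{-1}\left( \dfrac{2a + bx}{2\sqrt{a}\sqrt{R}} \right) & a > 0 \\[2ex]
\dfrac{1}{\sqrt{a}}\ln\left( \dfrac{x}{2a + bx} \right) & a > 0,\ \Delta = 0 \\[2ex]
-\dfrac{2\sqrt{bx + cx^2}}{bx} & a = 0,\ b \neq 0
\end{cases} \tag{10.28}$$

The planetary motion can also be solved in parabolic coordinates $(u, v, \phi)$, defined by

$$x = \sqrt{uv}\cos\phi, \qquad y = \sqrt{uv}\sin\phi \qquad \text{and} \qquad z = (u - v)/2$$

We know that the Lagrangian for planetary motion is

$$L = \frac{1}{2}\mu(\dot x^2 + \dot y^2 + \dot z^2) + \frac{GMm}{\sqrt{x^2 + y^2 + z^2}}$$

where

$$\mu = \frac{Mm}{M + m}$$

The Lagrangian, in terms of the above parabolic coordinates, becomes

$$L(u, v, \phi, \dot u, \dot v, \dot\phi) = \frac{1}{8}\mu(u + v)\left( \frac{\dot u^2}{u} + \frac{\dot v^2}{v} \right) + \frac{1}{2}\mu uv\dot\phi^2 + \frac{2GMm}{u + v}$$

Hence the Hamiltonian becomes

$$H(u, v, \phi, p_u, p_v, p_\phi) = \frac{2}{\mu(u + v)}\left( u p_u^2 + v p_v^2 \right) + \frac{p_\phi^2}{2\mu uv} - \frac{2GMm}{u + v}$$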


Thus the HJ equation is

$$\frac{\partial W}{\partial t} + \frac{2}{\mu(u + v)}\left[ u\left( \frac{\partial W}{\partial u} \right)^2 + v\left( \frac{\partial W}{\partial v} \right)^2 + \frac{1}{4}\left( \frac{1}{u} + \frac{1}{v} \right)\left( \frac{\partial W}{\partial \phi} \right)^2 \right] - \frac{2GMm}{u + v} = 0$$

Now taking

$$W = -Et + \alpha_1 \phi + W_1(u) + W_2(v)$$

one gets

$$u\left( \frac{dW_1}{du} \right)^2 - \frac{EMmu}{2(M + m)} + \frac{\alpha_1^2}{4u} - \alpha_2 = \frac{1}{2}\frac{GM^2m^2}{M + m}$$

and

$$v\left( \frac{dW_2}{dv} \right)^2 - \frac{EMmv}{2(M + m)} + \frac{\alpha_1^2}{4v} + \alpha_2 = \frac{1}{2}\frac{GM^2m^2}{M + m}$$

Thus, in terms of parabolic coordinates also, Hamilton’s principal function for the planetary motion is completely separable. Normally a dynamical problem is not separable in two different sets of coordinates. But the planetary motion has this facility because of its added symmetry in the form of the degeneracy of the frequencies of radial and azimuthal oscillations. This is more fully discussed in section 10.5.


10.4.3 Swinging Atwood’s Machine 

Two masses M and m, connected via a pair of horizontally placed frictionless and weightless pulleys, are tied at the ends of an inextensible string. The heavier mass M can move only up or down whereas the lighter mass m can oscillate in a vertical plane $(r, \theta)$ as shown in Fig. (10.1). We have,

the kinetic energy $T = \frac{1}{2}(M + m)\dot r^2 + \frac{1}{2}mr^2\dot\theta^2$
the potential energy $V = gr(M - m\cos\theta)$
the total energy $E = T + V$
the Lagrangian $L = T - V$ and
the generalised momenta

$$p_r = \frac{\partial L}{\partial \dot r} = (M + m)\dot r \qquad \text{and} \qquad p_\theta = \frac{\partial L}{\partial \dot\theta} = mr^2\dot\theta$$

so that the Hamiltonian is

$$H(r, \theta, p_r, p_\theta) = \frac{1}{2}\left( \frac{p_r^2}{M + m} + \frac{p_\theta^2}{mr^2} \right) + gr(M - m\cos\theta)$$

The form of the Hamiltonian is such that the HJ equation constructed out of this H will not satisfy the separability requirement, since in the last term both r and $\theta$ are mixed. Hence we must look for a suitable pair of generalised coordinates in terms of which the HJ equation would be separable. Such a pair of generalised coordinates was found and reported in Am. J. Phys. 54, 142 (1986), and we present the method below.

Fig. 10.1 A simple Atwood’s pendulum

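We have to transform $(r, \theta)$ to parabolic cylindrical coordinates $(\xi, \eta)$ defined through

$$r = \frac{1}{2}(\xi^2 + \eta^2) \qquad \text{and} \qquad \theta = 2\tan^{-1}\left( \frac{\xi^2 - \eta^2}{2\xi\eta} \right)$$

with the inverse transformations

$$\xi = \sqrt{r\,[1 + \sin(\theta/2)]} \qquad \text{and} \qquad \eta = \sqrt{r\,[1 - \sin(\theta/2)]}$$

The energy integral now becomes

$$E = \left[ \tfrac{1}{2}(M + m)\xi^2 + 2m\eta^2 \right]\dot\xi^2 + \left[ \tfrac{1}{2}(M + m)\eta^2 + 2m\xi^2 \right]\dot\eta^2 + (M - 3m)\,\xi\eta\,\dot\xi\dot\eta + \frac{g}{\xi^2 + \eta^2}\left[ \tfrac{1}{2}(M + m)(\xi^4 + \eta^4) + (M - 3m)\,\xi^2\eta^2 \right]$$

This simplifies for the case with M = 3m and henceforth we deal with this special case, with

$$p_\xi = 4\dot\xi\,(\xi^2 + \eta^2) \qquad \text{and} \qquad p_\eta = 4\dot\eta\,(\xi^2 + \eta^2)$$

and

$$\frac{(p_\xi^2 + p_\eta^2)/8 + 2g(\xi^4 + \eta^4)}{\xi^2 + \eta^2} = E$$

assuming further m = 1. Hence the HJ equation becomes

$$\left( \frac{\partial W}{\partial \xi} \right)^2 + \left( \frac{\partial W}{\partial \eta} \right)^2 + 16g(\xi^4 + \eta^4) - 8E(\xi^2 + \eta^2) = 0$$

To separate the variables put

$$W = W_\xi(\xi) + W_\eta(\eta)$$

which, when substituted in the HJ equation, results in

$$\left( \frac{dW_\xi}{d\xi} \right)^2 + 16g\xi^4 - 8E\xi^2 = I = p_\xi^2 + 16g\xi^4 - 8E\xi^2$$

and

$$\left( \frac{dW_\eta}{d\eta} \right)^2 + 16g\eta^4 - 8E\eta^2 = -I = p_\eta^2 + 16g\eta^4 - 8E\eta^2$$

where I is the constant of separation which can be regarded as a new constant of motion apart from the energy integral E. From these two expressions for I it is easy to show that

$$I = 16(\xi^2 + \eta^2)(\eta^2\dot\xi^2 - \xi^2\dot\eta^2) + \frac{16g\,\xi^2\eta^2(\xi^2 - \eta^2)}{\xi^2 + \eta^2}$$

When translated into the old coordinates $r, \theta, \dot r, \dot\theta$ one obtains

$$I(r, \theta, \dot r, \dot\theta) = 16r^2\dot\theta\left[ \dot r\cos\frac{\theta}{2} - \frac{r\dot\theta}{2}\sin\frac{\theta}{2} \right] + 16gr^2\sin\frac{\theta}{2}\cos^2\frac{\theta}{2}$$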

This hidden symmetry would not have been obtained through the study of the problem in
its original $(r, \theta)$ coordinates only, or through the study of the Poisson brackets, or the cyclic
coordinates of the Hamiltonian. The separability of the solution of the HJ equation of any
given dynamical problem into functions of its individual generalised coordinates is thus an
extremely useful tool for evaluating all the hidden constants of motion. Unfortunately, one
has to use a trial and error approach to hit the right coordinate system.

It has been found that the above problem is separable not only for M = 3m but in
general for $M = m(4n^2 - 1)$, where n is any integer. It is however not yet known why
$M : m = 4n^2 - 1$ has such special symmetries.
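The constancy of the separation constant for M = 3m can be checked numerically. The following is a minimal sketch, not part of the original treatment: it integrates the Lagrange equations that follow from the T and V given above with a fourth-order Runge-Kutta step and monitors the quantity I written in the old coordinates, $I = 16r^2\dot\theta\,(\dot r\cos\frac{\theta}{2} - \frac{r\dot\theta}{2}\sin\frac{\theta}{2}) + 16gr^2\sin\frac{\theta}{2}\cos^2\frac{\theta}{2}$, together with the energy. The value g = 9.81, the initial state, the step size and all function names are assumptions chosen only for illustration.

```python
import math

def derivs(state, M, m, g):
    """Time derivatives of (r, theta, r_dot, theta_dot).

    Lagrange equations of L = (M+m) r_dot^2/2 + m r^2 theta_dot^2/2
                              - g r (M - m cos(theta)).
    """
    r, th, rd, thd = state
    rdd = (m * r * thd**2 - g * (M - m * math.cos(th))) / (M + m)
    thdd = -(2.0 * rd * thd + g * math.sin(th)) / r
    return (rd, thd, rdd, thdd)

def rk4_step(state, dt, M, m, g):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(k, h):
        return tuple(s + h * ki for s, ki in zip(state, k))
    k1 = derivs(state, M, m, g)
    k2 = derivs(shifted(k1, dt / 2), M, m, g)
    k3 = derivs(shifted(k2, dt / 2), M, m, g)
    k4 = derivs(shifted(k3, dt), M, m, g)
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def energy(state, M, m, g):
    r, th, rd, thd = state
    return (0.5 * (M + m) * rd**2 + 0.5 * m * r**2 * thd**2
            + g * r * (M - m * math.cos(th)))

def hidden_integral(state, g):
    """Separation constant I for M = 3m with m = 1, in the old coordinates."""
    r, th, rd, thd = state
    s, c = math.sin(th / 2), math.cos(th / 2)
    return 16 * r**2 * thd * (rd * c - 0.5 * r * thd * s) + 16 * g * r**2 * s * c**2

def max_relative_drift(g=9.81, dt=1e-4, t_end=0.3):
    """Largest relative change of E and I along one short trajectory (M = 3, m = 1)."""
    M, m = 3.0, 1.0
    state = (1.0, 0.5, 0.0, 1.0)   # assumed initial r, theta, r_dot, theta_dot
    e0, i0 = energy(state, M, m, g), hidden_integral(state, g)
    drift_e = drift_i = 0.0
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(state, dt, M, m, g)
        drift_e = max(drift_e, abs(energy(state, M, m, g) / e0 - 1.0))
        drift_i = max(drift_i, abs(hidden_integral(state, g) / i0 - 1.0))
    return drift_e, drift_i

if __name__ == "__main__":
    print(max_relative_drift())
```

Both drifts should stay at the level of the integration error, which is the numerical counterpart of the statement that I is a second constant of motion besides E. The short integration time is deliberate, since the equation for $\ddot\theta$ is singular at r = 0.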


10.4.4 1-D Damped Harmonic Oscillator 
The equation of motion is

$$m\ddot{q} + \lambda\dot{q} + m\omega_0^2 q = 0 \qquad (10.29)$$

For quantising such an oscillator one needs to formulate a suitable Lagrangian and
Hamiltonian for this system. In the literature one finds a large number of papers justifying one
or the other Lagrangian, but most of them defy a consistent physical picture of the system.
For example, if one starts with the energy condition, namely

$$E = T + V = \frac{1}{2}m\dot{q}^2 + \frac{1}{2}m\omega_0^2 q^2 \quad\text{with}\quad H = \dot{q}\frac{\partial L}{\partial \dot{q}} - L = E$$

on integration one would find,

$$L = \frac{1}{2}m\dot{q}^2 - \frac{1}{2}m\omega_0^2 q^2 + \dot{q}\,h(q,t)$$

No matter what h(q,t) is, this L does not produce the right equation of motion (Eq.
(10.29)).

Now suppose one forms a Lagrangian that correctly reproduces the equation of motion.
One such Lagrangian is often quoted as

$$L(q,\dot{q},t) = e^{\lambda t/m}\left[\frac{1}{2}m\dot{q}^2 - \frac{1}{2}m\omega_0^2 q^2\right] \qquad (10.30)$$

On solving the Lagrangian equation of motion one gets

$$q = A e^{-\lambda t/2m}\cos\omega t \qquad (10.31)$$

where $\omega^2 = \omega_0^2 - (\lambda^2/4m^2) > 0$. Hence the Hamiltonian is

$$H = \dot{q}\frac{\partial L}{\partial \dot{q}} - L = \left(\frac{1}{2}m\dot{q}^2 + \frac{1}{2}m\omega_0^2 q^2\right)e^{\lambda t/m} \qquad (10.32)$$

and, after using the expression for q in Eq. (10.31) the value of H becomes

$$H = \frac{1}{2}A^2 m\left[\left(\omega_0^2 + \frac{\lambda^2}{4m^2}\right)\cos^2\omega t + \frac{\lambda\omega}{2m}\sin 2\omega t + \omega^2\sin^2\omega t\right]$$

We now see that even though the position and the velocities are damped in time, the
Hamiltonian is not damped, but oscillates with period $\pi/\omega$. The energy averaged over any
full period is

$$\bar{H} = \frac{1}{4}mA^2\left(\omega_0^2 + \frac{\lambda^2}{4m^2} + \omega^2\right) = \frac{1}{2}mA^2\omega_0^2 \qquad (10.33)$$

which is constant!

Fallacies of this kind are known to exist with the above standard form of the Lagrangian
even though it produces the correct equation of motion. So some people claim that the
above Lagrangian does not correspond to the physical Lagrangian of a damped harmonic
oscillator, but that it corresponds to that for a variable mass oscillator (for example, a
bucket oscillating in the rain collecting water at a constant rate).

Recently, a suitable canonical transformation of the Hamiltonian in Eq. (10.32) has been
found so that the transformed Hamiltonian does not have any explicit time dependence,
hence an energy like Jacobi integral can be obtained for the system.


Starting with the equation of motion (10.29) and defining the canonical momentum by

$$p = \frac{\partial L}{\partial \dot{q}} = m\dot{q}\,e^{\lambda t/m}$$

the Hamiltonian in Eq. (10.32) becomes

$$H(q,p,t) = e^{-\lambda t/m}\frac{p^2}{2m} + \frac{1}{2}m\omega_0^2 q^2 e^{\lambda t/m} \qquad (10.34)$$

This form of H has explicit time dependence and hence it is not a constant of the motion.
Herein lies the catch: while calculating the average of H in Eq. (10.33) we tacitly assumed
that H in Eq. (10.32) was a constant of motion, and that it represented the energy, like a
Jacobi integral. Now let us make a canonical transformation given by

$$Q = q\,e^{\lambda t/2m} \quad\text{and}\quad P = p\,e^{-\lambda t/2m}$$

with the generating function,

$$F_2(q,P,t) = e^{\lambda t/2m}\,qP$$

The transformed Hamiltonian becomes

$$K(Q,P) = H + \frac{\partial F_2}{\partial t} = \frac{P^2}{2m} + \frac{1}{2}m\omega_0^2 Q^2 + \frac{\lambda}{2m}QP$$

which is independent of time! Hence this Hamiltonian can be regarded as a constant of
motion. In terms of the old variables this new Jacobi integral of motion becomes

$$J = e^{-\lambda t/m}\frac{p^2}{2m} + \frac{1}{2}m\omega_0^2 q^2 e^{\lambda t/m} + \frac{\lambda}{2m}qp$$

It is this J, and not the old Hamiltonian given by Eq. (10.34), that is conserved: the
amplitudes of both $q$ and $\dot{q}$ diminish with time but not the integral J as a whole.

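One can then proceed to set up the HJ equation in terms of the new variables (Q, P),

$$\frac{\partial W}{\partial t} + \frac{1}{2m}\left(\frac{\partial W}{\partial Q}\right)^2 + \frac{1}{2}m\omega_0^2 Q^2 + \frac{\lambda}{2m}Q\left(\frac{\partial W}{\partial Q}\right) = 0$$

and seek a solution

$$W(Q,\alpha,t) = -\alpha t + W_0(Q,\alpha)$$

where $\alpha$ is the new energy-like Jacobi integral J (= K). On substituting
$x = \sqrt{m\omega_0}\,Q$ the equation for $W_0$ becomes
$(dW_0/dx)^2 + a\,x\,(dW_0/dx) + x^2 = b$, where $a = \lambda/m\omega_0$ and $b = 2\alpha/\omega_0$, giving a solution

$$W_0(x) = -\frac{a}{4}x^2 + \int\sqrt{b - \left(1 - \frac{a^2}{4}\right)x^2}\;dx$$

Now $a$ can have values < 2, = 2, or > 2, having three different solutions,
representing an underdamped, critically damped and overdamped oscillator. To illustrate
one case we take $a < 2$ and define $c = \sqrt{1 - a^2/4}$ to give the solution,

$$W_0(x,\alpha) = -\frac{a}{4}x^2 + \int\sqrt{b - c^2x^2}\;dx$$

Now

$$\beta = \frac{\partial W}{\partial \alpha} = -t + \frac{1}{c\,\omega_0}\sin^{-1}\left(\frac{cx}{\sqrt{b}}\right)$$

or

$$\sin^{-1}\left(\frac{cx}{\sqrt{b}}\right) = c\,\omega_0(t + \beta) = \omega t + \delta, \ \text{say.}$$

Therefore, the final solution is,

$$q = Q\,e^{-\lambda t/2m} = A e^{-\lambda t/2m}\sin(\omega t + \delta)$$

which is a physical solution admitting of no ambiguity or contradiction at any stage.
Similarly the solutions for the other two cases follow in their expected forms.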
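The contrast between the old Hamiltonian and the new Jacobi integral can be illustrated numerically. The sketch below is an illustration under stated assumptions, not the authors' calculation: it evaluates both quantities along the underdamped solution $q = Ae^{-\lambda t/2m}\cos\omega t$, using $p = m\dot q\,e^{\lambda t/m}$, the Hamiltonian $H = e^{-\lambda t/m}p^2/2m + \frac12 m\omega_0^2q^2e^{\lambda t/m}$ and the integral $J = H + (\lambda/2m)\,qp$. The parameter values (m = 1, $\lambda$ = 0.2, $\omega_0$ = 1, A = 1) and the function names are arbitrary choices.

```python
import math

def damped_state(t, m, lam, omega0, A):
    """q and q_dot for the underdamped solution q = A exp(-lam t/2m) cos(omega t)."""
    gamma = lam / (2.0 * m)
    omega = math.sqrt(omega0**2 - gamma**2)
    decay = math.exp(-gamma * t)
    q = A * decay * math.cos(omega * t)
    q_dot = -A * decay * (gamma * math.cos(omega * t) + omega * math.sin(omega * t))
    return q, q_dot

def old_hamiltonian(t, m, lam, omega0, A):
    """H(q, p, t) with explicit time dependence."""
    q, q_dot = damped_state(t, m, lam, omega0, A)
    growth = math.exp(lam * t / m)
    p = m * q_dot * growth
    return p**2 / (2.0 * m * growth) + 0.5 * m * omega0**2 * q**2 * growth

def jacobi_integral(t, m, lam, omega0, A):
    """J = H + (lam/2m) q p, the time-independent K written in old variables."""
    q, q_dot = damped_state(t, m, lam, omega0, A)
    p = m * q_dot * math.exp(lam * t / m)
    return old_hamiltonian(t, m, lam, omega0, A) + lam / (2.0 * m) * q * p

if __name__ == "__main__":
    m, lam, omega0, A = 1.0, 0.2, 1.0, 1.0
    times = [0.1 * k for k in range(500)]
    H = [old_hamiltonian(t, m, lam, omega0, A) for t in times]
    J = [jacobi_integral(t, m, lam, omega0, A) for t in times]
    print("H ranges over", min(H), max(H))
    print("J ranges over", min(J), max(J))
```

For this particular solution J works out analytically to $\frac12 mA^2\omega^2$, while H keeps oscillating about $\frac12 mA^2\omega_0^2$ with period $\pi/\omega$.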


10.5 ACTION-ANGLE VARIABLES 

This is a versatile technique used for finding all the periodicities in a periodic system without 
completely solving the HJ equations, and was first devised by Delaunay in 1846 in connection 
with some problems in celestial mechanics. However, the name action-angle variables was 
suggested by Karl Schwarzschild in 1916. 

Periodic systems are those which repeat their state of motion after a constant interval
of time known as the period. In the phase space all such systems need not retrace their
paths. Those which retrace their paths in phase space are called librating systems. On
the other hand, for the rotating systems some momenta are periodic functions of their
conjugate coordinates, but the latter do not return to their original value and increase or
decrease monotonically with time. Thus libration corresponds to periodic systems having
velocities $\dot{q}_i$ changing sign over any single period, otherwise $q_i$ cannot return to their
old values after completion of the period. Rotation leads to a situation where $q_i$ changes
monotonically or remains constant while the system returns, after each period in $q_i$, to its
original configuration.

Action variables are useful parameters for systems which are periodic, conservative and
orthogonally decomposable, that is, Hamilton's characteristic function is completely
separable in all its variables. The action variables are defined as

$$J_i = \oint p_i\,dq_i \qquad (10.35)$$

where $i$ is not summed over and the cyclic integral is performed over a full periodic variation
in $q_i$.


If the complete integral of the time independent HJ equation

$$H\left(q_1,\ldots,q_n,\frac{\partial S}{\partial q_1},\ldots,\frac{\partial S}{\partial q_n}\right) = E$$

is

$$S(q_1,\ldots,q_n;\alpha_1,\ldots,\alpha_n) = \sum_i S_i(q_i;\alpha_1,\ldots,\alpha_n)$$

where one of the $\{\alpha_i\}$ is the energy E, then

$$J_i = \oint p_i\,dq_i = \oint\frac{\partial S}{\partial q_i}\,dq_i = \oint\frac{\partial S_i(q_i;\alpha_1,\ldots,\alpha_n)}{\partial q_i}\,dq_i$$

= a function independent of $q_i$,
but dependent on $\alpha_1,\ldots,\alpha_n$ only

that is,

$$J_i = J_i(\alpha_1,\ldots,\alpha_n)$$

Now, these n relations can be inverted to obtain

$$\alpha_i = \alpha_i(J_1,\ldots,J_n) \qquad i = 1,\ldots,n$$

so that the complete integral can also be written as

$$S(q_1,\ldots,q_n;J_1,\ldots,J_n)$$

instead of $S(q_1,\ldots,q_n;\alpha_1,\ldots,\alpha_n)$. $\{J_i\}$ are here the constants of motion (although called
action 'variables'!). We can regard them as new momenta $P_i$ for a canonical transformation
with a new set of coordinates $Q_i$, conjugate to these momenta $P_i$ ($i = 1,\ldots,n$). This
canonical transformation may be generated by a generating function of the type $F_2$, same
as the complete integral of the time independent HJ equation or Hamilton's characteristic
function $S(q_1,\ldots,q_n;J_1,\ldots,J_n)$. The conditions are

$$Q_i = \frac{\partial S}{\partial P_i} = \frac{\partial S}{\partial J_i} \quad\text{and}\quad p_i = \frac{\partial S}{\partial q_i}$$

with the new Hamiltonian

$$K(P,Q) = H(p,q) = E(J_1,\ldots,J_n)$$

which does not depend on $\{Q_i\}$.

The canonical equations of motion corresponding to the new Hamiltonian K are

$$\dot{P}_i = \dot{J}_i = -\frac{\partial K}{\partial Q_i} = 0 \quad\text{and}\quad \dot{Q}_i = \frac{\partial K}{\partial P_i} = \frac{\partial E}{\partial J_i} = \text{const.} = \nu_i, \ \text{say}$$

where $\{\nu_i\}$ is another set of constants. Note that $\dot{Q}_i \neq 0$ unlike its counterpart in the
time dependent HJ equation, where Jacobi's theorem suggested $\{Q_i\}$ to be constants of
motion. Here, the solution for $\{Q_i\}$ is

$$Q_i = \nu_i t + \beta_i$$

where

$$\nu_i = \frac{\partial E}{\partial J_i} = \text{const.} \qquad (10.36)$$

and $\beta_i$ are the constants of integration. Again the $Q_i$'s are the coordinates conjugate to the momenta
$J_i$'s, and the $J_i$'s are of the same dimension as that of angular momentum or action or Planck's
constant h. Hence the $Q_i$'s are basically angle like variables and are therefore dimensionless.
These $Q_i$'s are historically denoted by $\omega_i$'s and are called angle variables, given by

$$\omega_i = \nu_i t + \beta_i \qquad (10.37)$$

Obviously, the $\nu_i$'s look like frequencies of oscillation of the coordinate which gives rise to
$J_i$, and the $\beta_i$'s are the phase constants.

Thus if the relation $E = E(J_1,\ldots,J_n)$ is known, the frequencies of oscillation $\nu_i$ for
all the coordinates given by

$$\nu_i = \frac{\partial E}{\partial J_i}$$

can be obtained. If two or more frequencies of oscillation are identical, such a periodic
system is called a degenerate periodic system. If further, all the frequencies $\{\nu_i\}$ are found
to be identical, the system is termed a completely degenerate periodic system.

Now, the new set of variables $(\omega_i, J_i)$ should completely specify the dynamical state of
any conservative periodic system and

$$\omega_i = \nu_i t + \beta_i$$

can produce the general solution

$$q_k = q_k(\omega_1,\ldots,\omega_N;J_1,\ldots,J_N)$$

which can be represented by a multiple Fourier expansion of the fundamental frequencies
$\{\nu_i\}$, given by

$$q_k = \sum_{n_1,\ldots,n_N=-\infty}^{\infty} A^{(k)}_{n_1\ldots n_N}\exp\left\{2\pi i\sum_{j=1}^{N} n_j(\nu_j t + \beta_j)\right\} \qquad (10.38)$$

where N is the number of DOF of the system, and where the summation in the front
extends over all integer values of $n_j$.

The motion in general is not periodic, its orbit having the characteristics of an 'open'
Lissajous figure. Only if the $\nu_i$'s are commensurate does the current point of the orbit ever
return strictly to its starting point.

The use of this technique was limited to astronomers only, until it was brought into
physics by Stäckel, Epstein, and Schwarzschild. If these N frequencies of the system satisfy
n < N linear equations with integral coefficients, the system is said to have an n-fold
degeneracy. In this case, the orbit does not, as in the nondegenerate case, fill in the
configuration space an N-fold, but only an (N - n)-fold continuum, and the system can
be transformed by an appropriate change of variables into a system with (N - n) degrees
of freedom.


10.5.1 1-D Simple Harmonic Oscillator 


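The Hamiltonian for such a periodic system is given by

$$H = \frac{p^2}{2m} + \frac{1}{2}kq^2$$

which sets the time independent HJ equation as

$$\frac{1}{2m}\left(\frac{\partial S}{\partial q}\right)^2 + \frac{1}{2}kq^2 = E$$

Therefore,

$$J = \oint p\,dq = \oint\sqrt{2m\left(E - \frac{1}{2}kq^2\right)}\;dq = 2\pi E\sqrt{\frac{m}{k}}$$

and hence the frequency of oscillation is

$$\nu = \frac{\partial E}{\partial J} = \frac{1}{2\pi}\sqrt{\frac{k}{m}}$$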
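The action-angle recipe for the oscillator, namely evaluate $J(E) = \oint p\,dq$ and then differentiate, $\nu = \partial E/\partial J$, can be reproduced numerically without using the closed form. The sketch below is only an illustration: the quadrature (midpoint rule after the substitution $q = q_{max}\sin\phi$), the finite-difference step and the sample values m = 2, k = 8 are assumptions of this example.

```python
import math

def action(E, m, k, n=2000):
    """J(E) = closed integral of p dq for H = p^2/2m + k q^2/2.

    The substitution q = q_max sin(phi) removes the square-root
    singularity of p(q) at the turning points.
    """
    q_max = math.sqrt(2.0 * E / k)
    h = math.pi / n
    half_cycle = 0.0
    for j in range(n):
        phi = -math.pi / 2 + (j + 0.5) * h
        q = q_max * math.sin(phi)
        p = math.sqrt(max(0.0, 2.0 * m * (E - 0.5 * k * q * q)))
        half_cycle += p * q_max * math.cos(phi) * h
    return 2.0 * half_cycle   # out along +p and back along -p

def frequency(E, m, k, dE=1e-3):
    """nu = dE/dJ by a central finite difference."""
    return 2.0 * dE / (action(E + dE, m, k) - action(E - dE, m, k))

if __name__ == "__main__":
    m, k, E = 2.0, 8.0, 1.5
    print("J numeric :", action(E, m, k))
    print("J exact   :", 2.0 * math.pi * E * math.sqrt(m / k))
    print("nu numeric:", frequency(E, m, k))
    print("nu exact  :", math.sqrt(k / m) / (2.0 * math.pi))
```

The frequency comes out independent of E, as expected for a harmonic oscillator; replacing the potential inside `action` with an anharmonic one would make $\nu$ energy dependent.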


10.5.2 The Kepler Problem 

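The motion is periodic if the energy E < 0. The Hamiltonian is

$$H = \frac{1}{2m}\left(p_r^2 + \frac{p_\theta^2}{r^2} + \frac{p_\phi^2}{r^2\sin^2\theta}\right) + V(r)$$

Taking

$$S(r,\theta,\phi) = S_r(r) + S_\theta(\theta) + \alpha_\phi\phi$$

the HJ equation reduces to

$$\left(\frac{dS_\theta}{d\theta}\right)^2 + \frac{\alpha_\phi^2}{\sin^2\theta} = \alpha_\theta^2 = p_\theta^2 + \frac{p_\phi^2}{\sin^2\theta} \qquad (10.39)$$

and

$$\left(\frac{dS_r}{dr}\right)^2 + \frac{\alpha_\theta^2}{r^2} = 2m\left[E - V(r)\right] = p_r^2 + \frac{\alpha_\theta^2}{r^2} \qquad (10.40)$$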


where $i$ is not summed and $S_i = \int p_i\,dq_i$, and $\tilde{S}_i = S_i - J_i\omega_i$. Hence,

At the end of the period $\tau_i$, $S_i$ must change by $J_i$ and $\omega_i$ change by unity, hence
$\tilde{S}_i$ returns to the same value like a periodic function of $\omega_i$, and so does $(\partial\tilde{S}_i/\partial\lambda)$. Therefore
$(\partial\tilde{S}_i/\partial\lambda)$ can be a Fourier sum for the fundamental frequency $\omega_i$, that is,

$$\frac{\partial\tilde{S}_i}{\partial\lambda} = \sum_k A_k\exp\left(2\pi\sqrt{-1}\,k\omega_i\right)$$

where k are integers, so that

$$\Delta J_i = -\dot{\lambda}\int\sum_k 2\pi A_k\sqrt{-1}\,k\exp\left(2\pi\sqrt{-1}\,k\omega_i\right)dt + O(\dot{\lambda}^2,\ddot{\lambda})$$

The first term obviously vanishes when integrated over a full period, and since $\lambda$ is a
slowly varying parameter, the second term can be neglected so that

$$\Delta J_i = 0 \qquad (10.45)$$

for any periodic and marginally nonconservative system. This fact is usually stated as: $J_i$
is an adiabatic invariant for any secular (that is, long term) changes in the Hamiltonian
function.

Of course, much later Levi-Civita (1934) rigorously laid the mathematical foundation of
the theory of adiabatic invariants. Adiabatic invariants for more complex systems such as
the compound pendulum and electrically oscillating circuits were worked out by P. L. Bhatnagar and
D. S. Kothari in 1942.

Examples 

(i) For a 1-D harmonic oscillator from Eqs (10.39) and (10.40), $J = E/\nu$. If the energy
is slowly changed, the frequency will also change proportionately. If the length is changed
slowly, this will simply lead to a slow change in frequency and a proportionate change in
the energy.

(ii) In the Kepler problem, we have seen from Eqs (10.43) and (10.44)

Thus for adiabatic changes, $J_r$, $J_\theta$ and $J_\phi$ should not change over any integral number
of periods. For example, for a very slow variation in G, the orbital energy must vary as
$|E| \propto G^2$, or for a very slow rate of loss of mass of the sun, the energy must change as
$|E| \propto M_\odot^2$. Again, since $J_\phi = 2\pi p_\phi$, any adiabatic change of any kind would keep the
z-component of angular momentum ($p_\phi$) constant.

(iii) Take another example. Suppose that there is a constant magnetic field in which a
charged particle having an electric charge e is moving in a circle. The magnetic induction
B is changed slowly. The adiabatic invariant for this case is

$$J = \oint \mathbf{p}\cdot d\mathbf{l} = \oint(m\mathbf{v} + e\mathbf{A})\cdot d\mathbf{l} = 2\pi mvr + e\int(\nabla\times\mathbf{A})\cdot d\mathbf{s} = 2\pi mvr + \pi er^2B = 3\pi eBr^2 \qquad (10.46)$$

using $mv^2/r = evB$ for circular orbits in a plane perpendicular to the direction of B.
Therefore, $Br^2$ must remain constant, that is, the total flux threading through the orbit
must be conserved. If B increases slowly the orbit must slowly shrink as $r \propto B^{-1/2}$.


(iv) The idea of adiabatic invariants (that is, $J_i$ = constant) is extremely useful in many
practical situations. Historically it has also provided the major guideline for the formulation
of the early quantisation rules for the atomic systems. The main idea that

$$J_r = n_r h, \quad J_\theta = n_\theta h, \quad J_\phi = n_\phi h \qquad (10.47)$$

has been used to construct the stationary energy levels of an atom. If we do not perturb
the stationary states of atoms by sufficiently strong electric and magnetic fields, atomic
systems do not make a transition (in the Bohr model of atoms). Thus classical concepts like
adiabatic invariants are helpful in making oneself comfortable with the idea of the existence
of stationary states of an atom.
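Example (i) lends itself to a simple numerical experiment. The sketch below is an illustration under assumptions of my own choosing, not a calculation from the text: a unit-mass oscillator $\ddot q = -\omega(t)^2 q$ whose frequency is ramped linearly from 1 to 2 over a time long compared with the period, integrated with the velocity Verlet scheme. Since $J = E/\nu = 2\pi E/\omega$, adiabatic invariance means that $E/\omega$ should stay nearly fixed while E itself roughly doubles.

```python
def run_slow_oscillator(omega_start=1.0, omega_end=2.0, t_total=1000.0, dt=0.01):
    """Integrate q'' = -omega(t)^2 q (unit mass) while omega(t) ramps slowly.

    Returns (E_initial, E_final, E_initial/omega_initial, E_final/omega_final).
    """
    def omega(t):
        return omega_start + (omega_end - omega_start) * t / t_total

    q, v = 1.0, 0.0                      # assumed initial displacement and velocity
    e_initial = 0.5 * v * v + 0.5 * omega(0.0)**2 * q * q
    steps = int(round(t_total / dt))
    for i in range(steps):
        t = i * dt
        v_half = v - 0.5 * dt * omega(t)**2 * q      # half kick
        q = q + dt * v_half                          # drift
        v = v_half - 0.5 * dt * omega(t + dt)**2 * q # half kick at the new time
    e_final = 0.5 * v * v + 0.5 * omega(t_total)**2 * q * q
    return e_initial, e_final, e_initial / omega(0.0), e_final / omega(t_total)

if __name__ == "__main__":
    e0, e1, j0, j1 = run_slow_oscillator()
    print("energy ratio  E1/E0 =", e1 / e0)
    print("action ratio  J1/J0 =", j1 / j0)
```

Making the ramp faster (a smaller `t_total`) degrades the constancy of $E/\omega$, which is the sense in which the invariance holds only for slow, secular changes.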


10.7 CLASSICAL-QUANTUM ANALOGIES 


In this section, we would like to analyse some of the limitations of classical dynamics and 
see how far the latter can be regarded as a stepping stone towards developing more complete 
theories of nature, namely the relativistic and quantum theories. 


10.7.1 Argument for Constancy of the Speed of Light in Vacuum 

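We know that Maxwell's electrodynamical equations in usual notations

$$\nabla\cdot\mathbf{D} = \rho \qquad \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}$$

$$\nabla\cdot\mathbf{B} = 0 \qquad \nabla\times\mathbf{H} = \mathbf{j} + \frac{\partial\mathbf{D}}{\partial t}$$

supplemented by the relations

$$\mathbf{D} = \epsilon\mathbf{E} \qquad \mathbf{B} = \mu\mathbf{H} \qquad \mathbf{j} = \sigma\mathbf{E} \qquad (10.48)$$

can give with a little vector manipulation

$$\nabla\times(\nabla\times\mathbf{H}) = \nabla(\nabla\cdot\mathbf{H}) - \nabla^2\mathbf{H} = -\sigma\mu\frac{\partial\mathbf{H}}{\partial t} - \epsilon\mu\frac{\partial^2\mathbf{H}}{\partial t^2}$$

so that one obtains the wave equation in H,

$$\frac{\partial^2\mathbf{H}}{\partial t^2} - \frac{1}{\epsilon\mu}\nabla^2\mathbf{H} = -\frac{\sigma}{\epsilon}\frac{\partial\mathbf{H}}{\partial t} \qquad (10.49)$$

and similarly for the wave equation in E,

$$\frac{\partial^2\mathbf{E}}{\partial t^2} - \frac{1}{\epsilon\mu}\nabla^2\mathbf{E} = -\frac{\sigma}{\epsilon}\frac{\partial\mathbf{E}}{\partial t} \qquad (10.50)$$

Since for propagation in free space, the Ohmic conductivity $\sigma = 0$, the dielectric
permittivity $\epsilon = \epsilon_0$ and the magnetic permeability $\mu = \mu_0$ are constants, the third term
in both Eqs (10.49) and (10.50) vanishes, and we get the standard electromagnetic wave
equation of the form

$$\frac{\partial^2\mathbf{F}}{\partial t^2} - (v^2\nabla^2)\mathbf{F} = 0 \qquad (10.51)$$

where $v = (\epsilon_0\mu_0)^{-1/2} = c$ is the constant speed of propagation of the F vector (F
can be either E or H). Taking the bracketed quantity in Eq. (10.51) as a space dependent
operator, the general D'Alembertian solution of Eq. (10.51) is like that of an SHO with its
angular frequency replaced by the operator $\pm i(\mathbf{v}\cdot\nabla)$,

$$\mathbf{F}(\mathbf{r},t) = e^{t(\mathbf{v}\cdot\nabla)}\mathbf{f}_1(\mathbf{r}) + e^{-t(\mathbf{v}\cdot\nabla)}\mathbf{f}_2(\mathbf{r})$$

$$= \left(1 + t\,\mathbf{v}\cdot\nabla + \tfrac{1}{2}t^2(\mathbf{v}\cdot\nabla)^2 + \cdots\right)\mathbf{f}_1(\mathbf{r}) + \left(1 - t\,\mathbf{v}\cdot\nabla + \cdots\right)\mathbf{f}_2(\mathbf{r})$$

$$= \mathbf{f}_1(\mathbf{r} + \mathbf{v}t) + \mathbf{f}_2(\mathbf{r} - \mathbf{v}t) \quad\text{by Taylor's theorem}$$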

The first one is a solution for backward propagation and the second one is a solution for
propagation in the forward direction. The functional forms of $f_1$ and $f_2$ with respect to
their arguments can be anything. However, as a particular solution, a monochromatic plane
wave can propagate in the forward direction with a constant angular frequency $\omega$ and a
constant wave vector $\mathbf{k}$, such that $f_2$ is described by a cosine function and its argument
$\mathbf{k}\cdot\mathbf{r} - \omega t$ is just $\mathbf{k}\cdot(\mathbf{r} - \mathbf{v}t)$, satisfying the property required by the general solution.
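That arbitrary profiles of the form $f_1(x + vt) + f_2(x - vt)$ solve the wave equation can be checked by finite differences. The sketch below is a one-dimensional illustration only; the two profiles (a Gaussian and a tanh step), the speed v = 2 and the difference step are arbitrary assumptions.

```python
import math

def f1(s):
    """Backward-moving profile (arbitrary choice)."""
    return math.exp(-s * s)

def f2(s):
    """Forward-moving profile (arbitrary choice)."""
    return math.tanh(s)

def F(x, t, v):
    """D'Alembert form of the solution in one space dimension."""
    return f1(x + v * t) + f2(x - v * t)

def wave_residual(x, t, v, h=1e-3):
    """F_tt - v^2 F_xx, estimated with central second differences."""
    f_tt = (F(x, t + h, v) - 2.0 * F(x, t, v) + F(x, t - h, v)) / h**2
    f_xx = (F(x + h, t, v) - 2.0 * F(x, t, v) + F(x - h, t, v)) / h**2
    return f_tt - v**2 * f_xx

if __name__ == "__main__":
    v = 2.0
    for x, t in [(0.0, 0.0), (0.7, 0.3), (-1.2, 0.5)]:
        print(x, t, wave_residual(x, t, v))
```

The residuals are at the level of the discretisation error for any smooth choice of $f_1$ and $f_2$, which is the content of the remark that their functional forms can be anything.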

Since the speed of propagation in free space $v = (\epsilon_0\mu_0)^{-1/2} = c$ depends only on the
properties of vacuum, it should not change with respect to any motion of inertial observers.
Supposing that it does, we can possibly think of riding on one of the crests of the plane
monochromatic electromagnetic wave, which means that we should be able to see totally
stationary solutions for the electric and magnetic fields described by

$$\mathbf{E} = \mathbf{E}_0\cos(\mathbf{k}\cdot\mathbf{r}) \qquad \mathbf{H} = \mathbf{H}_0\cos(\mathbf{k}\cdot\mathbf{r})$$

Putting these expressions in Maxwell's equations, we get

$$\nabla\times\mathbf{E} = 0 = \nabla\times\mathbf{H} \quad\text{and}\quad \nabla\cdot\mathbf{E} = 0 = \nabla\cdot\mathbf{H}$$

When both the divergence and curl of a vector vanish, it is either a constant vector or a null
one, implying that we can never ride on a freely propagating plane electromagnetic wave
and make it appear stationary. The speed of light in vacuum is a property of vacuum and
must be an observer-independent quantity, which is what was postulated by Albert Einstein
in his special theory of relativity.

10.7.2 Planck’s Quantum Law of Light 

In chapter 6, we have seen that if $W(\mathbf{r},t) = \int L\,dt$ is Hamilton's principal function and
$S(\mathbf{r}) = \int\mathbf{p}\cdot d\mathbf{r}$ Hamilton's characteristic function, they should satisfy for free motions

$$W = S - Et = \mathbf{p}\cdot\mathbf{r} - Et \qquad S = \mathbf{p}\cdot\mathbf{r} \qquad \mathbf{p} = \nabla W \qquad E = -\frac{\partial W}{\partial t}$$

$$\frac{dW}{dt} = L = \mathbf{p}\cdot\dot{\mathbf{r}} - E \qquad (10.52)$$

where E is the constant energy of the particle and p its linear momentum pointing normally
to the surfaces of constant W or constant S.

Now if we want to visualise the free propagation of a plane electromagnetic wave as a free
motion of a particle, we can exploit the nice similarity between the form of the argument of
the cosine wave and the form of the expression for W, as explicit linear functions of r and
t. The comparison immediately suggests that

$$\mathbf{p} \propto \mathbf{k} \quad\text{and}\quad E \propto \omega$$

which must have an identical constant of proportionality. If we choose the constant of
proportionality to be $\hbar = h/2\pi$, h being the Planck constant, we get the Planck law of
energy, and Einstein's law of momentum of photons, the quantum of light in particle form,
given by

$$E = \hbar\omega = h\nu \quad\text{and}\quad \mathbf{p} = \hbar\mathbf{k} = \frac{h}{\lambda}\hat{\mathbf{k}} \qquad (10.53)$$

where $\lambda$ is the wavelength of the monochromatic light. It is now easy to find that the
Lagrangian for the motion of light particles is

$$L = pc - E = 0 \quad\text{and}\quad W = \hbar(\mathbf{k}\cdot\mathbf{r} - \omega t) \qquad (10.54)$$

Here $W/\hbar$ is now to be interpreted as the phase angle of the electromagnetic wave at t = 0,
and the vanishing of L is not new (see Eq. (5.18)).

Note also that the increment in the value of S, the Jacobi or Lagrange action, over one
complete wavelength is simply h, the Planck constant.

10.7.3 Uncertainty Principle for Photons 

Plane monochromatic electromagnetic waves are represented by ideal cosine functions of the 
phase angle, which have neither a beginning nor an end. In nature the process of emission of 
light, say by an atom or an accelerating charged particle, cannot be indefinitely long. Hence 
the best realization of a plane monochromatic beam of light must consist of truncated pieces 
of EM wave trains, called packets of waves. If the individual process of emission of an EM 
wave lasts for $\Delta t$, the length of individual wave packets would then be $\Delta x = c\Delta t$. Now
how does one make a packet out of never-ending, infinitely long and uniform cosine waves? We
know that two propagating cosine waves with nearly equal wavelengths or frequencies can
superimpose on each other to produce beats or an envelope of packets. The whole point
is that monochromaticity has to be sacrificed. Let us allow a spread in $\omega$ as $\Delta\omega$, or
equivalently, a spread in wavelength $\Delta\lambda = -c\Delta\nu/\nu^2$, such that across the distance $\Delta x/2$
the wave of wavelength $\lambda + \Delta\lambda$ lags behind that of wavelength $\lambda$ by $\lambda/2$ and thus produces
a totally destructive interference at the two termini of the wave packet along the x-axis.
Thus we minimally require

$$\frac{\Delta x}{\lambda}\Delta\lambda = \lambda \quad\text{or}\quad \Delta\nu\,\Delta t = 1$$

With the Planck law, this condition for photons reads as

$$\Delta E\cdot\Delta t = h = \Delta p\cdot\Delta x \qquad (10.55)$$

This is consistent with Heisenberg's uncertainty principle applied to photons.

10.7.4 Schrodinger’s Wave Equation for Matter Particles 


Since the linear momentum of a particle is given by $\nabla W$, the particle always moves normal
to the surfaces of constant W and the magnitude of its momentum is given by the space rate
of change of W, that is the change over unit distance along this normal, keeping of course
time as constant. If W is expressed in units of $\hbar$ by defining an imaginary dimensionless
parameter $-i\Psi$ through the relation $W = -i\hbar\Psi$, $i = \sqrt{-1}$, then we would have

$$\mathbf{p} = \nabla W = -i\hbar\nabla\Psi \qquad (10.56)$$

$$E = H = -\frac{\partial W}{\partial t} = i\hbar\frac{\partial\Psi}{\partial t} \qquad (10.57)$$

and the Hamilton-Jacobi equation would read as

$$-\frac{\hbar^2}{2m}\nabla^2\Psi + V\Psi = i\hbar\frac{\partial\Psi}{\partial t} \qquad (10.58)$$

The classical-quantum correspondences are now transparent. Schrödinger's wave equation is
the quantum mechanical form of the classical Hamilton-Jacobi equation of motion. Instead
of finding a solution to the W function, one looks for a solution to the $\Psi$ function. It is the
interpretation of $\Psi$ and the involvement of complex numbers in the Schrödinger equation
that deviate from the spirit of classical descriptions.


10.7.5 Commutator Relations 


Let us suppose that A, B, C and D are any four dynamical variables that are functions
of coordinates and momenta. The Poisson bracket of the products AB and CD can be
expressed in the following two equivalent ways:

$$[AB,CD] = A[B,CD] + [A,CD]B = AC[B,D] + A[B,C]D + [A,C]DB + C[A,D]B$$

and

$$[AB,CD] = [AB,C]D + C[AB,D] = A[B,C]D + [A,C]BD + CA[B,D] + C[A,D]B$$

keeping the proper order. On comparison, we find

$$AC[B,D] + [A,C]DB = [A,C]BD + CA[B,D]$$

or

$$(AC - CA)[B,D] = [A,C](BD - DB)$$

This is satisfied provided

$$AC - CA = \alpha[A,C] \quad\text{and}\quad BD - DB = \alpha[B,D]$$

where $\alpha$ is any universal constant. The correspondence between quantum commutator
relations and classical Poisson brackets suggests that $\alpha = i\hbar$, so that for
any two dynamical variables A and B,

$$AB - BA = i\hbar[A,B] \qquad (10.59)$$

Understandably, from the dimensional point of view, the Poisson bracket contains an extra
dimension of the angular momentum or action in its denominator, which is cancelled by the
dimension of $\hbar$ in Eq. (10.59).

The entire chapter 9 can be converted into a topic of quantum commutator relations,
with p substituted by $-i\hbar\nabla$ and r for x. For example, the angular momentum operator
becomes $-i\hbar\,\mathbf{r}\times\nabla$.
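As a concrete instance of the rule $AB - BA = i\hbar[A,B]$, take A = x and B = p, whose classical Poisson bracket is 1, so that one expects $(xp - px)\psi = i\hbar\psi$ when p acts as $-i\hbar\,d/dx$. The sketch below checks this on a test function by central differences. It is only an illustration: the units with $\hbar = 1$, the Gaussian wave packet and the function names are assumptions of this example.

```python
import cmath

HBAR = 1.0  # assumed units in which hbar = 1

def psi(x):
    """Arbitrary smooth test function: a Gaussian times a plane wave."""
    return cmath.exp(-0.5 * x * x + 1j * x)

def p_op(f, x, h=1e-3):
    """Momentum operator -i hbar d/dx applied to f at x (central difference)."""
    return -1j * HBAR * (f(x + h) - f(x - h)) / (2.0 * h)

def commutator_xp(f, x):
    """((x p - p x) f)(x)."""
    x_then_p = p_op(lambda y: y * f(y), x)   # p acting on (x f)
    p_then_x = x * p_op(f, x)                # x acting on (p f)
    return p_then_x - x_then_p

if __name__ == "__main__":
    for x in (-1.0, 0.3, 2.0):
        print(x, commutator_xp(psi, x), 1j * HBAR * psi(x))
```

The two printed columns agree up to the $O(h^2)$ error of the difference formula, independently of the test function chosen.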

Werner Heisenberg, Max Born and Pascual Jordan showed in 1925 that the ordinary
Hamiltonian equations of dynamics were still valid in quantum theory, provided the symbols
representing the coordinates and momenta in classical dynamics are interpreted as operators
whose products do not commute. Two years later Heisenberg put forward the uncertainty
principle which vindicated Hamilton's intuition of the duality between generalised
coordinates and generalised momenta.

10.7.6 De Broglie Hypothesis 

Let us consider the motion of a free particle both classically and quantum mechanically.
The Hamilton-Jacobi equation

$$\frac{1}{2m}(\nabla W)^2 + \frac{\partial W}{\partial t} = 0$$

yields the usual solution $W = \mathbf{p}\cdot\mathbf{r} - Et$. Now its quantum mechanical equivalent, the
Schrödinger equation

$$-\frac{\hbar^2}{2m}\nabla^2\Psi = i\hbar\frac{\partial\Psi}{\partial t} = E\Psi \quad\text{with}\quad \mathbf{p}\Psi = -i\hbar\nabla\Psi$$

produces a solution

$$\Psi(\mathbf{r},t) = \Psi_0 e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} \qquad k^2 = \frac{2mE}{\hbar^2} \qquad \omega = \frac{E}{\hbar} \quad\text{and}\quad \mathbf{p} = \hbar\mathbf{k} \ \text{ or the wavelength } \ \lambda = \frac{h}{p} \qquad (10.60)$$

So the quantum solution for a free particle motion corresponds to a plane monochromatic
wave propagating in the same direction as the momentum of the classical particle. If there
is a beam of particles with a sufficiently small spread in energy, the assembly of waves turns
into packets, the group velocity of which ($v_g = d\omega/dk$), from the above expressions for $\omega$
and k, is found to be p/m = the classical speed of the particle. So Eq. (10.60) represents
the mathematical form of De Broglie's hypothesis, namely a reciprocal relation between the
momentum of a particle and its wavelength of propagation in its wave form. According to
the above hypothesis every particle can be thought of as a moving train of waves having the
wave vector $\mathbf{k}$ and angular frequency $\omega$, as given above.
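The statement that the group velocity of the matter wave equals the classical speed follows from the dispersion relation $\omega = E/\hbar = \hbar k^2/2m$ implied by the free-particle solution. The sketch below differentiates this relation numerically for an electron. It is an illustration only: the sample speed of $2.2\times10^6$ m/s, the SI constants as typed here and the function names are assumptions of this example.

```python
HBAR = 1.054571817e-34       # J s
M_ELECTRON = 9.1093837e-31   # kg (approximate)

def omega_of_k(k, m):
    """Free-particle dispersion relation omega = hbar k^2 / (2 m)."""
    return HBAR * k * k / (2.0 * m)

def group_velocity(k, m, rel_step=1e-6):
    """v_g = d(omega)/dk by a central finite difference."""
    dk = rel_step * k
    return (omega_of_k(k + dk, m) - omega_of_k(k - dk, m)) / (2.0 * dk)

def phase_velocity(k, m):
    return omega_of_k(k, m) / k

if __name__ == "__main__":
    v_classical = 2.2e6                      # m/s, assumed sample speed
    k = M_ELECTRON * v_classical / HBAR      # from p = hbar k
    print("group velocity:", group_velocity(k, M_ELECTRON))
    print("phase velocity:", phase_velocity(k, M_ELECTRON))
```

The group velocity reproduces p/m, whereas the phase velocity of this nonrelativistic wave is only half of it, which is why the packet rather than the individual crest is identified with the particle.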

If we now closely scrutinize and compare the equations and solutions for the classical and 
quantum descriptions, we can note the following points: 


(a) The classical equation is a second degree first order partial differential equation in W , 
whereas the quantum equation looks like a diffusion equation with its time part containing 
an imaginary coefficient. 

(b) Both the equations are usually solved by the method of separation of variables, but 
the classical one as a sum and the quantum one as a product of two functions. 


(c) The time part has a solution linearly varying with time for the classical equation, but 
has an oscillatory exponential behaviour for the quantum equation. The same is also true 
for the space part. 


It is nevertheless nice to see that the classical free particle solution for W appears as a
factor in the exponent of its quantum solution, namely the $\Psi$ function, that is,

$$\Psi(\mathbf{r},t) = \Psi_0 e^{iW/\hbar} \qquad (10.61)$$

This particular classical to quantum correspondence has been fully exploited by Richard
Feynman in developing his path integral formulation of quantum mechanics. Since the
function $W = \int L\,dt$, and its variation over different chosen paths keeping the end point
coordinates (not momenta) fixed vanishes near the actual path, by Hamilton's principle, the
correspondence becomes all the more meaningful.


10.7.7 Bohr-Sommerfeld Quantisations of Atomic States 

This has already been hinted at the end of section 10.6. The main problem with classical
atoms is that they are unstable. Classical electrodynamics predicts that any accelerating
charged particle having electric charge e, mass m and acceleration a should radiate
electromagnetic radiation with power $P = \mu_0 e^2 a^2/6\pi c$. If in a hydrogen atom the electron
is revolving in a circular orbit of radius about 0.05 nm with a speed of about 2200 km/s,
it will be orbiting at a rate of about $7\times10^{15}$ revolutions per second. But because of its
tremendous centripetal acceleration of about $10^{22}$ times g, the electron will radiate
and spiral into the nucleus in about $10^{-11}$ second. Why then are the atoms so stable?
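The orders of magnitude quoted here can be reproduced with a few lines of arithmetic. The sketch below is a rough estimate under stated assumptions: it uses the orbit radius 0.05 nm and speed 2200 km/s from the text, the Larmor power $P = \mu_0e^2a^2/6\pi c$, approximate SI constants typed in by hand, and a crude collapse time defined as the kinetic energy divided by the radiated power (an assumption of this example, not a solution of the spiral orbit).

```python
import math

MU0 = 4.0e-7 * math.pi        # vacuum permeability, H/m (approximate)
E_CHARGE = 1.602176634e-19    # C
C_LIGHT = 2.99792458e8        # m/s
M_ELECTRON = 9.1093837e-31    # kg (approximate)
G_ACCEL = 9.81                # m/s^2

def classical_orbit_numbers(r=0.05e-9, v=2.2e6):
    """Revolution rate, acceleration in units of g, Larmor power and a crude lifetime."""
    rev_per_second = v / (2.0 * math.pi * r)
    accel = v * v / r
    power = MU0 * E_CHARGE**2 * accel**2 / (6.0 * math.pi * C_LIGHT)
    kinetic = 0.5 * M_ELECTRON * v * v
    crude_lifetime = kinetic / power   # order-of-magnitude estimate only
    return rev_per_second, accel / G_ACCEL, power, crude_lifetime

if __name__ == "__main__":
    rev, a_over_g, power, tau = classical_orbit_numbers()
    print("revolutions per second:", rev)
    print("acceleration / g      :", a_over_g)
    print("radiated power (W)    :", power)
    print("crude lifetime (s)    :", tau)
```

With these inputs the revolution rate comes out near $7\times10^{15}$ per second, the centripetal acceleration near $10^{22}$ g, and the crude lifetime is a few times $10^{-11}$ s, in line with the collapse time quoted above.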

The classical theories completely break down on this point. But if you use the De Broglie
hypothesis, you can calculate the wavelength of the matter wave associated with the speeding
electron around the nucleus, and compare it with the perimeter of the orbit ($2\pi r$). If they
match by any exact integer factor, the waves would superimpose onto themselves all the
time and form stationary states. Thus the condition for the occurrence of the stationary
states would be

$$2\pi r = n\lambda = \frac{nh}{p}$$

or in terms of the orbital angular momentum J of the electron

$$J = pr = n\hbar \qquad (10.62)$$

where n is any positive integer. This condition is known as the Bohr condition for stationary
states of all hydrogen like atoms, after its Danish inventor Niels Bohr (1913). However, Bohr
proposed it as a hypothesis, and started calling n the principal quantum number only years
later. The angular momenta of the stationary orbits are discrete and jump only in integral
multiples of $\hbar$.

Within two years W. Wilson and Arnold Sommerfeld independently suggested that the
circular orbits of the Bohr atom ought to be replaced by elliptic orbits. They were compelled
to postulate a new, azimuthal quantum number to relate to the ratio of minor to major axes
of the ellipse. They proposed the phase integrals $\oint p_i\,dq_i$ to be similarly quantised as Bohr's,
and gave two quantum conditions instead of one given by Bohr in Eq. (10.62),

$$\oint p_r\,dr = n_r h \quad\text{and}\quad \oint p_\psi\,d\psi = n_\psi h$$

where $n_r$ is called the radial quantum number and $n_\psi$ the azimuthal quantum number, the
$\psi$-motion being described in the plane of the orbit. They showed that the ratio of the minor to
major axis of the elliptical orbit is simply $n_\psi/(n_r + n_\psi)$, and the energy of the electron in
that orbit is proportional to $(n_r + n_\psi)^{-2}$. The principal quantum number n is defined to
be $n = n_r + n_\psi$, where $n \geq n_\psi > 0$, and the values $n_\psi$ = 1, 2, 3, 4, ... are spectroscopically
identified as the sharp (s), principal (p), diffuse (d), fundamental (f), ... series of spectral
lines respectively. For a given n, the degeneracies in energy for the possible range of $n_\psi$
were shown by Sommerfeld to be removed due to special relativistic variation of mass over
elliptical orbits.

The connection of these phase integrals with the action variables and the link between
action variables and adiabatic invariants were pointed out by Epstein (1916) and Schwarzschild
(1916). This paper of Schwarzschild was published on the day he died. Schwarzschild showed
that there should be three quantum conditions (10.47) for three adiabatic invariants or
action integrals, with $n_\psi = n_\theta + n_\phi$, $n_\theta$ being the latitudinal quantum number, and $n_\phi$ the
equatorial or magnetic quantum number. Since the Kepler problem could also be treated
in parabolic coordinates, the question was whether parabolic quantum numbers should also
give the same answer to the energy levels. This ambiguity was successfully resolved by both
Epstein and Schwarzschild, and the results of computation for the Stark effect in parabolic
coordinates agreed well with those computed on the basis of the spherical polar coordinates
(work out problem 10.6).

In a sense, all these developments glorified the accidental symmetries of the Coulomb
field in atoms, as well as the power of the well-developed classical techniques for handling such
subtle issues.


10.8 SUMMARY 

The standard method of formulating and solving any dynamical problem according to the 
Hamilton-Jacobi theory has been given. Like D’Alembert’s principle, the formulated prob¬ 
lem reduces to a single equation, in the present case a first order partial differential equation 
(of usually the second degree). By Jacobi’s theorem any complete integral of this partial 
differential equation is capable of producing the full solution to the problem. 

However, finding a complete integral is the most challenging part of this scheme. Com¬ 
plete separability of variables becomes a precondition to finding a complete integral, and 
therefore, only those problems can be fully studied for which complete separability in the 
chosen variables is achieved. Subject to tliis, all the hidden constants of motion, however 
complicated in form, can be found out. In the following chapter we shall see that for small 
oscillations, the separability is guaranteed, — obviously not for any arbitrary motion of 
the system but for small amplitude oscillations. The necessary and sufficient conditions for 
complete separability for any arbitrary dynamical system are given by Levi Civita. 

Periodic systems can be studied with the help of action-angle variables provided the
systems are conservative and completely separable in coordinates. All the frequencies can
easily be found without solving the equations of motion. Only in degenerate cases can
more than one set of coordinates make the characteristic function completely separable.

Adiabatic invariants are extremely useful for handling periodic systems that are subject to 
slow perturbation. In nature, no finite system can be considered as totally closed. Periodic
systems react in such a way that all the action integrals of the system remain unchanged to 
the first order of perturbation. Even atoms do not make a transition unless a sufficiently 
strong perturbation is given to the system leading to a direct change in the action integrals 
that label the orbitals. 

The final section on classical-quantum correspondences points out the fact that classical 
mechanics has indeed provided not only all the raw material but also a fully developed 
infrastructure needed for the early development of quantum mechanics at an extraordinarily 
fast pace. 


PROBLEMS 

10.1 Prove Jacobi’s theorem for the time independent Hamilton-Jacobi theory. 

10.2 Solve for the parabolic motion of projectiles under the constant force of earth’s gravity 
by using the Hamilton-Jacobi method. 




10.3 Consider the motion of a body of unit mass on the constrained path $y = \cosh x$
under a potential $V = x^2/2$. Solve Hamilton's equations of motion directly as well
as by using the Hamilton-Jacobi method.


10.4 Find the canonical transformation through the solution of the Hamilton-Jacobi
equation, the generating function for which is $F_2 = S(q, P)$, that transforms the
Hamiltonian

$H(q, p) = \frac{p^2}{2m} + \frac{m\omega^2 q^2}{2}$

to $K(Q, P) = f(P)$ only.

10.5 Solve the Hamilton-Jacobi equation for a particle moving under a potential field given
by

$V(\mathbf{r}) = \frac{\mathbf{a} \cdot \mathbf{r}}{r^3}$

the field of a dipole, where $\mathbf{a}$ is the constant dipole moment vector. Choose the
z-axis along $\mathbf{a}$. Find the integrals of motion and the total cross-section for particle
absorption into the central region of $V(\mathbf{r})$.

10.6 Separate the variables in the Hamilton-Jacobi equations for the problem of the Stark 
effect with the potential 

V(r,z) = y + Ez 

where $E$ is the applied constant electric field along the z-axis, using the parabolic
coordinates $u$, $v$ and $\phi$:

$x = \sqrt{uv}\,\cos\phi \qquad y = \sqrt{uv}\,\sin\phi \qquad z = (u - v)/2$


10.7 Find the action and angle variables for the potential energy $V(q) = V_0 \tan^2(aq)$,
where $V_0$ and $a$ are positive constants. What is the angular frequency of oscillation?

10.8 Show that the motion of a particle of mass m under a noncentral potential energy 


V(r) 


2 mr 2 


sec V - 


leads to identical expressions for the action integrals $J_r$ and $J_\phi$ as those of
Kepler's problem. Find the energy function as $E(J_r, J_\theta, J_\phi)$ and show further that the
frequencies $\nu_r = \nu_\phi$ but $\nu_\theta = 2\nu_\phi$.

10.9 A particle moves inside an elastic sphere, the radius of which changes slowly. How 
does its energy change? 


10.10 Determine the change in the energy of a charged particle moving in a central field of 
potential V(r) when a weak uniform magnetic field B is slowly switched on. Take 
the Hamiltonian 


^ ( p; + ^ t ?&#) + mV(r> - h tBr * 






11 

Small Oscillations 


11.0 INTRODUCTION 

So far we have been dealing with dynamical systems moving under given constraints, and
have sought complete solutions as functions of the initial conditions and time. However, in
many situations we notice that a dynamical system is apparently at rest or in some steady 
state condition. If we give a small perturbation to the system possibly by pushing or shaking 
it a little bit, only two things may happen: either the system as a whole or part of it settles 
in a to-and-fro oscillation about its original position or configuration, or it moves away and 
does not return to its original position at all. In this chapter, we shall study the fate of all 
possible small perturbations given to a dynamical system in mechanical equilibrium, and 
see under what circumstances oscillations are possible, and with what frequencies, phase 
lags, etc. 

The theory of small oscillations was developed primarily by Lagrange, D'Alembert,
Poincaré, and Liapounoff. In recent times, stability analysis has become an extremely
important part of all kinds of dynamical investigations. The key question is: given a small
perturbation, does it grow with time or, equivalently, what is the dispersion relation (see
Eqs (11.23) and (11.24)) for imposed periodic solutions?

11.1 TYPES OF EQUILIBRIA AND THE POTENTIAL AT EQUILIBRIUM 

An equilibrium state of a system is defined to be one in which all forces (internal as well as
external) cancel for some configuration of the system. A system in a state of equilibrium
continues to be in that state for all times unless perturbed by an external agency. However,
the concept of equilibrium can also be defined without making any reference to force. It
can be defined as a state in which the time derivatives of physical variables (observables)
vanish. Therefore, this definition can very well be used in the context of quantum mechanical
descriptions.

Equilibria are classified in the following way. 

Static Equilibrium 

The state of zero kinetic energy continues for an indefinite period, and the immediate
surroundings of the system do not change with time. An example is a stone at the bottom of
a valley.




Dynamic Equilibrium 

The net force on the system is zero and the system continues with zero kinetic energy, but
the immediate surroundings of the system change with time in such a way that they exert a
balancing force on the system, which contributes to the net force experienced by the system.
Examples: charge neutrality of atoms, molecules and solids makes them exert zero electrical 
force on one another, but each of them is in dynamical equilibrium. To give another example, 
consider a ball static on the head of a fountain. It has zero kinetic energy, but its immediate 
surrounding is changing all the time, hence it is in dynamical equilibrium. 

Stable Equilibrium 

Given a small displacement, the system tends to return to the original equilibrium
configuration. Example: the bob of a simple pendulum in its equilibrium state satisfies the
above stability criterion.

Unstable Equilibrium 

Given a small displacement, the system does not return to the original equilibrium
configuration. Example: a large stone sitting on the upper edge of a cliff.

Metastable Equilibrium 

Given a sufficiently large displacement, the system fails to return, although for smaller 
displacements it could return to the original equilibrium configuration. Example: a balloon 
which explodes above a certain gas pressure. 

Let us now consider the dynamical configuration of the system from the point of view of 
energy, particularly potential energy. The total energy of a conservative system is given by 

E = T + V = kinetic energy + potential energy 

Since kinetic energy is always a non-negative quantity ($T \ge 0$), we must have $E \ge V$ for
allowed motions, and the system can stay at the equilibrium state ($T = 0$) only if $E = V$.
We know that for a conservative system the total energy E cannot change with time so 
that 

$E = E_0 = \text{const.}$

and for the system at rest and at equilibrium, 

$V = V_0 = \text{const.} = E_0$

Now in order to judge the stability of an equilibrium state, we have to give a small
perturbation to the system. For simplicity, let us consider the motion in one dimension, and
assume that the potential energy $V$ is a given function of the space coordinate $x$. Let the
system be in a state of static stable equilibrium with energy $E_0$ and position $x_0$ (see Fig.
11.1). Obviously the kinetic energy of the system in this state is zero ($T_0 = 0$). Now let us
displace the system to a nearby point $x_1$ (by applying some external force other than the
given potential field) and release it. (The extra perturbing force is then switched off.) The
potential energy of the system at $x_1$ is now $V(x_1)$ and the kinetic energy at $x_1$ is zero. So




The first of these two equations follows simply from the requirement that the net external 
force on the system in equilibrium must vanish. 

Now let us expand $V(x)$ in the vicinity of the stable equilibrium position $x_0$. This gives

$V(x) = V(x_0) + \left(\frac{dV}{dx}\right)_{x_0}(x - x_0) + \frac{1}{2}\left(\frac{d^2V}{dx^2}\right)_{x_0}(x - x_0)^2 + \cdots$ (11.2)

Let us further choose the origin of the coordinate system at the position of the equilibrium
so that $x_0 = 0$ and also choose $V(x_0) = V(0) = 0$. Using Eq. (11.1) and neglecting
higher order terms in Eq. (11.2) we get

$V(x) = \frac{1}{2}kx^2$ (11.3)

where $k > 0$ is the constant value of $(d^2V/dx^2)$ at $x = x_0 = 0$. Now, the kinetic
energy of the system is

$T = \frac{1}{2}mv^2 = \frac{1}{2}m\dot{x}^2$

so that the Lagrangian of the system slightly displaced from the position of stable
equilibrium, for the 1-D case, becomes

$L(x, \dot{x}) = \frac{1}{2}mv^2 - \frac{1}{2}kx^2 = \frac{1}{2}m\dot{x}^2 - \frac{1}{2}kx^2$ (11.4)

This gives rise to the equation of motion in the form

$m\ddot{x} + kx = 0$ (11.5)

Since both $k$ and $m$ are positive constants, Eq. (11.5) is the equation of motion of a simple
harmonic oscillator, whose solution is given by

$x = c_1\cos(\omega t) + c_2\sin(\omega t)$

where

$\omega = \sqrt{k/m}$

or

$x = a\cos(\omega t + \delta)$ (11.6)

with

$c_1 = a\cos\delta \qquad c_2 = -a\sin\delta \qquad a = \sqrt{c_1^2 + c_2^2}$

and

$\delta = \tan^{-1}(-c_2/c_1)$

Therefore, the angular frequency of oscillation is simply the square root of the ratio of the
coefficient of $x^2$ in the expression for potential energy to that of $v^2$ in the expression for
kinetic energy.
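As a numerical illustration of Eqs (11.3) to (11.6), the following Python sketch estimates $k = (d^2V/dx^2)$ at the equilibrium point by a central difference and then obtains $\omega$, $a$ and $\delta$ from given initial conditions. The function names, the step size `h` and the sample potential $V = 1 - \cos x$ are assumptions made only for this illustration.

```python
import math

def small_oscillation_omega(V, x0, m, h=1e-3):
    """Angular frequency sqrt(k/m), where k = V''(x0) is estimated by a central difference."""
    k = (V(x0 + h) - 2.0 * V(x0) + V(x0 - h)) / h**2
    if k <= 0.0:
        raise ValueError("V''(x0) <= 0: x0 is not a point of stable equilibrium")
    return math.sqrt(k / m)

def amplitude_and_phase(x_init, v_init, omega):
    """Return (a, delta) such that x(t) = a*cos(omega*t + delta) matches x(0) and v(0)."""
    a = math.hypot(x_init, v_init / omega)
    delta = math.atan2(-v_init / omega, x_init)
    return a, delta

# Assumed example: V(x) = 1 - cos(x) with m = 1, for which k = 1 at x0 = 0.
omega = small_oscillation_omega(lambda x: 1.0 - math.cos(x), 0.0, 1.0)
a, delta = amplitude_and_phase(0.1, 0.0, omega)
print(omega, a, delta)
```

The two-argument arctangent is used so that $\delta$ falls in the correct quadrant for any signs of the initial coordinate and velocity.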

In terms of the initial coordinate x a and initial speed v a , the expressions for the ampli- 






Fig. 11.2 Examples of oscillators: (a) a diatomic molecule, (b) 1-D constrained motion of
a particle on a horizontal line when the particle is tied to one end of a spring (of
length 1 in its relaxed state), the other end being fixed at a height $l$ above the
origin, (c) a column of liquid in a U-tube


combinations having reduced masses $\mu'$ and $\mu$, is given by

$\frac{\omega'}{\omega} = \sqrt{\frac{m_1 m_2 (m_1 + m_2')}{m_1 m_2' (m_1 + m_2)}}$ (11.11)

Hence by measuring $\omega'$ and $\omega$, the difference between the isotopic reduced masses can
be determined, which will help identify the isotopes themselves.
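The ratio in Eq. (11.11) can be evaluated directly. The sketch below assumes that the force constant is the same for both isotopic combinations, so that the frequencies differ only through the reduced masses; the function names and the sample mass numbers 1, 35 and 37 are illustrative assumptions, not measured data.

```python
import math

def reduced_mass(m1, m2):
    """Reduced mass of a two-body system."""
    return m1 * m2 / (m1 + m2)

def isotope_frequency_ratio(m1, m2, m2_prime):
    """omega'/omega for an unchanged force constant, i.e. sqrt(mu/mu') as in Eq. (11.11)."""
    return math.sqrt(reduced_mass(m1, m2) / reduced_mass(m1, m2_prime))

# Assumed mass numbers used purely as an example.
ratio = isotope_frequency_ratio(1.0, 35.0, 37.0)
print(ratio)
```

Since the heavier isotope has the larger reduced mass, the ratio comes out slightly below unity.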

2. The small oscillations of a massless spring fixed at one end A (see Fig. 11.2b) and the 
other end carrying a particle of mass m which is constrained to move on a horizontal axis. 

The spring requires a force $F = |\mathbf{F}|$ when its lower end is at B. We want to find the
frequency of oscillation of the particle if the spring constant is $k$, and the length AB $= l$.

The potential energy of the spring at any point at a distance $x$ from B, compared to that
at B, is given by

$V = F\,\delta l$

For $x \ll l$, $|\delta l| = \sqrt{l^2 + x^2} - l \simeq x^2/2l$, and therefore $V = Fx^2/2l$. Thus the
Lagrangian for the system is

$L = \frac{1}{2}m\dot{x}^2 - \frac{Fx^2}{2l}$

giving the frequency of oscillation

$\omega = \sqrt{\frac{F}{ml}}$
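The approximation used in this example can be checked numerically. The sketch below compares the curvature of the potential at $x = 0$ with $F/l$. It assumes that the energy of the spring relative to B is $F\,\delta l + k(\delta l)^2/2$; the second term is an added assumption that is of fourth order in $x$ and so does not affect the small oscillation frequency. All numerical values and function names are arbitrary choices for illustration.

```python
import math

def spring_potential(x, F, k, l):
    """Energy relative to B: F*dl + k*dl**2/2, with dl the extra extension at displacement x."""
    dl = math.hypot(l, x) - l
    return F * dl + 0.5 * k * dl**2

def small_oscillation_omega(F, m, l):
    """Angular frequency sqrt(F/(m*l)) obtained from V = F*x**2/(2*l)."""
    return math.sqrt(F / (m * l))

# Assumed sample values.
F, k, l, m, h = 2.0, 50.0, 0.5, 0.1, 1e-3
curvature = (spring_potential(h, F, k, l) - 2.0 * spring_potential(0.0, F, k, l)
             + spring_potential(-h, F, k, l)) / h**2
print(curvature, F / l, small_oscillation_omega(F, m, l))
```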





any generality, the above inequality becomes

$V(q_1, \ldots, q_n) \ge 0$ (11.14)

with the equality applying only at the origin.

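Expanding $V$ in a Taylor's series in $q_i$ for small values of $q_i$'s one gets

$V(q_1, \ldots, q_n) = V(0, \ldots, 0) + \left(\frac{\partial V}{\partial q_i}\right)_0 q_i + \frac{1}{2}\left(\frac{\partial^2 V}{\partial q_i \partial q_j}\right)_0 q_i q_j + \text{higher order terms}$

The first term vanishes by choice, the second term vanishes by condition (11.13), and by
the inequality (11.14) we must have $(\partial^2 V/\partial q_i \partial q_j)_0\, q_i q_j \ge 0$. Writing

$b_{ij} = \frac{1}{2}\left(\frac{\partial^2 V}{\partial q_i \partial q_j}\right)_0$

which are constants strictly independent of $q_i$'s, the positivity condition on $V$ given by Eq.
(11.14) becomes

$b_{ij}\, q_i q_j \ge 0$ (11.15)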


with the equality holding only at the origin ($q_1 = q_2 = \ldots = q_n = 0$). Inequality
(11.15) can be written in terms of matrices as follows

$\begin{pmatrix} q_1 & \cdots & q_n \end{pmatrix} \begin{pmatrix} b_{11} & \cdots & b_{1n} \\ \vdots & & \vdots \\ b_{n1} & \cdots & b_{nn} \end{pmatrix} \begin{pmatrix} q_1 \\ \vdots \\ q_n \end{pmatrix} \ge 0$ (11.16)

Since $q_i$'s are arbitrary (except for being defined in some small neighbourhood of the
origin), Eq. (11.16) is really a condition on the nature of the matrix $[b_{ij}]$ requiring it
to be positive definite (that is, the determinants of all orders up to $n$ formed out of $[b_{ij}]$,
namely its leading principal minors, are non-zero and positive). Thus the stability of the
equilibrium at the origin requires that $[b_{ij}]$ as defined by Eq. (11.15) be a positive
definite matrix.
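The determinant test quoted in parentheses above, commonly known as Sylvester's criterion, is easy to try out on small matrices. The following is a minimal sketch for a real symmetric matrix stored as a list of rows; the function names and the sample matrices are assumptions, and the Laplace expansion used for the determinant is adequate only for small $n$.

```python
def determinant(M):
    """Determinant by Laplace expansion along the first row (adequate for small matrices)."""
    n = len(M)
    if n == 1:
        return M[0][0]
    total = 0.0
    for c in range(n):
        minor = [row[:c] + row[c + 1:] for row in M[1:]]
        total += (-1) ** c * M[0][c] * determinant(minor)
    return total

def is_positive_definite(b):
    """True when every leading principal minor of the symmetric matrix b is positive."""
    return all(determinant([row[:k] for row in b[:k]]) > 0.0
               for k in range(1, len(b) + 1))

# Assumed sample matrices: the first is positive definite, the second is not.
print(is_positive_definite([[2.0, -1.0], [-1.0, 2.0]]))
print(is_positive_definite([[1.0, 2.0], [2.0, 1.0]]))
```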

Similarly the kinetic energy of a scleronomic system is homogeneously quadratic in the
generalised velocities and is given by

$T = a'_{ij}\,\dot{q}_i \dot{q}_j$

where $\{a'_{ij}\}$, in general, are functions of $q_i$'s. Expanding $a'_{ij}$ around the origin (a point of
stable equilibrium) in Taylor's series, we get

$a'_{ij}(q_1, \ldots, q_n) = a'_{ij}(0, \ldots, 0) + \left(\frac{\partial a'_{ij}}{\partial q_k}\right)_0 q_k + \frac{1}{2!}\left(\frac{\partial^2 a'_{ij}}{\partial q_k \partial q_l}\right)_0 q_k q_l + \ldots$

Now it turns out that in most of the cases, the quantities $(\partial a'_{ij}/\partial q_k)$ and their higher
order derivatives evaluated at the origin are negligible. In the case of Cartesian coordinates
these are exactly zero; for spherical polar coordinates these are of second order smallness.




Hence, to a good approximation, the $a'_{ij}$'s are taken to be their values at the position of
stable equilibrium, that is, the origin, and are therefore set to

$a_{ij} = a'_{ij}(0, \ldots, 0)$

Therefore, the kinetic energy is given by

$T = a_{ij}\,\dot{q}_i \dot{q}_j \ge 0$ (11.17)

which is once again a non-negative quantity, the equality holding only for the case when all
$\dot{q}_i$'s are zero.

Again, the inequality in (11.17) is a matrix inequality involving the matrix $[a_{ij}]$ and
column vectors $[\dot{q}_i]$ and $[\dot{q}_j]$. Thus $[a_{ij}]$ is also a positive definite matrix. Further, we note



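$a_{ij} = \frac{1}{2}\left(\frac{\partial^2 T}{\partial \dot{q}_i \partial \dot{q}_j}\right)_0 = a_{ji}$

and

$b_{ij} = \frac{1}{2}\left(\frac{\partial^2 V}{\partial q_i \partial q_j}\right)_0 = b_{ji}$ (11.18)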


Equations (11.18) imply that both $[a_{ij}]$ and $[b_{ij}]$ are symmetric matrices. Further, since all
$\{q_i\}$ and $\{\dot{q}_j\}$ are real, $[a_{ij}]$ and $[b_{ij}]$ must also be real by their definitions. Hence both
$[a_{ij}]$ and $[b_{ij}]$ are real symmetric positive definite matrices of order $(n \times n)$, where $n$ is the
number of DOF of the system.

Now we can proceed to form the Lagrangian and Lagrange's equations of motion for small
oscillations in the coordinates $q_i$'s. We have

$L = T - V = (a_{ij}\,\dot{q}_i \dot{q}_j - b_{ij}\, q_i q_j)$ (11.19)

where $a_{ij}$'s and $b_{ij}$'s are constants independent of time and coordinates, and the summation
over $i$ and $j$ is implied. Lagrange's equations of motion are, therefore, given by

$(a_{ij}\,\ddot{q}_j + b_{ij}\, q_j) = 0 \qquad i = 1, \ldots, n$ (11.20)

These are n equations of coupled harmonic motion, each equation having n coupled 
terms involving, in general, all the n generalised coordinates and their second order total 
time derivatives. 

Let us look for a solution corresponding to each of the $n$ coordinates executing an SHM
with a single period (or, equivalently, with the same angular frequency, say $p$) given by

$q_j = A_j \exp(\sqrt{-1}\, pt)$ for $j = 1, \ldots, n$ (11.21)

with the amplitudes $A_j$ which are in general complex (containing a phase factor $\exp(\sqrt{-1}\,\theta)$).
If such a secular solution exists, it would correspond to what is called a normal mode of small
oscillations in which each and every generalised coordinate of the system (that is, the system
as a whole) oscillates with a single frequency $p$, but the amplitudes (and phases) of oscillation
might differ from coordinate to coordinate.
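To see concretely what the trial solution (11.21) demands, one may insert it into the left-hand side of Eqs (11.20) and evaluate the result numerically. The sketch below does this for an assumed pair of $2 \times 2$ matrices; the matrices, the amplitude vectors and the function name are hypothetical choices made for illustration. The residual vanishes only when the amplitudes and the frequency are matched to each other.

```python
import cmath

def equation_residual(a, b, A, p, t):
    """Left-hand side of Eqs (11.20) for q_j = A_j exp(i p t), one entry per equation i."""
    n = len(A)
    phase = cmath.exp(1j * p * t)
    q = [A[j] * phase for j in range(n)]
    q_ddot = [-(p ** 2) * A[j] * phase for j in range(n)]
    return [sum(a[i][j] * q_ddot[j] + b[i][j] * q[j] for j in range(n))
            for i in range(n)]

# Assumed example: the amplitudes (1, -1) with p^2 = 3 make every equation vanish,
# whereas the amplitudes (1, 0) with the same p do not.
a = [[2.0, 1.0], [1.0, 2.0]]
b = [[2.0, -1.0], [-1.0, 2.0]]
good = equation_residual(a, b, [1.0, -1.0], 3.0 ** 0.5, 0.7)
bad = equation_residual(a, b, [1.0, 0.0], 3.0 ** 0.5, 0.7)
print(max(abs(r) for r in good), max(abs(r) for r in bad))
```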

Let us now use the expressions (11.21) as the desired solutions to Eqs (11.20) and see 
under what conditions the system oscillates in a normal mode. So we substitute these 




is the most general solution for any arbitrarily excited oscillation of the system, provided 
all the normal mode solutions are linearly independent. For example, when a string is 
arbitrarily excited it vibrates with all possible harmonics, but one can always excite it in 
such a special way that the whole string vibrates with a single frequency, corresponding to
a particular normal mode.

11.2.1 Properties of $A_i$ for All the $n$ Distinct Values of $p^2$

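Let us select any two distinct eigen-frequencies for two distinct normal modes with $p^2 = p_\alpha^2$
and $p_\beta^2$, say. These would then satisfy

$-\sum_j a_{ij}\, p_\alpha^2 A_{j\alpha} + \sum_j b_{ij} A_{j\alpha} = 0$

$-\sum_j a_{ij}\, p_\beta^2 A_{j\beta} + \sum_j b_{ij} A_{j\beta} = 0$

$\alpha$ and $\beta$ not being summed over. We retain the summation signs for reasons of clarity,
albeit sacrificing brevity. These equations are each a set of $n$ equations, with the sets of
$n + n$ unknowns $\{A_{j\alpha}\}$ and $\{A_{j\beta}\}$, respectively. Multiplying the first by $\{A_{i\beta}\}$ and the
second by $\{A_{i\alpha}\}$, summing over all the indices (except, of course, $\alpha$ and $\beta$) and using the
symmetry of $[a_{ij}]$ and $[b_{ij}]$, we get

$-\sum_{i,j} a_{ij}\, p_\alpha^2 A_{i\beta} A_{j\alpha} + \sum_{i,j} b_{ij} A_{i\beta} A_{j\alpha} = 0$
$-\sum_{i,j} a_{ij}\, p_\beta^2 A_{i\beta} A_{j\alpha} + \sum_{i,j} b_{ij} A_{i\beta} A_{j\alpha} = 0$ (11.28)

Now subtracting the first from the second of the above sets of equations

$\sum_{i,j} a_{ij} A_{i\beta} A_{j\alpha}\,(p_\alpha^2 - p_\beta^2) = 0$ (11.29)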

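Since by our assumption the two roots are distinct, $p_\alpha^2 \ne p_\beta^2$, we get

$\sum_{i,j} a_{ij} A_{i\beta} A_{j\alpha} = 0$ for $\alpha \ne \beta$ (11.30)

But if we had chosen the two roots $p_\alpha^2$ and $p_\beta^2$ to be identical, Eq. (11.29) would have
been satisfied identically and the sum need not vanish, that is

$\sum_{i,j} a_{ij} A_{i\beta} A_{j\alpha} \ne 0$ for $\alpha = \beta$

Indeed, since $[a_{ij}]$ is a positive definite real symmetric matrix, we know that $\sum a_{ij} A_{i\alpha} A_{j\alpha}$
$= \Delta_\alpha$, say, must be a positive quantity for any non-zero amplitudes, which is the second
case above. In fact one can easily normalise $A_{i\alpha}$ and $A_{j\alpha}$ by dividing them by
$\sqrt{\Delta_\alpha} = C_\alpha$, say, such that

$\sum_{i,j} a_{ij} A_{i\alpha} A_{j\alpha} = 1$ (11.31)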
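The orthogonality relation (11.30) and the normalisation (11.31) can be verified on a small example. In the sketch below, the matrix $[a_{ij}]$ and the two amplitude vectors are assumed values: they are the modes of a hypothetical system with $[b_{ij}]$ having rows $(2, -1)$ and $(-1, 2)$, whose roots are $p^2 = 1/3$ and $p^2 = 3$. The function names are likewise illustrative.

```python
import math

def a_product(a, u, v):
    """The bilinear form sum_ij a_ij u_i v_j appearing in Eqs (11.30) and (11.31)."""
    n = len(a)
    return sum(a[i][j] * u[i] * v[j] for i in range(n) for j in range(n))

def normalise(a, u):
    """Divide u by the square root of a_product(a, u, u) so that Eq. (11.31) holds."""
    norm = math.sqrt(a_product(a, u, u))
    return [x / norm for x in u]

a = [[2.0, 1.0], [1.0, 2.0]]   # assumed positive definite matrix
mode_1 = [1.0, 1.0]            # assumed amplitude vector for p^2 = 1/3
mode_2 = [1.0, -1.0]           # assumed amplitude vector for p^2 = 3
cross_term = a_product(a, mode_1, mode_2)
unit_1 = normalise(a, mode_1)
print(cross_term, a_product(a, unit_1, unit_1))
```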




where use has been made of the relations (11.32 a,b). 

Thus in terms of these new coordinates $\{Q_\alpha\}$ and their corresponding coordinate
velocities $\{\dot{Q}_\alpha\}$ the old $[a_{ij}]$ matrix has transformed into an identity matrix and $[b_{ij}]$
has turned into a diagonal matrix with the diagonal elements as $p_1^2, \ldots, p_n^2$, that is

$[b_{ij}] = \mathrm{diag}\,(p_1^2, \ldots, p_n^2)$

We can say that the coordinate transformation (11.34) has completely diagonalised the
matrices $[a_{ij}]$ and $[b_{ij}]$. Any transformation that diagonalises a matrix is called a principal
transformation or a similarity transformation. Thus the transformation given by Eq. (11.34)
is called the principal transformation and the coordinates $\{Q_\alpha\}$ are called the normal
coordinates. In terms of these normal coordinates, the Lagrangian becomes

$L = T - V = \sum_\alpha \left(\dot{Q}_\alpha^2 - p_\alpha^2 Q_\alpha^2\right)$ (11.38)

each coordinate term being orthogonally separated. The Lagrangian equations of motion
expressed in the normal coordinates reduce to

$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{Q}_\alpha}\right) - \left(\frac{\partial L}{\partial Q_\alpha}\right) = 0$

or

$\ddot{Q}_\alpha + p_\alpha^2 Q_\alpha = 0 \qquad \alpha = 1, \ldots, n$ (11.39)

with the solution for each normal coordinate $Q_\alpha$

$Q_\alpha = C_\alpha \cos(p_\alpha t + \theta_\alpha)$ (11.40)

which naturally justifies the principal transformations themselves as expressed in Eq. (11.34).
So in terms of the normal coordinates $\{Q_\alpha\}$, the system is effectively decoupled into $n$
independent simple harmonic oscillations in $n$ normal coordinates.
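The simultaneous reduction of $[a_{ij}]$ to the identity and of $[b_{ij}]$ to a diagonal matrix can be checked by forming the products $\tilde{A}\,a\,A$ and $\tilde{A}\,b\,A$, where the columns of $A$ are the normalised amplitude vectors. The sketch below uses assumed $2 \times 2$ matrices whose normalised mode vectors are $(1, 1)/\sqrt{6}$ and $(1, -1)/\sqrt{2}$; the matrices and helper names are hypothetical.

```python
import math

def transpose(M):
    return [list(col) for col in zip(*M)]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
            for i in range(len(X))]

def transform(A, M):
    """Return A^T M A for the matrix A whose columns are the normalised amplitude vectors."""
    return matmul(transpose(A), matmul(M, A))

# Assumed example matrices and their normalised mode vectors (as columns of A).
a = [[2.0, 1.0], [1.0, 2.0]]
b = [[2.0, -1.0], [-1.0, 2.0]]
s6, s2 = math.sqrt(6.0), math.sqrt(2.0)
A = [[1.0 / s6, 1.0 / s2],
     [1.0 / s6, -1.0 / s2]]
new_a = transform(A, a)   # expected to be close to the identity matrix
new_b = transform(A, b)   # expected to be close to diag(1/3, 3)
print(new_a)
print(new_b)
```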

The principal oscillation corresponding to each distinct $\alpha$ (or normal mode) is given by
the $n$-dimensional vector $\mathbf{q}_\alpha(q_{1\alpha}, \ldots, q_{n\alpha})$ in the configuration space of the dynamical
system, with individual components

$q_{i\alpha} = C_\alpha A_{i\alpha} \cos(p_\alpha t + \theta_\alpha)$ (11.41)

where a is not to be summed over. There are n such principal oscillations, one for each 
normal mode, or equivalently for each a. These are n orthogonal vectors in the configuration 
space. A superposition of all these n principal oscillations for n distinct normal modes 
gives any arbitrary oscillation of the system, which is, as it should be, just an arbitrary 
vector in the configuration space. 

It is to be noted that the expressions for the principal oscillations contain time dependent
periodic terms. Does it then mean that the directions of principal oscillations when
mapped in the configuration space are continually changing with time? Fortunately, they 
do not, simply because all the coordinates for a given principal oscillation have identical 
periods, and identical phases apart from a difference of 180 degrees for some coordinates. 
So each normal mode solution corresponds to an oscillation in a particular direction in the 




configuration space. The normal coordinate for a given principal oscillation represents the 
actual behaviour of the displacement along the direction of principal oscillation. We can of 
course imagine the normal coordinates to span a new n dimensional configuration space, 
in which the individual principal oscillations will execute simple harmonic motions with the 
respective normal mode frequencies along the respective new coordinate axes. 

11.2.4 Degenerate Cases 

For the case when degenerate roots, say $p_\alpha^2$, of Eq. (11.23) have multiplicity $m$, Lagrange
had thought that the general solution of such problems of small oscillations might not be
expressible as a linear combination of $n$ normal mode solutions because the latter are not
linearly independent. However, it was later demonstrated by Weierstrass that for each root
of Eq. (11.23) with multiplicity $n \ge m \ge 2$ there exist exactly $m$ linearly independent
solutions of the system of $n$ linear equations so that a total of $n$ linearly independent
amplitude vectors can be found including the ones for the degenerate modes. Hence the
above form of the general solution remains valid even for a system having some degenerate
modes of multiplicities $m \ge 2$.
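A small example makes the statement about degenerate roots tangible. In the sketch below $[a_{ij}]$ is taken to be the identity, so that $p^2$ is an ordinary eigenvalue of an assumed $3 \times 3$ matrix $[b_{ij}]$ with a simple root $p^2 = 1$ and a double root $p^2 = 4$. The matrix and the vectors are hypothetical choices; the point is only that two independent amplitude vectors exist for the double root.

```python
def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

# Assumed example with [a_ij] equal to the identity matrix.
b = [[3.0, -1.0, -1.0],
     [-1.0, 3.0, -1.0],
     [-1.0, -1.0, 3.0]]
modes = [
    (1.0, [1.0, 1.0, 1.0]),    # simple root p^2 = 1
    (4.0, [1.0, -1.0, 0.0]),   # double root p^2 = 4, first vector
    (4.0, [1.0, 1.0, -2.0]),   # double root p^2 = 4, second vector
]
# Each entry is True when b v = p^2 v holds for the listed vector.
is_mode = [matvec(b, v) == [p2 * x for x in v] for p2, v in modes]
# Gram matrix of the three vectors; zero off-diagonal entries show mutual orthogonality,
# and hence linear independence.
gram = [[dot(u, v) for _, v in modes] for _, u in modes]
print(is_mode)
print(gram)
```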


11.2.5 An Example of Small Amplitude Oscillations : Compound Pendula 


Two pendula having identical bobs of mass ($m$) and lengths of suspension ($l$) are hung
from a horizontal ceiling some horizontal distance ($l_0$) apart and are connected by a spring
(of constant $k$ and length $l_0$ at relaxation) the ends of which are tied with the strings
of suspension at a distance $h$ below their respective points of suspension. Let the angles
of deflection $\phi_1$ and $\phi_2$ of the pendula from the respective vertical lines in their common
plane of oscillation be the two independent generalised coordinates. Assume $\phi_1$ and $\phi_2$ to
be small enough for the motion to be a small oscillation about the stable equilibrium
configuration, viz., $\phi_1 = \phi_2 = 0$. First we form the Lagrangian.

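The kinetic energy $T = \frac{1}{2}ml^2(\dot{\phi}_1^2 + \dot{\phi}_2^2) = a_{ij}\,\dot{\phi}_i \dot{\phi}_j$, giving

$[a_{ij}] = \begin{pmatrix} \frac{1}{2}ml^2 & 0 \\ 0 & \frac{1}{2}ml^2 \end{pmatrix}$

The potential energy is, assuming $\phi_1$ and $\phi_2$ to be small,

$V = mgl(1 - \cos\phi_1) + mgl(1 - \cos\phi_2) + \frac{1}{2}kh^2(\phi_1 - \phi_2)^2$
$\simeq \frac{1}{2}(mgl + kh^2)(\phi_1^2 + \phi_2^2) - kh^2\phi_1\phi_2$

giving

$[b_{ij}] = \begin{pmatrix} \frac{1}{2}(mgl + kh^2) & -\frac{1}{2}kh^2 \\ -\frac{1}{2}kh^2 & \frac{1}{2}(mgl + kh^2) \end{pmatrix}$

The matrices $[a_{ij}]$ and $[b_{ij}]$ are real, symmetric and positive definite; $[a_{ij}]$ is diagonal but
not $[b_{ij}]$.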




Therefore the secular Eq. (11.23) becomes, in this case, dropping all the factors 1/2

$\det \begin{Vmatrix} (mgl + kh^2) - p^2 ml^2 & -kh^2 \\ -kh^2 & (mgl + kh^2) - p^2 ml^2 \end{Vmatrix} = 0$

whose roots are

$p_\alpha^2 = \frac{g}{l}$ and $p_\beta^2 = \frac{mgl + 2kh^2}{ml^2} = \frac{g}{l} + \frac{2kh^2}{ml^2}$

Thus these $p_\alpha$ and $p_\beta$ are the frequencies of the two normal modes of oscillation.
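The two roots can be confirmed numerically by solving the quadratic in $p^2$ that the $2 \times 2$ secular determinant produces. The sketch below is written for any pair of real symmetric $2 \times 2$ matrices with $[a_{ij}]$ positive definite; the function name and the sample values of $m$, $l$, $g$, $k$ and $h$ are assumptions chosen only for illustration.

```python
import math

def normal_mode_frequencies(a, b):
    """Roots p of det(b - p^2 a) = 0 for symmetric 2x2 matrices a and b, smallest first."""
    qa = a[0][0] * a[1][1] - a[0][1] ** 2
    qb = -(a[0][0] * b[1][1] + a[1][1] * b[0][0] - 2.0 * a[0][1] * b[0][1])
    qc = b[0][0] * b[1][1] - b[0][1] ** 2
    root = math.sqrt(max(qb * qb - 4.0 * qa * qc, 0.0))
    return math.sqrt((-qb - root) / (2.0 * qa)), math.sqrt((-qb + root) / (2.0 * qa))

m, l, g, k, h = 1.0, 1.0, 9.81, 2.0, 0.5   # assumed sample values in SI units
a = [[0.5 * m * l**2, 0.0], [0.0, 0.5 * m * l**2]]
b = [[0.5 * (m * g * l + k * h**2), -0.5 * k * h**2],
     [-0.5 * k * h**2, 0.5 * (m * g * l + k * h**2)]]
p_alpha, p_beta = normal_mode_frequencies(a, b)
print(p_alpha, math.sqrt(g / l))
print(p_beta, math.sqrt(g / l + 2.0 * k * h**2 / (m * l**2)))
```

Each printed pair compares the numerical root with the corresponding closed-form expression for the coupled pendula.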

The amplitudes of small oscillations for these two normal modes can be found from solving
the respective sets of linear equations namely

$(b_{ij} - p_\alpha^2 a_{ij}) A_{\alpha j} = 0$

for the first normal mode (frequency $p_\alpha$) and

$(b_{ij} - p_\beta^2 a_{ij}) A_{\beta j} = 0$

for the second mode (frequency $p_\beta$).

For the first mode we have

$(b_{11} - p_\alpha^2 a_{11}) A_{\alpha 1} + (b_{12} - p_\alpha^2 a_{12}) A_{\alpha 2} = 0$
$(b_{21} - p_\alpha^2 a_{21}) A_{\alpha 1} + (b_{22} - p_\alpha^2 a_{22}) A_{\alpha 2} = 0$ (11.42)

Solving these we get

$A_{\alpha 1} = A_{\alpha 2} = \text{const.}$ (say $C_1$)

so that the oscillations in $\phi_1$ and $\phi_2$ for the normal mode frequency $p_\alpha = \sqrt{g/l}$ are

$\phi_{1\alpha} = C_1 \cos(p_\alpha t + \theta_\alpha)$

and

$\phi_{2\alpha} = C_1 \cos(p_\alpha t + \theta_\alpha)$

Thus both the pendula oscillate in phase with the same frequency $p_\alpha = \sqrt{g/l}$,
i.e., they oscillate together keeping the bobs always at the same separation as the distance
between their points of suspension. An abstract vector formed out of the sum of these two
components is a principal oscillation corresponding to the normal mode frequency $p_\alpha$.

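Similarly, for the second normal mode, one replaces $p_\alpha^2$ in Eqs (11.42) by $p_\beta^2$ and
$A_{\alpha 1}$, $A_{\alpha 2}$ by $A_{\beta 1}$, $A_{\beta 2}$ respectively, and obtains a solution

$A_{\beta 1} = -A_{\beta 2} = C_2$, say

Hence

$\phi_{1\beta} = C_2 \cos(p_\beta t + \theta_\beta)$

$\phi_{2\beta} = -C_2 \cos(p_\beta t + \theta_\beta) = C_2 \cos(p_\beta t + \theta_\beta + \pi)$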




Thus in this normal mode of oscillation with frequency

$p_\beta = \sqrt{\frac{g}{l} + \frac{2kh^2}{ml^2}} > p_\alpha$

the pendula will oscillate 180 degrees out of phase, that is, both of them will approach each
other for a while, then go apart and come back and so on. The second principal oscillation
is represented by the linear superposition of these two individual oscillations in two different
coordinates, both executing the second normal mode of frequency $p_\beta$.

Now any general oscillation would be a linear sum of these two principal oscillations, and
obviously it would be a vector in the configuration space of this dynamical system. If we
prefer to write this general solution in two of its generalised coordinate components, they
would look like

$\phi_1 = C_1 \cos(p_\alpha t + \theta_\alpha) + C_2 \cos(p_\beta t + \theta_\beta)$

and

$\phi_2 = C_1 \cos(p_\alpha t + \theta_\alpha) - C_2 \cos(p_\beta t + \theta_\beta)$


It is also seen that the components of any particular principal oscillation have the same
amplitude, same frequency and same argument for the cosine function except for an occasional
change by $\pi$. So it is better to define all these components by a single entity. This is done by
introducing the normal coordinates, one normal coordinate for each normal mode or
equivalently for each principal oscillation. In the present example, the two normal coordinates
are

$Q_\alpha = C_1 \cos(p_\alpha t + \theta_\alpha)$ and $Q_\beta = C_2 \cos(p_\beta t + \theta_\beta)$

so that

$\phi_1 = (Q_\alpha + Q_\beta)$ and $\phi_2 = (Q_\alpha - Q_\beta)$

It is now easy to find out the directions of the principal oscillations in the configuration space.
Since the components of the vectors $(\phi_1, \phi_2)$ are, by definition, the generalised coordinates
orthogonal to each other, the principal oscillations represent a pair of orthogonal vectors at
an angle of 45 degrees to the former.
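The relation between the angles and the normal coordinates of the two pendula can be exercised numerically. The sketch below builds $\phi_1$ and $\phi_2$ from assumed amplitudes and frequencies and then recovers $Q_\alpha$ and $Q_\beta$ by inverting $\phi_1 = Q_\alpha + Q_\beta$, $\phi_2 = Q_\alpha - Q_\beta$. All numerical values and function names are illustrative assumptions.

```python
import math

def pendulum_angles(t, C1, C2, p_alpha, p_beta, theta_alpha=0.0, theta_beta=0.0):
    """General motion of the two pendula as a sum of the two principal oscillations."""
    Q_alpha = C1 * math.cos(p_alpha * t + theta_alpha)
    Q_beta = C2 * math.cos(p_beta * t + theta_beta)
    return Q_alpha + Q_beta, Q_alpha - Q_beta

def normal_coordinates(phi_1, phi_2):
    """Invert phi_1 = Q_alpha + Q_beta and phi_2 = Q_alpha - Q_beta."""
    return 0.5 * (phi_1 + phi_2), 0.5 * (phi_1 - phi_2)

p_alpha, p_beta = math.sqrt(9.81), math.sqrt(10.81)   # assumed sample frequencies
phi_1, phi_2 = pendulum_angles(0.3, 0.02, 0.01, p_alpha, p_beta)
Q_alpha, Q_beta = normal_coordinates(phi_1, phi_2)
print(Q_alpha, 0.02 * math.cos(p_alpha * 0.3))
print(Q_beta, 0.01 * math.cos(p_beta * 0.3))
```

Exciting only one mode (for example `C2 = 0`) makes both angles equal at all times, which is the in-phase principal oscillation.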


11.3 FORCED VIBRATIONS AND RESONANCE 

Until now, we have considered the free oscillations of a system around its equilibrium
configuration, that is, the oscillations of the system when slightly disturbed initially from the
equilibrium configuration and then allowed to oscillate by itself. In a variety of situations,
however, the system is set into oscillation about its equilibrium configuration by an external
force varying periodically with time. These are the so-called forced oscillations whose
frequency is determined by the frequency of the driving force rather than by the normal 
mode frequencies. However, the normal modes of free oscillations play a crucial role in the 
analysis of forced oscillations. In particular, the problem of obtaining the amplitudes of the 
forced oscillations is greatly simplified by use of the normal coordinates obtained from free 
oscillations. 




Suppose $F_\beta$ is the generalised force corresponding to the generalised coordinate, say $q_\beta$,
then the generalised force corresponding to the $\alpha$th normal coordinate $Q_\alpha$ is given by

$f_\alpha = \sum_\beta A_{\beta\alpha} F_\beta$ (11.43)

The equations of motion, when expressed in normal coordinates, become (see Eq. (11.39))

$\ddot{Q}_\alpha + p_\alpha^2 Q_\alpha = f_\alpha(t)$ (11.44)

Equations (11.44) are a set of $n$ inhomogeneous differential equations that can be solved only
when we know the dependence of $f_\alpha$ on time. Note, however, that the normal coordinates
preserve their advantage of separating the variables. Thus each of the Eqs (11.44) involves
only a single variable and can be solved independently of the others.

In many real systems the oscillations are damped by the resistance of the medium in 
which the system is oscillating (or by some other reason). The corresponding forces, acting 
on the system, are dissipative in nature and are proportional to the velocities of the particles 
of the system. As described in chapter 2, these can be obtained from a dissipation function 

$R = \frac{1}{2} h_{jk}\,\dot{q}_j \dot{q}_k$ (11.45)

where $h_{jk}$ is a symmetric matrix. In order that the normal coordinates exist, we must find
a principal axis transformation which simultaneously diagonalises the matrices $[a_{ij}]$,
$[b_{ij}]$ and $[h_{jk}]$ defined by Eqs (11.18) and (11.45). Such a principal axis transformation does
not necessarily exist in every case. However, henceforth we specialise to the case where a
transformation to normal coordinates is possible. In such cases, Eqs (11.44) are replaced by

$\ddot{Q}_\alpha + k_\alpha \dot{Q}_\alpha + p_\alpha^2 Q_\alpha = f_\alpha(t)$ (11.46)

which again are $n$ uncoupled differential equations, each involving a single coordinate, and
hence can be solved independently of the others.

Thus the problem of forced oscillations for each normal coordinate boils down to solving
a differential equation of the form

$\ddot{x} + 2\beta\dot{x} + \omega_0^2 x = \frac{f(t)}{m}$ (11.47)

where $Q_\alpha$ is replaced by $x$, $k_\alpha$ by $2\beta$, $p_\alpha^2$ by $\omega_0^2$ and $f_\alpha(t)$ by $f(t)/m$.
Here $m\omega_0^2 = k$, say, is the spring constant and $m$ is the mass of the system. We shall refer
to the physical system obeying Eq. (11.47) as an oscillator driven by a force function $f(t)$.

Now in many instances, the time dependence of the force function $f(t)$ is a simple
sinusoid. For example, in an acoustic problem, the driving force might arise from the
pressure of a sound wave impinging on the system and $f(t)$ then has the same frequency
as the sound wave. Or, if the system is a poly-atomic molecule, a sinusoidal driving force
is present if the molecule is illuminated by a monochromatic light beam. Each atom in
the molecule is then subjected to an electromagnetic force whose frequency is that of the
incident light. Even when $f(t)$ is not sinusoidal with a single frequency it can often be
considered as a superposition of such sinusoidal terms. Thus if $f(t)$ is periodic it can




be represented as a Fourier series, otherwise a Fourier integral representation is suitable.
Since Eq. (11.47) is linear, its solution corresponding to a given $f(t)$ can be obtained by
superposing the solutions corresponding to the sinusoidal terms in its Fourier representation.
It is therefore of general importance to study the nature of solutions of (11.47) with $f(t)$
having a sinusoidal variation with time. We consider, therefore,

$\ddot{x} + 2\beta\dot{x} + \omega_0^2 x = \frac{f_0}{m}\cos\omega t$ (11.48)

where $f_0$ is a constant independent of $x$ and $t$.

The complete solution of Eq. (11.48) is the sum of the complete solution of the homogeneous
equation

$\ddot{x} + 2\beta\dot{x} + \omega_0^2 x = 0$ (11.49)

and any particular solution of Eq. (11.48). The term consisting of the complete solution to
the homogeneous equation will be damped out with time, as long as $\beta > 0$, and is called
the transient term. The remaining term in the solution persists in time and is called the
steady state solution. We are interested in the steady state solution to Eq. (11.48) which is
given by

$x = A\cos(\omega t - \phi)$ (11.50)

where

$A = \frac{f_0/m}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}}$ (11.51)

and

$\phi = \tan^{-1}\left(\frac{2\omega\beta}{\omega_0^2 - \omega^2}\right)$

We see that the motion is sinusoidal, with the angular frequency $\omega$ which is the same
as that of the driving force. The phase $(\omega t - \phi)$ lags behind the phase $\omega t$ of the driving
force by $\phi$ radians. The phase angle $\phi$ depends on the frequency of the driving force and
ranges from zero to $\pi$ as $\omega$ ranges from 0 to infinity. The amplitude $A$, which depends on
the magnitude and frequency of the driving force, is considered in subsection 11.3.2.
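A direct way to gain confidence in the steady state solution is to substitute it back into Eq. (11.48) numerically. The sketch below computes $A$ from Eq. (11.51) and takes the phase lag from the two-argument arctangent so that it lies between zero and $\pi$; the function names and the sample parameter values are assumptions made for illustration.

```python
import math

def steady_state(f0, m, omega0, beta, omega):
    """Amplitude A of Eq. (11.51) and phase lag phi in [0, pi] of x = A cos(omega t - phi)."""
    detuning = omega0**2 - omega**2
    A = (f0 / m) / math.hypot(detuning, 2.0 * beta * omega)
    phi = math.atan2(2.0 * beta * omega, detuning)
    return A, phi

def residual(f0, m, omega0, beta, omega, t):
    """Left side minus right side of Eq. (11.48) for the steady state solution at time t."""
    A, phi = steady_state(f0, m, omega0, beta, omega)
    x = A * math.cos(omega * t - phi)
    x_dot = -A * omega * math.sin(omega * t - phi)
    x_ddot = -A * omega**2 * math.cos(omega * t - phi)
    return x_ddot + 2.0 * beta * x_dot + omega0**2 * x - (f0 / m) * math.cos(omega * t)

# Assumed sample values: driving exactly at omega0, and then well above it.
print(steady_state(1.0, 1.0, 2.0, 0.1, 2.0))
print(residual(1.0, 1.0, 2.0, 0.1, 3.0, 0.4))
```

Driving at $\omega = \omega_0$ gives a phase lag of $\pi/2$, and the residual should be zero up to rounding error for any choice of $t$.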


11.3.1 Energy Considerations 


In many problems we are interested in the amount of energy that is stored in the oscillator 
in the form of kinetic or potential energy and also in the amount of work that must be done 
by the driving force to maintain a given amount of energy in the oscillator. 

The average potential energy and the average kinetic energy stored in the oscillator can
be found by averaging the quantities $kx^2/2$ and $m\dot{x}^2/2$ over a single period $T$. Thus we get

$\left\langle \frac{1}{2}kx^2 \right\rangle = \frac{1}{T}\int_0^T \frac{1}{2}k\left[A\cos(\omega t - \phi)\right]^2 dt = \frac{kA^2}{4} = \frac{m\omega_0^2 A^2}{4}$ (11.52)



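and

$\left\langle \frac{1}{2}m\dot{x}^2 \right\rangle = \frac{1}{T}\int_0^T \frac{1}{2}m\left[-A\omega\sin(\omega t - \phi)\right]^2 dt = \frac{m\omega^2 A^2}{4}$ (11.53)

where $A$ is given in Eq. (11.51).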

In the steady state, the values of both kinetic and potential energies at the beginning of a 
period are the same as those at the end of a period. Hence the driving force is needed to do 
work only to supply the energy dissipated due to the damping force. The energy dissipated 
by the damping force in one period or equivalently the negative of the work done by the 
damping force in one period is 


$\Delta W = \int_{x(t)}^{x(t+T)} 2m\beta\dot{x}\,dx = 2m\int_t^{t+T} \beta\dot{x}^2\,dt = \beta m\omega^2 A^2 T$ (11.54)


A measure of the efficiency of a given oscillator for energy storage is given by the Q 
factor defined by 


$$ Q = 2\pi\,\frac{\text{average stored energy}}{\text{energy dissipated per cycle}} = 2\pi\,\frac{m\omega_0^2A^2/4 + m\omega^2A^2/4}{\beta m\omega^2A^2T} = \frac{\omega_0^2 + \omega^2}{4\beta\omega} \tag{11.55} $$


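Here we have used Eqs (11.52) and (11.53), together with $T = 2\pi/\omega$. So

$$ Q \propto \beta^{-1} \quad\text{and}\quad Q \simeq \frac{\omega_0}{2\beta} \quad\text{when } \omega \simeq \omega_0. \tag{11.56} $$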
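
The definition of the Q factor can be checked against the closed form of Eq. (11.55) by carrying out the two period averages numerically. The Python sketch below does this with a midpoint rule; the function names, the default amplitude and the step count are illustrative choices, not part of the text.

```python
import math

def q_factor_numeric(m, omega0, beta, omega, A=1.0, steps=20000):
    """Q = 2 pi * (average stored energy) / (energy dissipated per cycle),
    with both quantities obtained by midpoint-rule integration over one period."""
    T = 2 * math.pi / omega
    dt = T / steps
    k = m * omega0**2
    stored = 0.0       # integral of (potential + kinetic energy) over one period
    dissipated = 0.0   # integral of 2 m beta v^2 over one period, as in Eq. (11.54)
    for i in range(steps):
        t = (i + 0.5) * dt
        # The phase lag phi is dropped: it does not change averages over a full period.
        x = A * math.cos(omega * t)
        v = -A * omega * math.sin(omega * t)
        stored += (0.5 * k * x * x + 0.5 * m * v * v) * dt
        dissipated += 2 * m * beta * v * v * dt
    return 2 * math.pi * (stored / T) / dissipated

def q_factor_formula(omega0, beta, omega):
    """Closed form of Eq. (11.55)."""
    return (omega0**2 + omega**2) / (4 * beta * omega)
```

The amplitude and the mass cancel in the ratio, so `q_factor_numeric` depends only on $\omega_0$, $\beta$ and $\omega$, as the closed form suggests.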


11.3.2 Resonance 

We know that the normal coordinate $x(t)$ and the corresponding velocity $\dot{x}(t)$ are sinusoidal
functions with amplitudes $A$ and $A'$ respectively, with $A$ given by Eq. (11.51), and that

$$ A' = \omega A \tag{11.57} $$

If the magnitude $f_0$ of the driving force is held fixed and the frequency $\omega$ is varied, and
if $\beta < \omega_0$, the quantities $A$ and $A'$ will each have a maximum value for a certain frequency.
If $\beta \ll \omega_0$ the quantities $A$ and $A'$ are sharply peaked around their maximum values as
illustrated in Fig. 11.3(a) and 11.3(b). This rapid enhancement in the value of $A$ or $A'$ in
the neighborhood of a certain frequency is called a resonance and the frequency at which a
resonance occurs is called the resonant frequency.

The frequency $\omega_r$ at which the amplitude $A$ becomes maximum is called the displacement
resonance frequency and is found by maximising Eq. (11.56) with respect to $\omega$ or




Similarly the value of $\Delta\omega$ for the kinetic energy resonance curve and, when $\beta \ll \omega_0$, for the
potential energy resonance curve is $2\beta$. Note that in all these cases the width of the resonance curve is
a measure of the strength of damping. The greater the damping the wider is the resonance
curve. Further, when $\omega = \omega_0$ the Q factor of the oscillator is

$$ Q_0 = \frac{\omega_0}{2\beta} = \frac{\omega_0}{\Delta\omega} $$

so that $Q_0$ is inversely proportional to the width of the resonance curve. The sharper the
resonance, the larger is the value of $Q_0$. It can be shown that, in the absence of the driving
force, the amplitude of the damped oscillator decays with a time constant $1/\beta$. Thus the
relaxation time is inversely proportional to the width.
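
The statement that the kinetic energy resonance has width $2\beta$ can be checked directly. The Python sketch below assumes that $\Delta\omega$ denotes the full width of the curve at half its maximum (the definition sits on a page not reproduced here) and that the kinetic energy is proportional to $A'^2 = \omega^2A^2$ with $A$ from Eq. (11.51); the function names are illustrative.

```python
import math

def kinetic_half_power_width(omega0, beta):
    """Half-maximum points of (omega A)^2 as a function of omega, and their separation.

    (omega A)^2 is proportional to 1 / ((omega0^2/omega - omega)^2 + 4 beta^2),
    so the half-maximum points satisfy |omega0^2/omega - omega| = 2 beta.
    """
    root = math.sqrt(omega0**2 + beta**2)
    lower = root - beta   # solves omega0^2/omega - omega = +2 beta
    upper = root + beta   # solves omega0^2/omega - omega = -2 beta
    return lower, upper, upper - lower

def velocity_amplitude_sq(omega0, beta, omega, f0_over_m=1.0):
    """Square of A' = omega A, with A taken from Eq. (11.51)."""
    return (f0_over_m * omega) ** 2 / ((omega0**2 - omega**2) ** 2 + 4 * beta**2 * omega**2)
```

Under these assumptions the width comes out as exactly $2\beta$ for any $\beta$, whereas for the potential energy curve the same result holds only approximately, for $\beta \ll \omega_0$.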

The above analysis leads to the conclusion that a great deal of information about the
oscillator can be obtained from one of its resonance curves. The location of the resonance
gives its natural (normal mode) frequency $\omega_0$ and the width helps us determine the damping
factor. These can further be related to the Q factor, the relaxation time, the restoring and
the damping forces, etc. In many instances it is easier to obtain a resonance curve than to
obtain the information on these quantities directly from their definitions.

Note that in principle, at resonance, only a single natural frequency of the system is excited.
Indeed, at resonance the amplitude corresponding to the resonant frequency far exceeds
that corresponding to any other frequency that may be excited due to some stray disturbances
or noise. Thus, at resonance, the signal to noise ratio becomes very high. Therefore
the resonance phenomenon offers a very accurate and sensitive spectroscopic technique to
study the natural excitations of the system. Electron Paramagnetic Resonance (EPR), Nuclear
Magnetic Resonance (NMR) and Mössbauer spectroscopy are but a few examples of
highly sensitive and accurate spectroscopic techniques based on the phenomenon of resonance.


11.4 SUMMARY 

The different types of equilibrium configurations are classified into stable, unstable and 
metastable states. Also the equilibrium itself could be static or dynamic. 

Obviously, the form of the Lagrangian for small perturbations about any one of the equilibrium
configurations is drastically simplified. If the equilibrium point under consideration
is a stable one, there always exists a solution that represents oscillation. More surprisingly,
such oscillations are invariably of the simple harmonic type, with periods independent of
the amplitude of perturbation, provided of course, the amplitudes are small.

It is always possible to excite a system, which may have an arbitrarily large number of
degrees of freedom, in such a manner that it executes simple harmonic oscillations in all generalised
coordinates with a single frequency. This is called a normal mode of oscillation. Using the
Lagrangian method, it is shown that the equations of motion reduce to a set of linearly
coupled algebraic equations in amplitudes, and that the solution for all possible normal
modes reduces to one of the standard eigenvalue problems. If the number of DOF is n,
the maximum possible number of independent normal modes is obviously n. Furthermore,
for each normal mode, the relative amplitudes of oscillation in individual generalised coordinates
can be arbitrary, but not the relative phases, which are very strictly either 0° or
180°, and nothing in between.

Because of the last condition, for any given normal mode of small amplitude oscillation,
the dynamical representation of the system in its n-dimensional configuration space corresponds
to a simple harmonic motion in a straight line. This is called a principal oscillation
corresponding to the given normal mode. A single coordinate, say $Q$, may also be defined
as a measure of the total instantaneous displacement along this line from the equilibrium
position in order to represent this oscillation in the configuration space. This is precisely the
normal coordinate. Therefore, in a given normal mode, a suitably defined normal coordinate
will execute the corresponding principal oscillation.

Hence, there can be at most n different principal oscillations and the corresponding 
normal coordinates. A linear superposition of these principal oscillations can represent any 
arbitrary oscillation produced by arbitrarily exciting the system. 
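
As a small illustration of the eigenvalue problem behind normal modes, the Python sketch below treats a hypothetical system that is not taken from the text: two equal masses $m$ joined to each other and to two fixed walls by three identical springs of stiffness $k$, so that the potential energy matrix is $V = \begin{pmatrix} 2k & -k \\ -k & 2k \end{pmatrix}$. The function name and the closed-form 2x2 solution are choices made for this sketch.

```python
import math

def normal_modes_2x2(V, m):
    """Normal modes of two equal masses m with a symmetric 2x2 stiffness matrix V.

    Solves (V - omega^2 m I) a = 0 in closed form.
    Returns a list of (omega, amplitude_vector) pairs sorted by frequency.
    """
    (a, b), (_, d) = V
    mean = 0.5 * (a + d)
    half_gap = math.hypot(0.5 * (a - d), b)
    modes = []
    for lam in (mean - half_gap, mean + half_gap):   # eigenvalues of V
        # (b, lam - a) is an eigenvector of V whenever it is nonzero
        vec = (b, lam - a) if (b != 0 or lam != a) else (1.0, 0.0)
        modes.append((math.sqrt(lam / m), vec))
    return modes
```

For $k = m = 1$ the two frequencies are $1$ and $\sqrt{3}$. In the slower mode the two amplitudes have the same sign (relative phase 0°) and in the faster mode opposite signs (relative phase 180°), in line with the remark on relative phases above.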

In the presence of damping and externally impressed oscillations, the system responds in 
resonance whenever the frequency of the impressed oscillation is chosen very close to one 
of the natural frequencies of oscillation of the system. Since the power consumption due 
to damping increases as the amplitude of oscillation grows, the amplitude of the resonating 
system finally adjusts to a maximum value when the power fed by the impressing system 
exactly balances the power lost due to damping. 


PROBLEMS 

11.1 A piston of mass m divides a cylinder containing gas into two equal parts. Suppose
the piston is displaced to the left a distance x and let go. Find the frequency of the
piston’s oscillation, if the process takes place

(i) at constant temperature and 

(ii) adiabatically. 

11.2 Find the differential equation for the contour of a constraining surface on which a 
point mass will oscillate with a period independent of the amplitude. 

11.3 Professor I. Rabi was once running a cosmic ray experiment on a mountain top and
noticed that his mechanical wrist watch (vibrating balance wheel type) started running
slightly faster. He posed the problem to Professor Fermi during a train ride, who
thought for a while and produced a fully quantitative explanation within an hour.
Could you figure out the line of reasoning?

11.4 A spherical ball rolls in a quarter-circle track suspended as a pendulum bob. Solve for
the coupled oscillations of the system and show that the dynamical coupling between
the ball and the track produces some dramatic starts and stops.

11.5 Atwood’s oscillator consists of a solid disc of mass M and radius R, pivoted at the
centre, with an additional mass m placed at a distance r from the centre. The 
equilibrium is maintained by a mass m', which is suspended from a massless string 
wrapped around the disc. At the equilibrium position, the line from the centre of the 





12 

Rigid Body Dynamics 


12.0 INTRODUCTION 

We have written a rather long chapter on this topic, the reason being our attempt to 
elaborate on the basics that involve conceptual clarity of the motions of rigid bodies. It has 
been our sad experience that most students appearing for the qualifying orals for admission 
to the Ph.D. programmes in physics at research institutions do fairly badly in the rigid 
body dynamics part. We have therefore covered parts of the undergraduate syllabus in 
great detail in the first half of the chapter, and more advanced topics are presented in the 
second half. Section 12.28 deals with a totally new but exciting topic, a brief treatise on 
the dynamics of sports and games. Have fun reading it! 

After Newton, the most ingenious man to have exploited the full power of calculus in 
applying Newton’s laws of motion to rigid body dynamics and continuum mechanics is said 
to be Leonhard Euler, the most prolific mathematician of all time. A straightforward list of 
Euler’s works would occupy no less than 80 pages of this book, and all his works, altogether 
866 papers, amount to 69 large volumes. Euler was one of the great mathematicians who 
could work anywhere under any conditions. He would dash off a mathematical paper in
the half hour or so between the first and second calls to dinner. The ease with which
he wrote the most difficult mathematics is incredible. Publishers running short of material 
would come and pick up the top deck of his piled up manuscripts, so in most cases the actual 
chronology is lost. 

Born in Switzerland in 1707, Euler joined St. Petersburg Academy in Russia in 1727, lost
the sight of one eye in 1736, joined Berlin Academy at the invitation of King Frederick,
became totally blind in 1766 and returned to St. Petersburg Academy, and worked with
full enthusiasm till he died in 1783. He had just finished the calculation of the orbit of Uranus,
which had recently been discovered by William Herschel, when he died following a heart attack.

He introduced analytical methods into mechanics as early as 1736 and made quasi-axiomatic
use of the principle of virtual work. He deduced the differential equation for
the problem of minimising integrals in 1744. He began to use undetermined multipliers
before Lagrange reinvented them. When Lagrange at the age of 23 sent his brand new
method of tackling dynamical problems to the 54-year old Euler for comment, Euler found
Lagrange’s analytical method far superior to his semi-geometrical methods and immediately
used it to solve one of the outstanding problems of the day, but did not publish it before
he could convince Lagrange to publish his method first. Euler was always generously appreciative
of the work of others. His treatment of his young rival Lagrange is one of the
finest examples of unselfishness in the history of science. D’Alembert, Euler and Lagrange
were great friends. The entire subject of rigid body dynamics was but one small brainchild of Euler,
which appeared in 1760 in the form of a book entitled Theoria motus corporum solidorum
seu rigidorum. After one hundred years, another notable book, Dynamics of rigid bodies by
Edward John Routh, truly surpassed Euler’s work.

Apart from Euler and Routh, substantial contributions have come from Arnold Sommerfeld
(1868 - 1951). While professor of Mathematics at the Bergakademie at Clausthal, he
began with Felix Klein in 1897 the preparation of a four volume classic treatise on the gyroscope,
called Theorie des Kreisels, that was to take 13 years to complete. Joseph Lagrange,
Edward Routh and Louis Poinsot have supplemented the Eulerian schemes. In fact, rigid
bodies appear in so many different contexts and in so many different ways that there always
remains scope for completely solving even ‘simple’ problems like the motion of topsy-turvy
tops, rotating pebbles, boomerangs etc. Contributions to specific problem oriented
topics in rigid body dynamics are innumerable. Sometimes these are questions of correctly
formulating the problem, if not finding solutions in closed forms.


12.1 DEGREES OF FREEDOM OF A FREE RIGID BODY 

We already know that the distance between any two constituent particles of a rigid body 
remains fixed (by definition) throughout the motion of the body. In other words, the motion 
of a rigid body is constrained by the requirement that the distance between any two of its 
particles remain the same for all time. We have also seen that the rigidity constraint is 
holonomic. 

The number of DOF of a free rigid body is the minimum number of independent coordinates
required to describe all possible configurations of the rigid body. In the course of
dynamical motions, a rigid body assumes various possible configurations maintaining, of
course, the rigid body constraint. Let us assume that a rigid body has $N$ particles in it
($N \geq 3$) and it has $n$ DOF. If we now choose any particular particle in it, it can be translated
to any desired point in the 3-D Euclidean space. Therefore, this particular particle
must have three DOF. Let us now fix this particle at some point in 3-D space, so that we
take away three degrees of freedom from the total number of DOF for the whole system,
that is, if it had originally the number of DOF = $n$, now its number of DOF = $n - 3$.

Now let us choose a second particle in the rigid body. This particle should have had
number of DOF = 3 before the fixation of the first particle, but since the first particle’s
location is fixed and the distance between the first and the second particle must remain
unchanged due to the rigidity constraint, the corresponding constraint relation has to be
satisfied. Hence the number of DOF of the second particle, subject to the fixation of
the first particle, is only 2. In fact, the second particle can now lie anywhere on the surface
of a sphere of radius equal to the distance between the first and the second particle with
the centre located on the first particle. Thus the second particle cannot translate at all but
can only rotate about the first particle. If we now fix the second particle at some desired
point on the sphere of its allowed 2-D motion, we further take away 2 degrees of freedom
from the system. Hence the total number of DOF left for any rigid body having any two
particles fixed in space is $n - 5$.

Let us now consider any third particle that may or may not lie on the line joining the
first two particles. If the third particle lies on the previous line it cannot have any degree of
freedom, in which case we have to go on to the next particle, and the next, until we choose a particle
outside the above line. So long as these particles are chosen to lie on the same line defined
by the first two particles, fixing these particles can be done without loss of any further
degrees of freedom. As soon as we choose a particle that does not lie on the above line, this
particle can move only in a circular track about the above line, because it has to satisfy
two constraint relations, namely, its distances from the two previously fixed particles must
remain constant. This is once again a 1-D rotational DOF. If we now fix this off-the-line
particle, the system loses a total of 6 DOF and the number of DOF left is $n - 6$. Now after
fixing three particles, lying on the vertices of a triangle, any fourth particle must satisfy
three constraint relations, hence its number of DOF = 0. Any fifth particle will have to
satisfy more than three constraint relations so for it also no DOF is left. This is true for
all the rest of the particles implying that the rigid body is now left with no more DOF.
Therefore $n - 6 = 0$, or, $n = 6$.
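
The counting argument can be cross-checked numerically by a different route, which is an assumption of this sketch rather than the method of the text: write down all pairwise distance constraints $|\mathbf{r}_i - \mathbf{r}_j|^2 = \text{constant}$ for $N$ particles, count how many are independent through the rank of their Jacobian, and subtract that number from $3N$. The function names are illustrative, and the test is only reliable for non-degenerate configurations (the linearised count overestimates the freedom of collinear particles).

```python
import itertools

def rank(rows, tol=1e-9):
    """Rank of a matrix (list of lists) by Gaussian elimination with row pivoting."""
    a = [row[:] for row in rows]
    n_rows = len(a)
    n_cols = len(a[0]) if a else 0
    r = 0
    for c in range(n_cols):
        if r == n_rows:
            break
        pivot = max(range(r, n_rows), key=lambda i: abs(a[i][c]))
        if abs(a[pivot][c]) < tol:
            continue
        a[r], a[pivot] = a[pivot], a[r]
        for i in range(r + 1, n_rows):
            f = a[i][c] / a[r][c]
            for j in range(c, n_cols):
                a[i][j] -= f * a[r][j]
        r += 1
    return r

def rigid_body_dof(points):
    """3N minus the number of independent pairwise distance constraints."""
    n = len(points)
    jac = []
    for i, j in itertools.combinations(range(n), 2):
        # Gradient of |r_i - r_j|^2 / 2 with respect to all 3N coordinates
        row = [0.0] * (3 * n)
        for c in range(3):
            d = points[i][c] - points[j][c]
            row[3 * i + c] = d
            row[3 * j + c] = -d
        jac.append(row)
    return 3 * n - rank(jac)
```

For one, two and three (non-collinear) particles this gives 3, 5 and 6 respectively, and the count stays at 6 however many further particles are added.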

Thus any rigid body consisting of at least three particles, not arranged in one straight
line, must have six DOF. In other words, any rigid body can have at most six DOF of which
three are translational and the remaining three are rotational DOF. The implication of this result
is stated in the following points.

(i) Six generalised coordinates $(q_1, \ldots, q_6)$ are sufficient to describe the dynamical motion
of any rigid body, out of which three are chosen for the location of any point fixed with
respect to the rigid body and the other three for all possible rotations of the rigid body
about this chosen point. Note that this point can be inside or outside the rigid body. If
the point is chosen outside the physical boundary of a rigid body a suitable massless rigid
pointer has to be firmly attached to the rigid body which will have the above point on
its arrowhead. For the six generalised coordinates we need to define the corresponding
generalised velocities $(\dot{q}_1, \ldots, \dot{q}_6)$, and canonical momenta $(p_1, \ldots, p_6)$, through a suitably
defined Lagrangian function $L = L(q_1, \ldots, q_6, \dot{q}_1, \ldots, \dot{q}_6, t)$.

(ii) A rigid body must have $2n = 12$ independent constants of motion, of which 11
must be totally time independent. We shall see later that for a freely moving rigid body
the constants of motion that can be easily identified are its total linear momentum (3),
total angular momentum (3), translational kinetic energy (1), rotational kinetic energy (1)
and the vector condition for Galilean invariance (3). The last three are in fact explicit
functions of time, of which two can be made time independent by substituting the time
dependence derived from the first. So in all these constitute 11 constants of motion. The
twelfth independent constant of motion, unlike the other 11, may not always be expressed
in closed form. This will become more apparent in section 12.14.

Just as the kinematical motions of a rigid body can now be studied without making
any explicit reference to the motion of its $N$ constituent particles, a full (dynamical)
description of the rigid body is also possible without the specific details of its mass (or
inertial) configuration, which, as we shall see later on, can be fully expressed in terms of
only six independent quantities, constituting what is called the moment of inertia tensor.
However it should be mentioned that this coincidence in number for the degrees of freedom
and the independent components of the moment of inertia tensor corresponds to the fact
that both are $n(n + 1)/2$, where $n$ here is the number of spatial dimensions.


12.2 EULER’S AND CHASLES’ THEOREMS 

By a rigid displacement we mean any possible displacement of a rigid body in the real 3-D 
Euclidean space. 

12.2.1 Euler’s Theorem (1776) 

Any rigid displacement is a combined result of rotation and translation. 

This theorem can be rigorously proved using the properties of the Euclidean group which
is shown to be isomorphic first with the operations of rigid displacements and then with the
group of linear transformations corresponding to translation and rotation. However our
approach will be geometric in nature.

Let us take any rigid body and note its location and configuration. We now give some
arbitrary displacement to this body thus taking it to some final state. We note the final
location and configuration. Now the question is, can there be a combination of translation
and rotation that takes the rigid body from its initial state to its final state? What we
must do first is to identify any specific particle in the rigid body, and translate the rigid
body such that the chosen particle moves in a straight line from its position in the initial
state to that in the final state. Now keeping the position of the particle fixed at this final
position we just need to give a proper amount of rotation about a proper axis in order
to make every particle in the body take up the position corresponding to the final state
configuration. This can be done in one step, but is more easily done by a sequence of two
rotations. For example, choose any second point in the body and rotate the body about the
first point such that the second point coincides with its position in the final configuration.
Then choose a third point, not on the line joining the first two points, and
then perform a rotation about this line so as to make the third point go over to its position
in the final configuration. This sequence of rotations will obviously make the whole body
perfectly assume the final configuration. Since two successive rotations about a given point
are equivalent to some single rotation about the same point, these two rotational operations
can be regarded as a single unique rotational operation. This proves Euler’s theorem by
geometrical arguments.

Let us now put this proof in a mathematical form. The condition for rigidity is
$|\mathbf{r}_i - \mathbf{r}_j| = \text{constant}$ for any pair of particles numbered with indices $i$ and $j$. Let, due to
any rigid displacement, $\mathbf{r}_i \to \mathbf{r}_i'$ and $\mathbf{r}_j \to \mathbf{r}_j'$. Then, by the condition of rigidity,
$|\mathbf{r}_i' - \mathbf{r}_j'| = |\mathbf{r}_i - \mathbf{r}_j|$.

Obviously, this condition is satisfied for the transformation $\mathbf{r}_i' = \mathbf{r}_i + \mathbf{a}$, where $\mathbf{a}$
represents uniform translation of the whole body.




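Now let us try out another transformation which is linear, and is given by

$$ (\mathbf{r}_i')_p = \sum_{q=1}^{3} a_{pq}(\mathbf{r}_i)_q, \qquad (\mathbf{r}_j')_p = \sum_{s=1}^{3} a_{ps}(\mathbf{r}_j)_s, \qquad p = 1, 2, 3 $$

Hence,

$$ \begin{aligned}
|\mathbf{r}_i' - \mathbf{r}_j'|^2 &= \sum_{p,q,s} a_{pq}[(\mathbf{r}_i)_q - (\mathbf{r}_j)_q]\,a_{ps}[(\mathbf{r}_i)_s - (\mathbf{r}_j)_s] \\
&= \sum_{p,q,s} a_{pq}a_{ps}[(\mathbf{r}_i)_q(\mathbf{r}_i)_s + (\mathbf{r}_j)_q(\mathbf{r}_j)_s - (\mathbf{r}_i)_q(\mathbf{r}_j)_s - (\mathbf{r}_j)_q(\mathbf{r}_i)_s]
\end{aligned} $$

This can be equated to $|\mathbf{r}_i - \mathbf{r}_j|^2$ if and only if

$$ \sum_{p} a_{pq}a_{ps} = \delta_{qs} $$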

which means that the $3 \times 3$ matrix $a_{pq}$ has to be orthogonal. Now, every orthogonal $3 \times 3$
matrix with determinant $+1$ represents a unique rotation in the 3-D Euclidean space. Hence
any finite rotation is a possible rigid displacement satisfying the condition of rigidity.

The transformation represented by inversion, that is, $\mathbf{r}_i \to \mathbf{r}_i' = -\mathbf{r}_i$ (an orthogonal
matrix with determinant $-1$), also satisfies the condition of rigidity. But this is not a possible
transformation, as it is discrete and cannot be effected by a succession of infinitesimal operations.
There is no other operation known that can satisfy the rigidity condition. This completes the
proof of Euler’s theorem.
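
The orthogonality condition is easy to test numerically. The Python sketch below builds a rotation matrix from an axis and an angle using the Rodrigues formula (a standard construction, not one used in the text), and then checks that $\sum_p a_{pq}a_{ps} = \delta_{qs}$, that distances between points are preserved, and that the determinant is $+1$ for a rotation but $-1$ for inversion. All function names and sample vectors are illustrative.

```python
import math

def rotation_matrix(axis, angle):
    """Rodrigues formula: matrix of a rotation by `angle` about the direction `axis`."""
    norm = math.sqrt(sum(comp * comp for comp in axis))
    x, y, z = (comp / norm for comp in axis)
    c, s = math.cos(angle), math.sin(angle)
    t = 1.0 - c
    return [
        [c + t * x * x, t * x * y - s * z, t * x * z + s * y],
        [t * x * y + s * z, c + t * y * y, t * y * z - s * x],
        [t * x * z - s * y, t * y * z + s * x, c + t * z * z],
    ]

def apply(a, r):
    """Components (r')_p = sum_q a[p][q] r[q]."""
    return [sum(a[p][q] * r[q] for q in range(3)) for p in range(3)]

def distance(r1, r2):
    return math.sqrt(sum((u - v) ** 2 for u, v in zip(r1, r2)))

def gram(a):
    """Matrix with entries sum_p a[p][q] a[p][s]; the identity when a is orthogonal."""
    return [[sum(a[p][q] * a[p][s] for p in range(3)) for s in range(3)] for q in range(3)]

def det3(a):
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))
```

The determinant is what separates the two cases: both a rotation and inversion pass the `gram` test, but only the rotation can be reached continuously from the identity.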

Note that the order of translation and rotation can be reversed, and that depending on 
the choice of the first particle, the amounts of required translation and rotation can also 
vary. But for a given translation, the rotation is uniquely fixed and vice versa. This fact is 
more precisely stated in Chasles’ theorem. 

12.2.2 Chasles’ Theorem (1830) 

Any rigid displacement can be uniquely expressed as a screw displacement where a screw 
displacement consists of the combination of a rotation with translation parallel to or along 
the same axis of rotation called the screw axis. 

This theorem is not much used as it is essentially a variation of Euler’s theorem. For 
Chasles’ theorem one has to first find out the correct axis of rotation, give the right amount 
of vector rotation and then the right amount of translation along the axis of rotation. This 
axis of rotation may in general lie outside the rigid body. 
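
To make Chasles' theorem concrete, the Python sketch below takes a displacement written as a rotation about an axis through the origin followed by a translation $\mathbf{a}$, and finds the screw axis and the slide along it. The closed-form expression for the point on the axis, $\mathbf{c} = \tfrac{1}{2}[\mathbf{a}_\perp + \cot(\theta/2)\,\hat{\mathbf{n}} \times \mathbf{a}_\perp]$, is a standard result supplied here as an assumption, not a formula quoted from the text; the function names are illustrative.

```python
import math

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0]]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def rotate(n, theta, r):
    """Rotate r by theta about the unit vector n through the origin (Rodrigues formula)."""
    c, s = math.cos(theta), math.sin(theta)
    nxr, ndr = cross(n, r), dot(n, r)
    return [c * r[i] + s * nxr[i] + (1 - c) * ndr * n[i] for i in range(3)]

def screw_parameters(n, theta, a):
    """For the displacement r -> rotate(n, theta, r) + a, return (point_on_axis, slide).

    The screw axis is parallel to n and passes through point_on_axis;
    slide is the translation along the axis. theta must not be a multiple of 2 pi.
    """
    slide = dot(a, n)
    a_perp = [a[i] - slide * n[i] for i in range(3)]
    n_x_a = cross(n, a_perp)
    cot_half = 1.0 / math.tan(theta / 2)
    point = [0.5 * (a_perp[i] + cot_half * n_x_a[i]) for i in range(3)]
    return point, slide
```

A point on the screw axis is simply carried along the axis by the slide, which is the property the check below relies on. For a quarter turn about the z axis with $\mathbf{a} = (1, 0, 2)$ the axis passes through $(0.5, 0.5, 0)$, away from the origin, and the slide is 2.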

These theorems could also have been proved rigorously by vector analysis but the vectorial
representation of any arbitrary but finite rotation involves Eulerian rotations and is somewhat
complicated and tedious. However we know that there are at most six independent
variables involved in the description of the velocity of any point in the rigid body. Equivalents
of Euler’s theorem and Chasles’ theorem exist for the velocity representation and we
now proceed to state and prove these by vectorial methods.

12.2.3 Euler’s Second Theorem 

If a rigid body is moving in any manner and $B_0$ is any point of the body, then there exists
at any instant a vector $\boldsymbol{\omega}$, such that the velocity of any particle $B$ of the body at that instant
is

$$ \mathbf{v} = \mathbf{u}_0 + \boldsymbol{\omega} \times \boldsymbol{\rho} \tag{12.1} $$

where $\mathbf{u}_0$ is the velocity of $B_0$ relative to some fixed inertial frame outside the body and
$\boldsymbol{\rho}$ is the position vector $\overrightarrow{B_0B}$ of $B$ relative to $B_0$. Also $\boldsymbol{\omega}$ is unique and is independent of
the choice of the point $B_0$. With respect to any fixed frame outside, there exists a velocity
$\mathbf{u}$ at a given instant of time such that the velocity of any point $B$ of the body with respect
to the above fixed frame at that instant is given by

$$ \mathbf{v} = \mathbf{u} + \boldsymbol{\omega} \times \mathbf{r} \tag{12.2} $$

where $\mathbf{r}$ is now the position vector of $B$ with respect to the fixed frame.

Note that both $\mathbf{u}$ and $\boldsymbol{\omega}$ are claimed to be the same for every point of the rigid
body at any given instant, so that $\mathbf{v}$ varies from point to point only because of differing $\mathbf{r}$.
The six quantities representing the two vectors $\mathbf{u}$ and $\boldsymbol{\omega}$ fix the velocity $\mathbf{v}$ of any arbitrary
point $\mathbf{r}$ in the rigid body at any given instant. With time both $\mathbf{u}$ and $\boldsymbol{\omega}$ may change, but
at any instant, fixation of the above six quantities is sufficient to obtain the velocity of any
point of the body at that instant, using Eq. (12.2).
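
A short Python sketch shows why a velocity field of the form of Eq. (12.2) is compatible with rigidity: for any two points of the body, $(\mathbf{r}_1 - \mathbf{r}_2)\cdot(\mathbf{v}_1 - \mathbf{v}_2)$, which is the rate of change of half the squared separation, vanishes identically. The function names and the sample vectors are illustrative.

```python
def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0]]

def velocity(u, omega, r):
    """Eq. (12.2): v = u + omega x r."""
    w = cross(omega, r)
    return [u[i] + w[i] for i in range(3)]

def separation_rate(u, omega, r1, r2):
    """(r1 - r2) . (v1 - v2), the time derivative of |r1 - r2|^2 / 2."""
    v1, v2 = velocity(u, omega, r1), velocity(u, omega, r2)
    return sum((r1[i] - r2[i]) * (v1[i] - v2[i]) for i in range(3))
```

The translational part $\mathbf{u}$ cancels in the difference $\mathbf{v}_1 - \mathbf{v}_2$, and the remaining part $\boldsymbol{\omega} \times (\mathbf{r}_1 - \mathbf{r}_2)$ is perpendicular to $\mathbf{r}_1 - \mathbf{r}_2$, so the separation cannot change.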



Fig. 12.1 Position vectors of the points $B_1$, $B_2$ and $B$ of a rigid body with
respect to an outside inertial frame at $O$ and a body frame at $B_0$


Proof: Let $O$ be the origin of the outside (inertial) reference frame and let us consider any
two points $B_1$ and $B_2$ of the rigid body and $B$ as any arbitrary point of the rigid body (see




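can be set equal to, say,

$$ \mathbf{u} = \dot{\mathbf{r}}_0 - (\boldsymbol{\omega} \times \mathbf{r}_0) = \mathbf{u}_0 - (\boldsymbol{\omega} \times \mathbf{r}_0) $$

so that

$$ \dot{\mathbf{r}} = \mathbf{u} + \boldsymbol{\omega} \times \mathbf{r} $$

or

$$ \mathbf{v} = \mathbf{u} + \boldsymbol{\omega} \times \mathbf{r} \tag{12.2} $$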

which is the most general result. It expresses the inertial velocity of any point in the rigid
body at any instant, in terms of two universal vectors $\mathbf{u}$ and $\boldsymbol{\omega}$ which are the same for
the whole rigid body. At any instant, $\mathbf{v}$ is different for different points in a rigid body only
because $\mathbf{r}$ is different for different points.

Thus the inertial velocity of any point in the rigid body consists of two parts: one, the
homogeneous translational velocity $\mathbf{u}$ which is the same for the whole body, and the other,
a homogeneous rotation with the instantaneous angular velocity $\boldsymbol{\omega}$, which is also the same
for the whole body. The latter quantity is also independent of the choice of the inertial
reference frame. The existence of uniform translational and rotational velocities describing
the actual velocity of any point of the rigid body proves the velocity analogue of Euler’s
theorem.


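The screw motion view of the velocity vector can also be ascertained in the following way.
Decompose $\mathbf{u}$ and $\mathbf{r}$ into components parallel (to the screw axis) and perpendicular
to the $\boldsymbol{\omega}$ vector, so that from Eq. (12.2)

$$ \mathbf{v} = \mathbf{u}_\parallel + \mathbf{u}_\perp + \boldsymbol{\omega} \times \mathbf{r}_\perp, \qquad \mathbf{u}_\parallel = \frac{(\mathbf{u}\cdot\boldsymbol{\omega})}{\omega^2}\,\boldsymbol{\omega} $$

Since both $\mathbf{u}_\perp$ and $\boldsymbol{\omega} \times \mathbf{r}_\perp$ are in the same plane perpendicular to $\boldsymbol{\omega}$, we can define a
unique vector $\mathbf{R}$ in the plane perpendicular to $\boldsymbol{\omega}$ through the equation

$$ \mathbf{u}_\perp = \boldsymbol{\omega} \times \mathbf{R} \tag{12.7} $$

The vector $\mathbf{r}_\perp$ can be written as

$$ \mathbf{r}_\perp = \mathbf{r} - \frac{(\mathbf{r}\cdot\boldsymbol{\omega})}{\omega^2}\,\boldsymbol{\omega} $$

Equation (12.7) gives

$$ \mathbf{u}_\perp + (\boldsymbol{\omega} \times \mathbf{r}_\perp) = \boldsymbol{\omega} \times (\mathbf{R} + \mathbf{r}_\perp) $$

Hence

$$ \mathbf{v} = \mathbf{u}_\parallel + \boldsymbol{\omega} \times (\mathbf{R} + \mathbf{r}_\perp) $$

The first term corresponds to a translational velocity along the axis of rotation and the
second term to a pure rotational velocity corresponding to some effective radius vector
$(\mathbf{R} + \mathbf{r}_\perp)$ which lies perpendicular to the axis of rotation. Therefore, the velocity due to
any rigid motion can be viewed as a screw velocity consisting of a translational velocity
along the screw axis and a rotational velocity about the same axis.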
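
The decomposition can be verified numerically. In the Python sketch below the vector $\mathbf{R}$ of Eq. (12.7) is computed as $\mathbf{R} = (\mathbf{u} \times \boldsymbol{\omega})/\omega^2$, a closed form that follows from Eq. (12.7) together with $\mathbf{R} \perp \boldsymbol{\omega}$ but is supplied here rather than quoted from the text; the function names are illustrative.

```python
def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0]]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def screw_velocity_parts(u, omega, r):
    """Return (u_parallel, arm) such that u + omega x r = u_parallel + omega x arm.

    arm = R + r_perp is perpendicular to omega; omega must be nonzero.
    """
    w2 = dot(omega, omega)
    u_par = [dot(u, omega) / w2 * comp for comp in omega]
    r_perp = [r[i] - dot(r, omega) / w2 * omega[i] for i in range(3)]
    # R solves u_perp = omega x R with R perpendicular to omega
    R = [comp / w2 for comp in cross(u, omega)]
    arm = [R[i] + r_perp[i] for i in range(3)]
    return u_par, arm
```

The points with `arm` equal to zero form a line parallel to $\boldsymbol{\omega}$ on which the velocity is purely along $\boldsymbol{\omega}$; this line is the instantaneous screw axis.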




12.2.4 Instantaneous Axis of Rotation 

Now, how do we find the instantaneous axis of rotation? Note that the equation $\dot{\boldsymbol{\rho}} = \boldsymbol{\omega} \times \boldsymbol{\rho}$ is
valid in a frame of reference whose origin is at any arbitrary point $B_0$ in the rigid body, the
direction of its axes being fixed in space (that is, not rotating with the body). An observer
sitting at $B_0$ and moving with such a frame of reference will note, at any instant, $\boldsymbol{\omega}$ to be
the instantaneous axis of rotation. Since $\boldsymbol{\omega}$ appears to pass through $B_0$, this observer will
find himself or herself sitting on the instantaneous axis of rotation. But $B_0$ is an arbitrary
point and hence, the above statement will be true for every point in the rigid body.



Fig. 12.2 A distant star is seen to revolve in the sky about the celestial
pole due to diurnal rotation of the earth


For example, consider an observer on earth, trying to find the location of the earth’s axis
of rotation by looking at only the most distant stars (see Fig. 12.2). Such an observer will
find himself or herself sitting always on the instantaneous axis of the earth’s rotation as all distant stars
would seem to be rotating about him or her. But we know for certain that for an observer in an
outside fixed inertial frame, the earth’s true axis of rotation passes through the centre of the




nonrotating. However this does not prohibit us from expressing the vectors $\boldsymbol{\omega}$, $\mathbf{u}_0$, $\mathbf{v}$ and
$\boldsymbol{\rho}$ as measured in an inertial frame in terms of the instantaneous basis vectors of the body
frame. Thus we can use Eq. (12.1) to obtain the inertial velocity of a particle in the rigid
body and express all the vectors occurring in it in terms of the instantaneous basis vectors
of the body frame. Henceforth we shall always use Eq. (12.2) and mention the frame of
reference whose basis vectors are used to express the vector quantities occurring in it. Table
12.1 summarises various interpretations of $\mathbf{u}$, $\boldsymbol{\omega}$ and $\mathbf{r}$ occurring in Eq. (12.2) for different
frames of reference, the left hand side of which stands for the instantaneous inertial velocity
of a particle of a rigid body that is moving in any possible manner.

For a rigid body rotating about a fixed point in it, the types II and III in the above table
would be the most suitable frames with the origin chosen to be the fixed point and Eq.
(12.2) reduces to

$$ \mathbf{v} = \boldsymbol{\omega} \times \mathbf{r} $$

But if the body frame is moving in any manner, then even when the axis of rotation of
the body passes through its origin, $\mathbf{u}$ is generally nonzero.


12.4 KINETIC ENERGY OF A ROTATING RIGID BODY 

A rigid body is moving in any manner. We are interested in deriving an expression for its
kinetic energy $T$ at any instant with respect to any fixed outside inertial observer. Our
starting formula would be

$$ T = \frac{1}{2}\sum_{k=1}^{N} m_k v_k^2 \tag{12.10} $$

where summation extends over all the $N$ constituent particles of the rigid body and the
individual terms represent the inertial kinetic energy of the individual particles. The expressions
for the inertial velocity $\mathbf{v}_k$ must come from the general Eq. (12.2) which however
can be interpreted in three different ways for the three different types of frames (see Table
12.1). So

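$$ \begin{aligned}
T &= \frac{1}{2}\sum_k m_k\,|\mathbf{u} + \boldsymbol{\omega} \times \mathbf{r}_k|^2 \\
&= \frac{1}{2}\sum_k m_k u^2 + \sum_k m_k\,\mathbf{u}\cdot(\boldsymbol{\omega} \times \mathbf{r}_k) + \frac{1}{2}\sum_k m_k\,(\boldsymbol{\omega} \times \mathbf{r}_k)\cdot(\boldsymbol{\omega} \times \mathbf{r}_k) \\
&= \frac{1}{2}Mu^2 + \sum_k m_k\,\mathbf{r}_k\cdot(\mathbf{u} \times \boldsymbol{\omega}) + \frac{1}{2}\sum_k m_k\,[(\boldsymbol{\omega}\cdot\boldsymbol{\omega})(\mathbf{r}_k\cdot\mathbf{r}_k) - (\boldsymbol{\omega}\cdot\mathbf{r}_k)(\boldsymbol{\omega}\cdot\mathbf{r}_k)] \\
&= \frac{1}{2}Mu^2 + (\mathbf{u} \times \boldsymbol{\omega})\cdot\sum_k m_k\mathbf{r}_k + \frac{1}{2}\sum_k m_k\,[\omega^2 r_k^2 - (\boldsymbol{\omega}\cdot\mathbf{r}_k)^2]
\end{aligned} $$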
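
The three-term split obtained in the last line can be checked against the direct sum of Eq. (12.10) for an arbitrary set of point masses. In the Python sketch below the three pieces are labelled `T_t`, `T_m` and `T_r` (translational, mixed and rotational); these labels and the function names are assumptions of the sketch.

```python
def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0]]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def kinetic_energy_direct(masses, positions, u, omega):
    """Eq. (12.10) with v_k = u + omega x r_k from Eq. (12.2)."""
    total = 0.0
    for m, r in zip(masses, positions):
        w = cross(omega, r)
        v = [u[i] + w[i] for i in range(3)]
        total += 0.5 * m * dot(v, v)
    return total

def kinetic_energy_split(masses, positions, u, omega):
    """Return (T_t, T_m, T_r): translational, mixed and rotational parts."""
    M = sum(masses)
    first_moment = [sum(m * r[i] for m, r in zip(masses, positions)) for i in range(3)]
    T_t = 0.5 * M * dot(u, u)
    T_m = dot(cross(u, omega), first_moment)
    T_r = 0.5 * sum(m * (dot(omega, omega) * dot(r, r) - dot(omega, r) ** 2)
                    for m, r in zip(masses, positions))
    return T_t, T_m, T_r
```

The mixed term is proportional to $\sum_k m_k\mathbf{r}_k$, so it vanishes when the origin is placed at the centre of mass.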




12.5 ANGULAR MOMENTUM 


The angular momentum is defined to be the conjugate momentum corresponding to the
angular velocity, that is, for a given Lagrangian $L$, the angular momentum $\mathbf{L}$ is

$$ \mathbf{L} = \frac{\partial L}{\partial\boldsymbol{\omega}} $$

by definition. Since the Lagrangian is a quantity that ought to be evaluated in terms of
the inertial kinetic and potential energies (which may however be conveniently expressed in
terms of quantities defined in noninertial frames), we use Eq. (12.11) for the inertial kinetic
energy, and also assume that the potential energy is independent of $\boldsymbol{\omega}$. We get for the $i$th
Cartesian component of the angular momentum


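$$ L_i = \frac{\partial L}{\partial\omega_i} = \frac{\partial(T - V)}{\partial\omega_i} = \frac{\partial T}{\partial\omega_i} = \frac{\partial(T_m + T_r)}{\partial\omega_i} $$

as $T = T_t + T_m + T_r$, and $T_t$ does not depend on $\boldsymbol{\omega}$. Now

$$ \frac{\partial T_m}{\partial\omega_i} = M(\mathbf{R}_{cm} \times \mathbf{u})_i = [\mathbf{R}_{cm} \times (M\mathbf{u})]_i \tag{12.13} $$

The right hand side represents the moment of the linear part of the momentum of the whole
body as if the whole body is effectively concentrated at the centre of mass of the body and
is moving with a uniform translational velocity $\mathbf{u}$. Similarly,

$$ \frac{\partial T_r}{\partial\omega_i} = I_{ij}\omega_j \tag{12.14} $$

This corresponds to the rotational part of the angular momentum of the body. In the
absence of rotation, $\boldsymbol{\omega} = 0$, and this component vanishes. Therefore, the $i$th component
of the total angular momentum of the body is given by

$$ L_i = [\mathbf{R}_{cm} \times M\mathbf{u}]_i + I_{ij}\omega_j \tag{12.15} $$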

Now we want to see whether the angular momentum defined from the Lagrangian is the 
same as the moment of momentum of the body defined in the usual Newtonian manner. 
The moment of the total momentum of the body with respect to the origin of any fixed 
inertial frame is defined and given by 

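$$
\begin{aligned}
\sum m\,\mathbf{r}\times\mathbf{v} &= \sum m\,\mathbf{r}\times(\mathbf{u} + \boldsymbol{\omega}\times\mathbf{r}) \\
 &= \left(\sum m\,\mathbf{r}\right)\times\mathbf{u} + \sum m\left[r^2\boldsymbol{\omega} - (\mathbf{r}\cdot\boldsymbol{\omega})\,\mathbf{r}\right]
\end{aligned}
$$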



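or its ith component is given by

$$
\begin{aligned}
\left[\sum m\,\mathbf{r}\times\mathbf{v}\right]_i &= [\mathbf{R}_{cm}\times M\mathbf{u}]_i + \sum m\,(r^2\omega_i - r_j\omega_j r_i) \\
 &= [\mathbf{R}_{cm}\times M\mathbf{u}]_i + \sum m\,(r^2\delta_{ij} - r_i r_j)\,\omega_j \\
 &= [\mathbf{R}_{cm}\times M\mathbf{u}]_i + I_{ij}\,\omega_j \\
 &= L_i
\end{aligned}
$$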

Hence the Newtonian definition of the moment of momentum of any rigid body and the
Lagrangian definition of its angular momentum as a quantity canonically conjugate to angular
velocity refer to the same physical quantity.
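The equality of the two definitions can also be confirmed on a small numerical example. This is a sketch under stated assumptions: four point masses with made-up positions measured from the origin of the frame, and made-up u and ω; the function names are illustrative.

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def inertia_tensor(masses, positions):
    """I_ij = sum_k m_k [r_k^2 delta_ij - (r_k)_i (r_k)_j], as in Eq. (12.12)."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = sum(c * c for c in r)
        for i in range(3):
            for j in range(3):
                I[i][j] += m * ((r2 if i == j else 0.0) - r[i] * r[j])
    return I

def moment_of_momentum(masses, positions, u, omega):
    """Newtonian form: sum_k m_k r_k x (u + omega x r_k)."""
    total = [0.0, 0.0, 0.0]
    for m, r in zip(masses, positions):
        wxr = cross(omega, r)
        v = tuple(u[i] + wxr[i] for i in range(3))
        rxv = cross(r, v)
        for i in range(3):
            total[i] += m * rxv[i]
    return total

def conjugate_momentum(masses, positions, u, omega):
    """Lagrangian form, Eq. (12.15): [R_cm x M u]_i + I_ij omega_j."""
    M = sum(masses)
    R_cm = tuple(sum(m * r[i] for m, r in zip(masses, positions)) / M
                 for i in range(3))
    trans = cross(R_cm, tuple(M * c for c in u))
    I = inertia_tensor(masses, positions)
    return [trans[i] + sum(I[i][j] * omega[j] for j in range(3))
            for i in range(3)]

# Illustrative data (hypothetical values).
masses = [1.0, 2.0, 0.5, 1.5]
positions = [(1.0, 0.0, 0.2), (-0.3, 0.8, 0.0), (0.4, -0.6, 1.1), (0.0, 0.5, -0.7)]
u = (0.2, -0.1, 0.4)
omega = (0.3, 0.9, -0.5)

L_newton = moment_of_momentum(masses, positions, u, omega)
L_lagrange = conjugate_momentum(masses, positions, u, omega)
print(L_newton)
print(L_lagrange)
```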

Another point to note is that the angular momentum as measured with respect to any
outside inertial frame about its origin contains two parts: a purely rotational part and a
purely translational part. The translational part is nothing but the moment of the total
linear momentum of the body as if the body has been shrunk to a point at the centre of
mass and is moving with the inertial linear velocity of the CM. So this term will vanish if
the observer is moving with the centre of mass of the system ($\mathbf{R}_{cm} = 0$). The second part, that
is, the rotational part does not in general point towards the angular velocity, or in other
words, the rotational part of the angular momentum vector may have a different direction
compared to $\boldsymbol{\omega}$, unless $I_{ij}$ is reduced to a scalar. This is, of course, one of the well-known
properties of second rank tensors (see Appendix A3).


12.6 TRANSFORMATIONS OF AND THEOREMS ON THE MOMENT OF 
INERTIA TENSOR 

The concepts of the centre of mass and moment of inertia were introduced by Euler. 

Let us choose an arbitrary but fixed point in the rigid body and define a body frame with
its origin at a point O in the body (see Fig. 12.3). Then the moment of inertia tensor about
O can be defined through Eq. (12.12)

$$ I_{ij} = \sum_{k=1}^{N} m_k\left[r_k^2\,\delta_{ij} - (\mathbf{r}_k)_i(\mathbf{r}_k)_j\right] \tag{12.12} $$

The sum extends over all the N particles of the rigid body with the position vector of the
kth particle given by $\mathbf{r}_k = \{(\mathbf{r}_k)_1\hat{\mathbf{i}} + (\mathbf{r}_k)_2\hat{\mathbf{j}} + (\mathbf{r}_k)_3\hat{\mathbf{k}}\} \equiv \{x_k\hat{\mathbf{i}} + y_k\hat{\mathbf{j}} + z_k\hat{\mathbf{k}}\}$ expressed
in terms of the given body frame basis vectors $(\hat{\mathbf{i}}, \hat{\mathbf{j}}, \hat{\mathbf{k}})$.

Since the body does not rotate or translate with respect to the body frame, the position 
vector of the particles with respect to the body frame does not change with time. Thus 
the moment of inertia tensor, defined through Eq. (12.12), depends only on the mass 
distribution in the body and is a characteristic of the body itself. Further, note that if we 
choose a different point O' in the body as the origin of the body frame, all the position 
vectors, in general, change. Consequently, the moment of inertia tensor, about different 




$$ I_{yz} = -\sum_k m_k\,y_k z_k = I_{zy} = -G \tag{12.17} $$

These off-diagonal components of $I_{ij}$ are called the products of inertia of the body about
the given point and the given set of the body axes. Equations (12.17) are to be taken as
the definitions of the symbols D, F and G.

The relations (12.17) show that the moment of inertia tensor is symmetric and therefore
has only six independent components, namely A, B, C, D, F, and G. The details of
the structural distribution of the N particles constituting the rigid body are therefore not
required for studying the dynamics of a rigid body. Only six quantities, namely the three
moments of inertia and the three products of inertia of the body, contain sufficient
information on the inertial structure of the rigid body. Together with six generalised coordinates for its
location and instantaneous orientation in space, and six generalised velocity components,
namely $\mathbf{u}$ and $\boldsymbol{\omega}$, they give the complete description of its instantaneous motion.

12.6.1 Properties of $I_{ij}$

(a) Changes of $I_{ij}$ Under Translation

Usually all the six components of the moment of inertia tensor are available for a given
point and a given set of body axes. Sometimes it is necessary to know the components of
the inertia tensor about a different point in the body without of course changing the inertial
directions of the body axes. This requires a translation of the origin of the body frame by
a finite displacement vector, say $\mathbf{a}$. So let us transport the origin from O to O' such that
$\overrightarrow{OO'} = \mathbf{a}$ and $\mathbf{r}' = \mathbf{r} - \mathbf{a}$ for every particle in the body, but keep the directions of the
axes parallel to the original ones. Therefore, the (ij)th component of the moment of inertia
tensor at the new point O' would be, by its definition,

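$$
\begin{aligned}
I'_{ij} &= \sum m\,(r'^2\delta_{ij} - r'_i r'_j) \\
 &= \sum m\left[(r^2 - 2(\mathbf{r}\cdot\mathbf{a}) + a^2)\,\delta_{ij} - (r_i - a_i)(r_j - a_j)\right] \\
 &= \sum m\,(r^2\delta_{ij} - r_i r_j) + \sum m\,(a^2\delta_{ij} - a_i a_j)
    - 2\sum m\,(\mathbf{r}\cdot\mathbf{a})\,\delta_{ij} + \sum m\,(r_i a_j + a_i r_j) \\
 &= I_{ij} + M(a^2\delta_{ij} - a_i a_j) - 2M(\mathbf{R}\cdot\mathbf{a})\,\delta_{ij}
    + \left(\sum m\,r_i\right)a_j + \left(\sum m\,r_j\right)a_i
\end{aligned}
$$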

The above expression establishes a connection between the two sets of components of the
moment of inertia tensor due to the translation of the origin by $\mathbf{a}$. If $I_{ij}$ and $\mathbf{a}$ are known,
the transformed M.I. tensor can in general be obtained by using the above relation. The
knowledge of the actual configuration of the body is also essential for evaluating the last
two terms. However the above expression can be greatly simplified if we choose the first
origin O to be at the centre of mass of the body. Then by definition, $\sum m\,r_i = MR_i = 0$,
or $\sum m\,\mathbf{r} = 0$. Hence the last three terms in the above expression vanish, with the result

$$ I'_{ij} = I_{ij} + M(a^2\delta_{ij} - a_i a_j) $$
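When O is the centre of mass, the translation rule reduces to $I'_{ij} = I_{ij} + M(a^2\delta_{ij} - a_i a_j)$. A minimal numerical sketch of this special case follows, assuming four made-up point masses and a made-up displacement a; the tensor about the new origin is computed both directly from the definition and from the rule.

```python
def inertia_tensor(masses, positions):
    """I_ij = sum_k m_k [r_k^2 delta_ij - (r_k)_i (r_k)_j]."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = sum(c * c for c in r)
        for i in range(3):
            for j in range(3):
                I[i][j] += m * ((r2 if i == j else 0.0) - r[i] * r[j])
    return I

# Illustrative data (hypothetical values).
masses = [1.0, 2.0, 0.5, 1.5]
raw = [(1.0, 0.0, 0.2), (-0.3, 0.8, 0.0), (0.4, -0.6, 1.1), (0.0, 0.5, -0.7)]
M = sum(masses)

# Shift the coordinates so that the first origin O is the centre of mass.
cm = tuple(sum(m * r[i] for m, r in zip(masses, raw)) / M for i in range(3))
about_cm = [tuple(r[i] - cm[i] for i in range(3)) for r in raw]

# New origin O' displaced by a, with r' = r - a and axes kept parallel.
a = (0.4, -0.2, 0.7)
about_new_origin = [tuple(r[i] - a[i] for i in range(3)) for r in about_cm]

I_cm = inertia_tensor(masses, about_cm)
I_direct = inertia_tensor(masses, about_new_origin)

a2 = sum(c * c for c in a)
I_formula = [[I_cm[i][j] + M * ((a2 if i == j else 0.0) - a[i] * a[j])
              for j in range(3)] for i in range(3)]

max_diff = max(abs(I_direct[i][j] - I_formula[i][j])
               for i in range(3) for j in range(3))
print(max_diff)
```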




ues) and are called the principal moments of inertia. For these $\lambda$'s the characteristic vectors
(eigenvectors), namely $\mathbf{r}_1$, $\mathbf{r}_2$, and $\mathbf{r}_3$ specify the corresponding directions of the principal
axes.


Remarks 

(i) For each point of the rigid body there exists a set of principal axes and the
corresponding principal moments of inertia.

(ii) If any two of the principal moments of inertia about some special point(s) in the body
are equal then the rigid body is called a symmetric top. If all the three are identical then
the body is called a spherical top. If all three are different for all points in the body, the
rigid body is called an asymmetric top.

(iii) Again, if any two or more principal moments of inertia turn out to be the same, the
normalised eigenvectors $\{\hat{\mathbf{n}}_1, \hat{\mathbf{n}}_2, \hat{\mathbf{n}}_3\}$ may not always be linearly independent and cannot
be taken as the basis vectors of the new coordinate system. However, as argued in section
11.2.4, it is still possible to have a set of three mutually independent basis vectors, which can
further be orthogonalised using Gram-Schmidt's orthogonalisation procedure. Of course, the
choice of the characteristic vectors for the degenerate eigenvalues in the plane of degeneracy
remains open.

(e) Moment of inertia about any arbitrary direction implied by a vector $\mathbf{x}$ passing
through the origin

We choose the origin and the three rectangular Cartesian axes of the body frame for
constructing the moment of inertia tensor of a body about the origin and the given axes. This
is shown in Fig. 12.5. Now we choose an arbitrary direction OS passing through the origin
O, about which we want to calculate the moment of inertia of the body. Obviously this
quantity will just be a number and it should not depend on how we have chosen the
orientations of the axes of the body frame. Such quantities are called scalars, as opposed to
vectors or tensors. If we denote the direction OS by the vector, say $\mathbf{x}$, and the moment of
inertia of the body about OS by $I_x$, then from the basic definition of the moment of inertia
about an axis,

$$ I_x = \sum_{(P)} m\,PN^2 $$

where the sum extends over all points P of the body, and rectilinear segment PN as shown
in Fig. 12.5 corresponds to the normal distance of the point P from the given axis OS. Since
$\overrightarrow{OP} = \mathbf{r}$, and $\overrightarrow{OS} = \mathbf{x}$, we can write $PN^2$ in terms of $\mathbf{r}$ and $\mathbf{x}$ as

$$ PN^2 = r^2 - \frac{(\mathbf{r}\cdot\mathbf{x})^2}{x^2} $$

where $x = |\mathbf{x}|$.
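The sum over perpendicular distances can be compared with the contraction of the inertia tensor along the unit vector of OS, which is the standard identity $I_x = I_{ij} n_i n_j$ with $\mathbf{n} = \mathbf{x}/x$. The sketch below is illustrative only: the point masses and the direction x are made-up values, and the function names are not from the text.

```python
import math

def inertia_tensor(masses, positions):
    """I_ij = sum_k m_k [r_k^2 delta_ij - (r_k)_i (r_k)_j]."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = sum(c * c for c in r)
        for i in range(3):
            for j in range(3):
                I[i][j] += m * ((r2 if i == j else 0.0) - r[i] * r[j])
    return I

def axis_moment_direct(masses, positions, x):
    """Sum of m * PN^2 with PN^2 = r^2 - (r . x)^2 / x^2."""
    x2 = sum(c * c for c in x)
    total = 0.0
    for m, r in zip(masses, positions):
        r2 = sum(c * c for c in r)
        rx = sum(r[i] * x[i] for i in range(3))
        total += m * (r2 - rx * rx / x2)
    return total

def axis_moment_from_tensor(I, x):
    """Contraction I_ij n_i n_j with n the unit vector along x."""
    norm = math.sqrt(sum(c * c for c in x))
    n = [c / norm for c in x]
    return sum(I[i][j] * n[i] * n[j] for i in range(3) for j in range(3))

# Illustrative data (hypothetical values).
masses = [1.0, 2.0, 0.5, 1.5]
positions = [(1.0, 0.0, 0.2), (-0.3, 0.8, 0.0), (0.4, -0.6, 1.1), (0.0, 0.5, -0.7)]
x = (2.0, -1.0, 0.5)

I_direct = axis_moment_direct(masses, positions, x)
I_contracted = axis_moment_from_tensor(inertia_tensor(masses, positions), x)
print(I_direct, I_contracted)
```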




for. For example, for any spherical top, we have $I_x = I_y = I_z$ about the centre of mass of
the top. A spherical top does not necessarily have to be spherical in geometry. It can as well
be a homogeneous cube, for example, for which all the three principal moments of inertia
are identical when referred to its centre of mass. The inertial ellipsoid about its centre of
mass is a sphere of radius $1/\sqrt{I}$, $I = Ma^2/6$, a being the length of any side of the cube.
Hence, the moment of inertia of the cube about any arbitrary axis passing through its centre 
of mass is the same, which appears rather surprising for an object like a cube. Surely, one 
would tend to intuitively think that the moment of inertia about an axis passing through the 
corner of the cube for example, would be different from that about an axis passing normally 
through the middle of any two parallel faces! Now if we shift the origin of the body frame 
to one of the corners of the cube, the ellipsoid of inertia no longer remains spherical. Along 
all the axes of the four-fold symmetry of the cube and the body diagonals, the transported 
origin would produce inertial ellipsoids as prolate spheroids, and in arbitrary directions the 
inertial ellipsoids become truly ellipsoids. After pointing out asymmetries about symmetric 
objects, let us give an example of symmetries for asymmetric bodies. Take any irregular 
body, and try to draw the ellipsoid of inertia about any arbitrary point of the body — it 
is still an ellipsoid. If you come to think of it, you may find a simple answer to it. In any 
case, due to various reflection symmetries of an ellipsoid, the momental ellipsoid can be cut
into eight pieces equal in all respects. So there are eight directions, one in each piece, that 
will give the identical moment of inertia. And this is true for any arbitrary point chosen in 
any irregular body! 
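The claim about the cube can be tested by brute force. The following sketch approximates a homogeneous cube by a regular grid of equal point masses (an assumption made only for this illustration, with a hypothetical grid size of 16 cells per edge) and evaluates the moment of inertia about three different axes through the centre.

```python
import math

def cube_axis_moment(side, mass, direction, cells_per_edge=16):
    """Moment of inertia of a gridded cube about an axis through its centre."""
    n = cells_per_edge
    h = side / n
    dm = mass / n ** 3
    norm = math.sqrt(sum(c * c for c in direction))
    e = tuple(c / norm for c in direction)
    # Cell-centre coordinates along one edge, measured from the cube centre.
    coords = [-side / 2 + (k + 0.5) * h for k in range(n)]
    total = 0.0
    for x in coords:
        for y in coords:
            for z in coords:
                r2 = x * x + y * y + z * z
                along = x * e[0] + y * e[1] + z * e[2]
                total += dm * (r2 - along * along)  # dm * (distance from axis)^2
    return total

I_face = cube_axis_moment(1.0, 1.0, (0, 0, 1))   # normal to a pair of faces
I_diag = cube_axis_moment(1.0, 1.0, (1, 1, 1))   # along a body diagonal
I_skew = cube_axis_moment(1.0, 1.0, (1, 2, 3))   # an arbitrary direction
print(I_face, I_diag, I_skew)
```

The three values coincide to rounding error, and they approach $Ma^2/6$ as the grid is refined; the small offset from 1/6 at this grid size is a discretisation effect.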


12.7 EXAMPLES OF THE CALCULATION AND THE EXPERIMENTAL 
MEASUREMENT OF THE MOMENT OF INERTIA TENSOR 

1. A Homogeneous Right Triangular Pyramid with the Rectangular Base Sides a 
and Height 3a/2 

Shown in Fig. 12.6 is a right triangular pyramid with reference to a fixed body frame.

We wish to obtain the moment of inertia tensor about the origin O. With $(r_1, r_2, r_3) = (x, y, z)$
we can write

$$ I_{ij} = \int_0^{3a/2} dz \int_0^{a-(2z/3)} dy \int_0^{a-(2z/3)-y} dx\;\rho
\begin{pmatrix} y^2 + z^2 & -xy & -zx \\ -xy & z^2 + x^2 & -yz \\ -zx & -yz & x^2 + y^2 \end{pmatrix} $$

where $\rho$ is the constant mass density of the triangular pyramid. $\rho$ can be expressed in
terms of the total mass M of the pyramid as

$$ M = \rho\int_0^{3a/2} dz \int_0^{a-(2z/3)} dy \int_0^{a-(2z/3)-y} dx = \frac{1}{4}a^3\rho $$





Fig. 12.6 A choice of the rectangular Cartesian axes and origin of a suitable 
body frame for a right triangular pyramid 


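Thus for the inertia tensor about O with respect to the xyz axes shown in Fig. 12.6 we obtain

$$ I_{ij} = \frac{Ma^2}{40}\begin{pmatrix} 13 & -2 & -3 \\ -2 & 13 & -3 \\ -3 & -3 & 8 \end{pmatrix} \tag{12.26} $$

In order to obtain the principal moments of inertia about O and the principal axes of
inertia, we diagonalise the inertia tensor in Eq. (12.26) as described in section (12.6.1) (d).
The result is

$$ I = \frac{Ma^2}{40}\begin{pmatrix} 15 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 14 \end{pmatrix} \tag{12.27} $$

with the eigenvectors for the principal axes

$$ \hat{\mathbf{n}}_1 = \frac{1}{\sqrt{2}}(\hat{\mathbf{i}} - \hat{\mathbf{j}}), \qquad
   \hat{\mathbf{n}}_2 = \frac{1}{\sqrt{6}}(\hat{\mathbf{i}} + \hat{\mathbf{j}} + 2\hat{\mathbf{k}}), \qquad
   \hat{\mathbf{n}}_3 = \frac{1}{\sqrt{3}}(\hat{\mathbf{i}} + \hat{\mathbf{j}} - \hat{\mathbf{k}}) \tag{12.28} $$

where $\hat{\mathbf{i}}$, $\hat{\mathbf{j}}$ and $\hat{\mathbf{k}}$ are the unit vectors along the x, y and z axes shown in Fig. 12.6.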
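The integrals for this pyramid can be evaluated exactly with rational arithmetic. The sketch below assumes the standard closed form for monomial integrals over the region x/A + y/B + z/C <= 1 with x, y, z >= 0 (a Dirichlet integral), works in units where a = 1 and M = 1, and then checks the eigenvalues 15, 5 and 14 (in units of Ma²/40) against unnormalised eigenvectors; the function names are illustrative.

```python
from fractions import Fraction
from math import factorial

def simplex_moment(p, q, r, A, B, C):
    """Integral of x^p y^q z^r over x/A + y/B + z/C <= 1 with x, y, z >= 0."""
    return (A ** (p + 1) * B ** (q + 1) * C ** (r + 1)
            * Fraction(factorial(p) * factorial(q) * factorial(r),
                       factorial(p + q + r + 3)))

def pyramid_inertia_tensor():
    """Inertia tensor about O in units of M a^2 (a = 1, M = 1, height 3a/2)."""
    A, B, C = Fraction(1), Fraction(1), Fraction(3, 2)
    vol = simplex_moment(0, 0, 0, A, B, C)

    def avg(p, q, r):
        # Mass-weighted average of x^p y^q z^r for uniform density.
        return simplex_moment(p, q, r, A, B, C) / vol

    xx, yy, zz = avg(2, 0, 0), avg(0, 2, 0), avg(0, 0, 2)
    xy, yz, zx = avg(1, 1, 0), avg(0, 1, 1), avg(1, 0, 1)
    return [[yy + zz, -xy, -zx],
            [-xy, zz + xx, -yz],
            [-zx, -yz, xx + yy]]

def apply(matrix, v):
    return [sum(matrix[i][j] * v[j] for j in range(3)) for i in range(3)]

I = pyramid_inertia_tensor()
scaled = [[40 * I[i][j] for j in range(3)] for i in range(3)]  # units of M a^2 / 40
for row in scaled:
    print([int(c) for c in row])

# Unnormalised eigenvectors: scaled * v should equal eigenvalue * v.
print(apply(scaled, [1, -1, 0]), apply(scaled, [1, 1, 2]), apply(scaled, [1, 1, -1]))
```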




EI: + V 2 + z>) = 1 

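12. A hollow sphere of inner radius a and outer radius b with centre at origin O

MI: $I_{xx}(O) = I_{yy}(O) = I_{zz}(O) = \dfrac{2}{5}M\,\dfrac{b^5 - a^5}{b^3 - a^3}$

EI: $\dfrac{2}{5}M\,\dfrac{b^5 - a^5}{b^3 - a^3}\,(x^2 + y^2 + z^2) = 1$

13. A solid cylinder of radius r and length l with its axis along the z-axis and centre at the
origin O

MI: $I_{xx}(O) = I_{yy}(O) = M\left(\dfrac{r^2}{4} + \dfrac{l^2}{12}\right) \qquad I_{zz}(O) = \dfrac{Mr^2}{2}$

EI: $\dfrac{M}{4}\left[\left(r^2 + \dfrac{l^2}{3}\right)(x^2 + y^2) + 2r^2z^2\right] = 1$

14. A hollow cylinder of length l, inner radius a and outer radius b with axis along the
z-axis and centre at the origin O

MI: $I_{xx}(O) = I_{yy}(O) = M\left(\dfrac{a^2 + b^2}{4} + \dfrac{l^2}{12}\right) \qquad I_{zz}(O) = \dfrac{M(a^2 + b^2)}{2}$

EI: $\dfrac{M}{4}\left[\left(a^2 + b^2 + \dfrac{l^2}{3}\right)(x^2 + y^2) + 2(a^2 + b^2)z^2\right] = 1$

15. A solid right elliptical cylinder of height h and transverse axes 2a and 2b with
longitudinal axis of the cylinder in the z-direction, major axis a in the x-direction, minor axis
b in the y-direction and centre at the origin O

MI: $I_{xx}(O) = M\left(\dfrac{b^2}{4} + \dfrac{h^2}{12}\right) \qquad I_{yy}(O) = M\left(\dfrac{a^2}{4} + \dfrac{h^2}{12}\right) \qquad I_{zz}(O) = \dfrac{M(a^2 + b^2)}{4}$

EI: $\dfrac{M}{4}\left[\left(b^2 + \dfrac{h^2}{3}\right)x^2 + \left(a^2 + \dfrac{h^2}{3}\right)y^2 + (a^2 + b^2)z^2\right] = 1$

16. A solid rectangular parallelepiped with sides a, b and c parallel to the x-, y- and
z-axes respectively and with centre at the origin O

MI: $I_{xx}(O) = \dfrac{M(b^2 + c^2)}{12} \qquad I_{yy}(O) = \dfrac{M(c^2 + a^2)}{12} \qquad I_{zz}(O) = \dfrac{M(a^2 + b^2)}{12}$

EI: $\dfrac{M}{12}\left[(b^2 + c^2)x^2 + (c^2 + a^2)y^2 + (a^2 + b^2)z^2\right] = 1$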




17. A solid right circular cone of height h and radius of base r with the symmetry axis in
the z-direction and the base in the xy-plane with the centre at the origin O

MI: $I_{xx}(O) = I_{yy}(O) = M\left(\dfrac{3r^2}{20} + \dfrac{h^2}{10}\right) \qquad I_{zz}(O) = \dfrac{3Mr^2}{10}$

EI: $\dfrac{M}{20}\left[(3r^2 + 2h^2)(x^2 + y^2) + 6r^2z^2\right] = 1$

18. A solid torus whose equation in cylindrical coordinates $\rho$, $\phi$ and z is given by
$(\rho - b)^2 + z^2 = a^2$ where b > a with the centre of symmetry at the origin O

MI: $I_{xx}(O) = I_{yy}(O) = \dfrac{M(5a^2 + 4b^2)}{8} \qquad I_{zz}(O) = \dfrac{M(3a^2 + 4b^2)}{4}$

EI: $\dfrac{M}{8}\left[(5a^2 + 4b^2)(x^2 + y^2) + 2(3a^2 + 4b^2)z^2\right] = 1$
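Entries of this kind can be spot-checked by slicing the body into thin discs. The sketch below does this for the cone of item 17, assuming the base lies in the plane z = 0 with its centre at O and the apex at z = h. Each disc of radius R contributes dm R²/2 to the moment about the symmetry axis and dm (R²/4 + z²) to the moment about a transverse axis through O; the numerical values of M, r and h are made up for the illustration.

```python
import math

def cone_moments(mass, radius, height, slices=2000):
    """Return (I_xx, I_zz) about the base centre by stacking thin discs."""
    rho = mass / (math.pi * radius ** 2 * height / 3.0)  # uniform density
    dz = height / slices
    I_xx = 0.0
    I_zz = 0.0
    for k in range(slices):
        z = (k + 0.5) * dz                  # midpoint of the slice
        R = radius * (1.0 - z / height)     # disc radius at height z
        dm = rho * math.pi * R * R * dz
        I_zz += 0.5 * dm * R * R            # disc about its own axis
        I_xx += dm * (0.25 * R * R + z * z) # disc about a diameter, shifted by z
    return I_xx, I_zz

# Illustrative values (hypothetical).
M, r, h = 2.0, 0.5, 1.2
I_xx_num, I_zz_num = cone_moments(M, r, h)
I_xx_formula = M * (3 * r ** 2 / 20 + h ** 2 / 10)
I_zz_formula = 3 * M * r ** 2 / 10
print(I_xx_num, I_xx_formula)
print(I_zz_num, I_zz_formula)
```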


12.8 ANGULAR MOMENTUM IN LABORATORY AND CENTRE OF MASS 
FRAMES 


Laboratory frames are a class of idealised fixed inertial frames firmly attached to the earth
or a laboratory in space, neglecting the effect of their rotation and acceleration, if any. For
a colliding system, the laboratory frame is usually defined to be the one in which either the
heavier particle or the force centre is at rest. Let a system of N particles have masses $m_i$,
position vectors $\mathbf{r}_i$ and velocities $\mathbf{v}_i$, with the total angular momentum about the origin of
the laboratory frame as

$$ \mathbf{L} = \sum_{i=1}^{N}\mathbf{r}_i\times\mathbf{p}_i $$

where $\mathbf{p}_i = m_i\mathbf{v}_i = m_i\,d\mathbf{r}_i/dt$ is the linear momentum of the ith particle in the
laboratory frame. The position vector of the centre of mass and its velocity with respect to
the laboratory frame are defined to be

$$ \mathbf{R} = \frac{\sum m_i\mathbf{r}_i}{\sum m_i} = \frac{\sum m_i\mathbf{r}_i}{M}, \qquad
   \mathbf{V} = \frac{\sum m_i\mathbf{v}_i}{\sum m_i} = \frac{\sum \mathbf{p}_i}{M} \tag{12.34} $$

where M is the total mass of the system.

The centre of mass (CM) frame is, by definition, the frame in which the origin of the
coordinate frame coincides with the centre of mass of the system. Consequently, the velocity
of the centre of mass vanishes in the centre of mass frame. Let us denote the quantities
measured with respect to the centre of mass frame by primes on the corresponding symbols,
so that

$$ \mathbf{r}'_i = \mathbf{r}_i - \mathbf{R}, \qquad
   \mathbf{v}'_i = \dot{\mathbf{r}}'_i = \dot{\mathbf{r}}_i - \dot{\mathbf{R}} = \mathbf{v}_i - \mathbf{V}, \qquad
   \mathbf{p}'_i = m_i(\mathbf{v}_i - \mathbf{V}) \tag{12.35} $$




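It is easy to verify that

$$ \sum m_i\mathbf{r}'_i = 0 \quad\text{and}\quad \sum \mathbf{p}'_i = 0 \tag{12.36} $$

The angular momentum in the CM frame about the CM is defined to be

$$ \mathbf{L}' = \sum \mathbf{r}'_i\times\mathbf{p}'_i \tag{12.37} $$

There are two other equivalent ways of defining the same quantity, namely,

$$ \mathbf{L}'_1 = \sum \mathbf{r}'_i\times\mathbf{p}_i \tag{12.38} $$

and

$$ \mathbf{L}'_2 = \sum \mathbf{r}_i\times\mathbf{p}'_i \tag{12.39} $$

The equivalence of these three definitions can be easily established. Consider

$$
\begin{aligned}
\mathbf{L}' &= \sum \mathbf{r}'_i\times\mathbf{p}'_i = \sum \mathbf{r}'_i\times(\mathbf{p}_i - m_i\mathbf{V}) \\
 &= \sum \mathbf{r}'_i\times\mathbf{p}_i - \left(\sum m_i\mathbf{r}'_i\right)\times\mathbf{V} = \sum \mathbf{r}'_i\times\mathbf{p}_i = \mathbf{L}'_1
\end{aligned}
$$

Similarly,

$$
\begin{aligned}
\mathbf{L}' &= \sum \mathbf{r}'_i\times\mathbf{p}'_i = \sum (\mathbf{r}_i - \mathbf{R})\times\mathbf{p}'_i \\
 &= \sum \mathbf{r}_i\times\mathbf{p}'_i - \mathbf{R}\times\sum \mathbf{p}'_i = \sum \mathbf{r}_i\times\mathbf{p}'_i = \mathbf{L}'_2
\end{aligned}
$$

Thus we get

$$ \mathbf{L}' = \mathbf{L}'_1 = \mathbf{L}'_2 $$

However, the angular momentum in the CM frame is not equal to the angular momentum
in the laboratory frame. To see this, consider,

$$
\begin{aligned}
\mathbf{L}' &= \sum \mathbf{r}'_i\times\mathbf{p}'_i = \sum (\mathbf{r}_i - \mathbf{R})\times(\mathbf{p}_i - m_i\mathbf{V}) \\
 &= \sum \mathbf{r}_i\times\mathbf{p}_i - \mathbf{R}\times\left(\sum \mathbf{p}_i\right) - \left(\sum m_i\mathbf{r}_i\right)\times\mathbf{V} + \mathbf{R}\times\mathbf{V}\left(\sum m_i\right) \\
 &= \mathbf{L} - M(\mathbf{R}\times\mathbf{V}) - M(\mathbf{R}\times\mathbf{V}) + M(\mathbf{R}\times\mathbf{V}) \\
 &= \mathbf{L} - M(\mathbf{R}\times\mathbf{V})
\end{aligned}
\tag{12.40}
$$

The equation $\mathbf{L}' = \mathbf{L}$ is valid if (i) the observer is located at the centre of mass, that is,
$\mathbf{R} = 0$, or (ii) the observer is moving (with respect to the lab frame) in such a way that the centre
of mass appears to be stationary, that is, $\mathbf{V} = 0$, or (iii) $\mathbf{R}$ is parallel to $\mathbf{V}$, that is, the
system is moving radially with respect to the lab frame.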
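Relation (12.40) and the vanishing sums in Eq. (12.36) can be checked on a small system. This is a sketch with made-up masses, positions and velocities; it evaluates L in the lab frame, L' in the CM frame, and the correction term M(R × V).

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

# Illustrative data (hypothetical values), all measured in the lab frame.
masses = [1.0, 2.0, 0.5, 1.5]
positions = [(1.0, 0.0, 0.2), (-0.3, 0.8, 0.0), (0.4, -0.6, 1.1), (0.0, 0.5, -0.7)]
velocities = [(0.1, 0.4, 0.0), (-0.2, 0.0, 0.3), (0.5, 0.1, -0.1), (0.0, -0.3, 0.2)]

M = sum(masses)
R = tuple(sum(m * r[i] for m, r in zip(masses, positions)) / M for i in range(3))
V = tuple(sum(m * v[i] for m, v in zip(masses, velocities)) / M for i in range(3))

L_lab = [0.0, 0.0, 0.0]   # L  = sum r_i x p_i
L_cm = [0.0, 0.0, 0.0]    # L' = sum r'_i x p'_i
sum_m_rprime = [0.0, 0.0, 0.0]
for m, r, v in zip(masses, positions, velocities):
    p = tuple(m * c for c in v)
    r_prime = tuple(r[i] - R[i] for i in range(3))
    p_prime = tuple(m * (v[i] - V[i]) for i in range(3))
    lab_term = cross(r, p)
    cm_term = cross(r_prime, p_prime)
    for i in range(3):
        L_lab[i] += lab_term[i]
        L_cm[i] += cm_term[i]
        sum_m_rprime[i] += m * r_prime[i]

RxV = cross(R, V)
L_from_1240 = [L_lab[i] - M * RxV[i] for i in range(3)]
print(L_cm)
print(L_from_1240)
```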

Since a rigid body is a special case of an N-particle system, Eq. (12.40) should apply to
it. Eq. (12.15) tells us that the angular momentum of a rigid body with respect to a fixed
inertial frame outside (lab frame) is given by

$$ \mathbf{L} = M\mathbf{R}\times\mathbf{u} + \sum_j (I_{ij}\,\omega_j)_i $$




where $\mathbf{u}$ is the uniform translational velocity of the rigid body. (Such a mix-up of vector
and tensor indices as above is allowed only if the coordinate frames involved are all of the
rectangular Cartesian type.) Here $\mathbf{L}'$ may be identified with $\sum_j I_{ij}\,\omega_j$. However, for a
proper comparison between Eqs (12.15) and (12.40), we must define a CM frame for the
rigid body and this frame should also act as a body frame so that the $I_{ij}$'s are not time varying.
Such a frame is a body frame with its origin at the CM. Now, we can express the inertial
velocity of any particle in the rigid body given by

$$ \mathbf{v}_i = \mathbf{u} + \boldsymbol{\omega}\times\mathbf{r}_i $$

in terms of the instantaneous basis vectors of the CM frame defined above. Note again that
all the quantities $\mathbf{v}_i$, $\mathbf{u}$, and $\boldsymbol{\omega}$ are measured in the lab frame (inertial frame) and $\mathbf{r}_i$ is the
position vector of the particle with respect to the CM. Thus the linear momentum of the
whole body is

$$
\begin{aligned}
\mathbf{P} &= \sum m_i\mathbf{v}_i = \sum m_i\mathbf{u} + \sum m_i(\boldsymbol{\omega}\times\mathbf{r}_i) \\
 &= M\mathbf{u} + \boldsymbol{\omega}\times\left(\sum m_i\mathbf{r}_i\right) \\
 &= M\mathbf{u}
\end{aligned}
$$

since $\sum m_i\mathbf{r}_i = 0$ (Eq. (12.36)). The inertial angular momentum about the CM turns out
to be

$$
\begin{aligned}
\mathbf{L}' &= \sum m_i\mathbf{r}_i\times\mathbf{v}_i = \sum m_i\mathbf{r}_i\times(\mathbf{u} + \boldsymbol{\omega}\times\mathbf{r}_i) \\
 &= M\mathbf{R}\times\mathbf{u} + \sum_j (I_{ij}\,\omega_j)_i \\
 &= \mathbf{L} \quad\text{by Eq. (12.15)}
\end{aligned}
$$

Since the position vectors are with respect to the CM of the body, $M\mathbf{R} = \sum m_i\mathbf{r}_i = 0$
so that

$$ \mathbf{L} = \mathbf{L}' = \sum_j (I_{ij}\,\omega_j)_i \tag{12.41} $$

Notice that, here $\mathbf{L}$ and $\mathbf{L}'$ are both inertial quantities, $\mathbf{L}'$ being expressed in terms of the
instantaneous basis vectors of the rotating CM frame and $\mathbf{L}$ is the same vector expressed
in terms of the basis vectors of the fixed lab frame. Therefore in actual evaluation of $\mathbf{L}'$,
time dependence is thrown into the basis vectors of the CM frame and $I_{ij}$ do not vary with
time, while for the evaluation of $\mathbf{L}$, $I_{ij}$ vary with time and the basis vectors are fixed and
therefore time independent.


12.9 TORQUE AND ITS RELATION TO ANGULAR MOMENTUM 

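If a system of N particles experiences external forces $\mathbf{F}_i$, the torque due to these forces about
the origin of any inertial frame is defined by

$$ \boldsymbol{\Gamma} = \sum \mathbf{r}_i\times\mathbf{F}_i = \sum m_i\,\mathbf{r}_i\times\ddot{\mathbf{r}}_i \tag{12.42} $$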




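The angular momentum about the same origin is defined as before

$$ \mathbf{L} = \sum \mathbf{r}_i\times m_i\dot{\mathbf{r}}_i $$

Therefore,

$$
\begin{aligned}
\frac{d\mathbf{L}}{dt} &= \frac{d}{dt}\sum m_i\,\mathbf{r}_i\times\dot{\mathbf{r}}_i \\
 &= \sum m_i\,\dot{\mathbf{r}}_i\times\dot{\mathbf{r}}_i + \sum m_i\,\mathbf{r}_i\times\ddot{\mathbf{r}}_i \\
 &= \boldsymbol{\Gamma}
\end{aligned}
\tag{12.43}
$$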



Fig. 12.9 Position vectors with respect to the origin O of an inertial frame, 
the centre of mass G, and an arbitrary particle at A, of a system 
of particles 


This relation is the rotational equivalent of Newton's second law of motion and is valid
when the moments are taken about the origin of any fixed inertial frame of reference.
However, sometimes one has to compute the torque and angular momentum about a point which
is not fixed, for example, about the moving centre of mass or some other fixed point of a
body. Therefore, one must know under what conditions Eq. (12.43) is valid. Instead of
taking moments about the fixed origin, let us take them about any arbitrary point A (see
Fig. 12.9).

Consider a system of N particles. Let the ith particle sit at P and experience a total
force $\mathbf{F}_i$. G is the centre of mass and $M = \sum m_i$ is the total mass of the system. From




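the geometrical constructs of Fig. 12.9 we can write

$$
\begin{aligned}
\mathbf{r}_i &= \mathbf{r}_A + \mathbf{r}_{iA}, \qquad \mathbf{F}_i = m_i\ddot{\mathbf{r}}_i \\
\sum m_i\mathbf{r}_{iA} &= \sum m_i(\mathbf{r}_i - \mathbf{r}_A) = M(\mathbf{r}_G - \mathbf{r}_A) = M\mathbf{r}_{GA}
\end{aligned}
\tag{12.44}
$$

The resultant moment of all forces about A is by definition the torque about A, that is,

$$
\begin{aligned}
\boldsymbol{\Gamma}_A &= \sum \mathbf{r}_{iA}\times\mathbf{F}_i \\
 &= \sum \mathbf{r}_{iA}\times m_i(\ddot{\mathbf{r}}_{iA} + \ddot{\mathbf{r}}_A) \\
 &= \sum m_i(\mathbf{r}_{iA}\times\ddot{\mathbf{r}}_{iA}) + \sum m_i(\mathbf{r}_{iA}\times\ddot{\mathbf{r}}_A) \\
 &= \dot{\mathbf{L}}_A + M\mathbf{r}_{GA}\times\ddot{\mathbf{r}}_A
\end{aligned}
\tag{12.45}
$$

where in the last step we have used Eq. (12.44). Here $\mathbf{L}_A = \sum m_i\,\mathbf{r}_{iA}\times\dot{\mathbf{r}}_{iA}$ is the angular
momentum about the point A. So if we desire to have the torque angular momentum relation
to be of the form Eq. (12.43), the second term on RHS of Eq. (12.45) must vanish and this
happens when the point A

(i) has a constant velocity, or

(ii) is the CM itself implying $\mathbf{r}_{GA} = 0$, or

(iii) has acceleration parallel to $\mathbf{r}_{GA}$.

One could also start with another definition of $\mathbf{L}_A$ such as

$$
\begin{aligned}
\mathbf{L}'_A &= \sum \mathbf{r}_{iA}\times\mathbf{p}_i = \sum m_i\,\mathbf{r}_{iA}\times\dot{\mathbf{r}}_i \\
 &= \sum m_i\,\mathbf{r}_{iA}\times(\dot{\mathbf{r}}_{iA} + \dot{\mathbf{r}}_A) \\
 &= \mathbf{L}_A + M\mathbf{r}_{GA}\times\dot{\mathbf{r}}_A
\end{aligned}
\tag{12.46}
$$

$\mathbf{L}'_A$ may be called the absolute angular momentum about A. In this case,

$$
\begin{aligned}
\boldsymbol{\Gamma}_A &= \dot{\mathbf{L}}_A + M\mathbf{r}_{GA}\times\ddot{\mathbf{r}}_A \\
 &= \dot{\mathbf{L}}'_A - M\dot{\mathbf{r}}_{GA}\times\dot{\mathbf{r}}_A - M\mathbf{r}_{GA}\times\ddot{\mathbf{r}}_A + M\mathbf{r}_{GA}\times\ddot{\mathbf{r}}_A \\
 &= \dot{\mathbf{L}}'_A + M\dot{\mathbf{r}}_A\times\dot{\mathbf{r}}_{GA} \\
 &= \dot{\mathbf{L}}'_A + M\dot{\mathbf{r}}_A\times\dot{\mathbf{r}}_G
\end{aligned}
\tag{12.47}
$$

If we want Eq. (12.47) to give the relation $\boldsymbol{\Gamma}_A = \dot{\mathbf{L}}'_A$, the fourth condition emerges, and
that is, the point A (iv) has velocity parallel to that of the CM.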
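Equations (12.45) and (12.47) can be verified with numerical time derivatives. The sketch below assumes, purely for illustration, that every particle and the reference point A move with constant acceleration (all numerical values are made up), evaluates $\mathbf{L}_A$ and $\mathbf{L}'_A$ at neighbouring times, and differentiates them by central differences.

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

# Each particle is (mass, r0, v0, a0); constant acceleration is an illustrative choice.
PARTICLES = [
    (1.0, (1.0, 0.0, 0.0), (0.0, 0.5, 0.1), (0.2, 0.0, -0.3)),
    (2.0, (0.0, 1.0, 0.5), (-0.4, 0.0, 0.2), (0.0, 0.1, 0.4)),
    (0.5, (-1.0, 0.3, 1.0), (0.3, -0.2, 0.0), (-0.5, 0.3, 0.0)),
]
POINT_A = ((0.2, -0.1, 0.3), (0.1, 0.4, -0.2), (0.3, -0.2, 0.1))  # r0, v0, a0 of A

def position(r0, v0, a0, t):
    return tuple(r0[i] + v0[i] * t + 0.5 * a0[i] * t * t for i in range(3))

def velocity(v0, a0, t):
    return tuple(v0[i] + a0[i] * t for i in range(3))

def angular_momenta(t):
    """Return (L_A, L'_A): relative and absolute angular momentum about A."""
    rA = position(*POINT_A, t)
    vA = velocity(POINT_A[1], POINT_A[2], t)
    L_rel, L_abs = [0.0] * 3, [0.0] * 3
    for m, r0, v0, a0 in PARTICLES:
        r_iA = sub(position(r0, v0, a0, t), rA)
        v_i = velocity(v0, a0, t)
        rel = cross(r_iA, sub(v_i, vA))
        ab = cross(r_iA, v_i)
        for k in range(3):
            L_rel[k] += m * rel[k]
            L_abs[k] += m * ab[k]
    return L_rel, L_abs

def check(t, h=1e-5):
    """Return (Gamma_A, RHS of Eq. (12.45), RHS of Eq. (12.47)) at time t."""
    rel_p, abs_p = angular_momenta(t + h)
    rel_m, abs_m = angular_momenta(t - h)
    dL_rel = [(p - q) / (2 * h) for p, q in zip(rel_p, rel_m)]
    dL_abs = [(p - q) / (2 * h) for p, q in zip(abs_p, abs_m)]
    rA = position(*POINT_A, t)
    vA = velocity(POINT_A[1], POINT_A[2], t)
    aA = POINT_A[2]
    M = sum(p[0] for p in PARTICLES)
    gamma, mr, mv = [0.0] * 3, [0.0] * 3, [0.0] * 3
    for m, r0, v0, a0 in PARTICLES:
        r_i = position(r0, v0, a0, t)
        v_i = velocity(v0, a0, t)
        tq = cross(sub(r_i, rA), tuple(m * c for c in a0))  # r_iA x F_i
        for k in range(3):
            gamma[k] += tq[k]
            mr[k] += m * r_i[k]
            mv[k] += m * v_i[k]
    r_GA = tuple(mr[k] / M - rA[k] for k in range(3))
    v_G = tuple(mv[k] / M for k in range(3))
    c45 = cross(r_GA, aA)
    c47 = cross(vA, v_G)
    rhs45 = [dL_rel[k] + M * c45[k] for k in range(3)]
    rhs47 = [dL_abs[k] + M * c47[k] for k in range(3)]
    return gamma, rhs45, rhs47

gamma, rhs45, rhs47 = check(0.7)
print(gamma)
print(rhs45)
print(rhs47)
```

The three printed vectors should agree up to the finite-difference error of the numerical derivative.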

The importance of this exercise is that we are now in a position to use the Newtonian
torque angular momentum relation for studying the motion of a rigid body as a special case
of an N-particle system. If a rigid body is rotating in any manner, keeping one point, say
A, fixed in space then $\boldsymbol{\Gamma}_A = \dot{\mathbf{L}}_A$ will be valid under any one of the above four conditions.
A most widely used case is to choose the centre of mass of the rigid body as the origin,




(the condition (ii) above). However one must remember that $\boldsymbol{\Gamma}_A$ and $\mathbf{L}_A$ are both inertial
quantities although they may have to be expressed in terms of the instantaneous basis
vectors of a body frame which is rotating and translating with the body.

In chapter I, we have seen that internal forces do not manifest in the torque angular
momentum relation, even though they can contribute to both the total torque and the total
angular momentum. However, for rigid bodies, the internal forces are of the central force
type, and therefore, do not contribute to the torque or the angular momentum. Hence
the torque in the above treatment can be taken to be equal to the total torque produced by
the external forces only.


12.10 EULER’S EQUATION OF MOTION FOR RIGID BODIES 

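Consider a rigid body either rotating about a fixed point in space or moving in any manner
with a body frame attached to the centre of mass. The angular momentum $\mathbf{L}$ about origin
(in the first case the fixed point and in the second case the CM) and the instantaneous
angular velocity $\boldsymbol{\omega}$ are the quantities that refer to a fixed inertial frame. When expressed
in terms of the unit vectors of the rotating body frame, these are related as

$$ L_i = I_{ij}\,\omega_j $$

The external forces $\mathbf{F}$ and the torque $\boldsymbol{\Gamma}$ about the origin, are also defined with respect
to the inertial frames only. Therefore, Newton's second law of motion, that is,

$$ \boldsymbol{\Gamma} = \frac{d\mathbf{L}}{dt} $$

and

$$ \mathbf{F} = \frac{d\mathbf{P}}{dt} $$

where $\mathbf{P}$ is the total linear momentum of the rigid body, will be valid if d/dt is taken with
respect to a fixed inertial frame only. We know from Eq. (3.1) that

$$ \left(\frac{d}{dt}\right)_{\text{fixed}} = \left(\frac{d}{dt}\right)_{\text{rot}} + \boldsymbol{\omega}\times $$

is the relation between the time derivatives of a vector, taken with respect to a fixed and a
rotating frame of reference respectively, if the latter rotates with an instantaneous angular
velocity $\boldsymbol{\omega}$.

Therefore,

$$ \boldsymbol{\Gamma} = \left(\frac{d\mathbf{L}}{dt}\right)_{\text{rot}} + \boldsymbol{\omega}\times\mathbf{L} \tag{12.48} $$

and

$$ \mathbf{F} = \left(\frac{d\mathbf{P}}{dt}\right)_{\text{rot}} + \boldsymbol{\omega}\times\mathbf{P} \tag{12.49} $$

where $\boldsymbol{\omega}$, $\mathbf{L}$, $\mathbf{P}$, $\mathbf{F}$ and $\boldsymbol{\Gamma}$ are all expressed in terms of the instantaneous unit vectors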



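torque. Since $\boldsymbol{\Gamma} = 0$, Eq. (12.48) becomes

$$ \frac{d\mathbf{L}}{dt} + \boldsymbol{\omega}\times\mathbf{L} = 0 \tag{12.51} $$

Taking a scalar product with $\mathbf{L}$, one gets

$$ \mathbf{L}\cdot\frac{d\mathbf{L}}{dt} + \mathbf{L}\cdot(\boldsymbol{\omega}\times\mathbf{L}) = 0 $$

that is,

$$ \frac{d}{dt}(L^2) = 0 \quad\text{or}\quad L^2 = \text{const.} \tag{12.52a} $$

Thus for $\boldsymbol{\Gamma} = 0$, even though $d\mathbf{L}/dt = \mathbf{L}\times\boldsymbol{\omega} \neq 0$ (unless $\boldsymbol{\omega}$ is parallel to $\mathbf{L}$), it
is seen that $L^2$, or the magnitude of $\mathbf{L}$ remains constant. This is not surprising, because
the $\mathbf{L}$ vector is actually fixed in space (as $\boldsymbol{\Gamma} = 0$), but with respect to the body frame,
only the direction of $\mathbf{L}$ appears to change without, of course, bringing in any change in its
magnitude.

Again, if we take a scalar product with $\boldsymbol{\omega}$ on both the sides of Eq. (12.51), we obtain

$$ \boldsymbol{\omega}\cdot\frac{d\mathbf{L}}{dt} + \boldsymbol{\omega}\cdot(\boldsymbol{\omega}\times\mathbf{L}) = 0 $$

or

$$ \boldsymbol{\omega}\cdot\frac{d\mathbf{L}}{dt} = \frac{dT}{dt} = 0 $$

where the middle equality follows from $T = \frac{1}{2}I_{ij}\omega_i\omega_j$ with $I_{ij}$ symmetric and constant in the body frame, that is,

$$ T = \text{const.} \tag{12.52b} $$

Therefore, for freely rotating rigid bodies $\boldsymbol{\omega}$ and $d\mathbf{L}/dt$ are perpendicular to each other
and the rotational kinetic energy T is a constant of motion. Thus there are two constants
of motion, $L^2$ and T for rigid bodies rotating in any manner about a fixed point.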
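The two constants of motion can be watched in a numerical integration of Eq. (12.51). The sketch below assumes a body frame aligned with the principal axes, so that $\omega_i = L_i/I_i$, and uses a classical fourth-order Runge-Kutta step; the principal moments, the initial L and the step size are all made-up values for illustration.

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def l_dot(L, I):
    """Body-frame rate of change for zero torque: dL/dt = L x omega."""
    omega = tuple(L[i] / I[i] for i in range(3))
    return cross(L, omega)

def rk4_step(L, I, dt):
    k1 = l_dot(L, I)
    k2 = l_dot(tuple(L[i] + 0.5 * dt * k1[i] for i in range(3)), I)
    k3 = l_dot(tuple(L[i] + 0.5 * dt * k2[i] for i in range(3)), I)
    k4 = l_dot(tuple(L[i] + dt * k3[i] for i in range(3)), I)
    return tuple(L[i] + dt * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6.0
                 for i in range(3))

def invariants(L, I):
    """Return (L^2, T) with T = (1/2) sum L_i^2 / I_i."""
    return (sum(c * c for c in L),
            0.5 * sum(L[i] ** 2 / I[i] for i in range(3)))

I_principal = (1.0, 2.0, 3.0)   # hypothetical principal moments
L_start = (0.8, 1.0, -0.6)      # hypothetical initial body-frame components
L = L_start
start = invariants(L, I_principal)
max_shift = 0.0                 # largest change seen in the first component of L
for _ in range(5000):
    L = rk4_step(L, I_principal, 0.002)
    max_shift = max(max_shift, abs(L[0] - L_start[0]))
end = invariants(L, I_principal)
print(start, end, max_shift)
```

The components of L in the body frame change appreciably, while L² and T stay fixed to within the integration error.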

Now there are mainly two ways of dealing with the dynamics of the free rotation of any
rigid body. One way is to follow Poinsot's geometrical construction (proposed by Poinsot in
1834), and the other is the analytical method of Euler. Of course, the Lagrangian approach
would be yet another method of tackling the same problem. We shall see how these different
methods shed light on complementary aspects of the same problem.


12.13 POINSOT’S METHOD OF GEOMETRICAL CONSTRUCTION 

Let us assume that a rigid body is rotating freely, that is, without any external torque acting
on it, about a fixed point, say its centre of mass. Hence, the kinetic energy of its rotation,
T, is a constant of motion. From the equation

$$ T = \frac{1}{2}I_{ij}\,\omega_i\omega_j $$




where T is a constant and the summation over i, j is implied, we can write

$$ I_{ij}\,\frac{\omega_i}{\sqrt{2T}}\,\frac{\omega_j}{\sqrt{2T}} = 1 \tag{12.53} $$

This equation looks like the equation of the ellipsoid of inertia provided we identify
$\boldsymbol{\omega}/\sqrt{2T}$ with $\mathbf{x}$ in Eq. (12.24), that is,

$$ x_i = \frac{\omega_i}{\sqrt{2T}} \tag{12.54} $$

Now, since T is constant, the surface of this inertial ellipsoid plotted in terms of the
position vector $\mathbf{x}$ as defined above, must represent a surface of constant T, and by definition
of $\mathbf{x}$ the direction of $\mathbf{x}$ is the same as that of the instantaneous angular velocity $\boldsymbol{\omega}$. The
distance from the centre of this inertial ellipsoid to any point on its surface is simply $|\boldsymbol{\omega}|/\sqrt{2T}$.
This inertial ellipsoid is depicted in Fig. 12.10.

Fig. 12.10 Poinsot's geometrical construction for the motion of inertial
ellipsoid in the $\boldsymbol{\omega}$-space

Let us elaborate on this new model of the inertial ellipsoid. It is an inertial ellipsoid
because its equation contains all the elements of the inertial tensor. This inertial tensor
is about the same point as the fixed point in the body about which the free rotation has
been assumed to take place. If we fully expand Eq. (12.53), we can see that the space in
which this equation stands for an ellipsoid is the space of instantaneous angular velocity.
The higher the amount of rotational kinetic energy, the bigger is the size of this inertial
ellipsoid. For a given constant kinetic energy of rotation, the longest axis of the ellipsoid




(that is, the highest value of the instantaneous angular velocity) should correspond to the
direction in the actual body of the smallest moment of inertia, which has to be one of the
principal axes of the body about the point under consideration. This is sensible because
if the body is rotating about an axis corresponding to the least moment of inertia, it has
to rotate very fast (which means that it should point along the longest radius vector of
this ellipsoid) in order to maintain the same kinetic energy of rotation. So if we choose
any arbitrary point P on the ellipsoid and say that this point represents the instantaneous
dynamical state of the rotating rigid body, it would immediately mean that the radius vector
OP which corresponds to a unique direction in the actual body (because the principal axes
of MI of the body are in some definite manner aligned with the geometrical axes of this
ellipsoid) is the direction about which the body is currently rotating, and the length of OP
is simply $|\boldsymbol{\omega}|/\sqrt{2T}$. If the freely rotating body changes the direction of $\boldsymbol{\omega}$ with time, as it
usually should, the point P would be shifting on the surface of the ellipsoid, but it cannot
go outside this surface because it maintains a constant rotational kinetic energy.

Now from the standard expression for T

$$ \frac{\partial T}{\partial \omega_i} = I_{ij}\,\omega_j = L_i $$

A geometrical interpretation of this result in terms of the above inertial ellipsoid would be
that the angular momentum vector

$$ \mathbf{L} = \nabla_{\omega}T = \frac{1}{\sqrt{2T}}\,\nabla_{x}T \tag{12.55} $$

where $\mathbf{x}$ is defined through Eq. (12.54). By Eq. (12.55) we see that $\mathbf{L}$ is directed
perpendicular to the surface of constant kinetic energy depicted in the space of angular
velocity (note that $\mathbf{x} \leftrightarrow \boldsymbol{\omega}$). Since the surface of the above kind of inertial ellipsoid
represents a constant T surface, the normal to this ellipsoid at the point indicates the
direction of the angular momentum vector $\mathbf{L}$. Again, since this is a case of free rotation,
$\mathbf{L}$ is fixed in space, in direction and magnitude. But with respect to the body frame,
the direction (but not the magnitude) changes as the body rotates. As the body rotates,
its inertial ellipsoid must also rotate with it keeping, however, its origin fixed. Since the
directions of the normal and the radius vector at any arbitrary point P on the ellipsoid are
not parallel, the directions of $\mathbf{L}$ and $\boldsymbol{\omega}$ are also not parallel. This is all expected. For a
spherical top, the moment of inertia tensor reduces to a single scalar, the inertial ellipsoid
becomes spherical, and hence $\mathbf{L}$ and $\boldsymbol{\omega}$ must point in the same direction.

Let us now draw a tangent plane to the surface of the inertial ellipsoid at the point P (see
Fig 12.10). The perpendicular distance of this plane from the origin of the inertial ellipsoid
is given by

$$ ON = OP\cos(\angle\,\boldsymbol{\omega}, \mathbf{L})
      = \frac{|\boldsymbol{\omega}|}{\sqrt{2T}}\,\frac{\boldsymbol{\omega}\cdot\mathbf{L}}{|\boldsymbol{\omega}|\,|\mathbf{L}|}
      = \frac{\sqrt{2T}}{|\mathbf{L}|} $$

Since T and $|\mathbf{L}|$ are constants of motion, ON is fixed for all time, and since the normal ON lies
along the fixed direction of $\mathbf{L}$, the tangent plane is also fixed in space for all time, that is, it
must serve as an immovable or invariable plane for the dynamical study of free rotation. It



does not change with time, although the inertial ellipsoid would change its orientation with 
time. In other words, the ellipsoid of inertia of any rigid body under free rotation always 
touches a fixed plane, known as the invariable plane, the perpendicular to which drawn
from the origin points always toward the fixed direction of the angular momentum of the 
body. This was the geometrical picture of a freely rotating rigid body suggested by Poinsot 
in 1834. 
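The distance of the tangent plane from the centre of the ellipsoid can be computed directly from the geometry. The sketch below assumes a made-up symmetric inertia tensor and angular velocity; it places the point P at x = ω/√(2T), takes the normal to the ellipsoid there along the gradient of $I_{ij}x_ix_j$ (which is parallel to L), and compares the resulting distance with √(2T)/|L|.

```python
import math

def mat_vec(A, v):
    return [sum(A[i][j] * v[j] for j in range(3)) for i in range(3)]

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

# Illustrative symmetric, positive definite inertia tensor and angular velocity.
I = [[3.0, 0.4, -0.2],
     [0.4, 2.0, 0.3],
     [-0.2, 0.3, 1.5]]
omega = [0.5, -1.2, 0.8]

L = mat_vec(I, omega)                    # L_i = I_ij omega_j
two_T = dot(omega, L)                    # 2T = omega . L
x = [w / math.sqrt(two_T) for w in omega]

on_ellipsoid = dot(x, mat_vec(I, x))     # should equal 1 by Eq. (12.53)

grad = mat_vec(I, x)                     # gradient direction of I_ij x_i x_j (factor 2 dropped)
norm = math.sqrt(dot(grad, grad))
normal = [g / norm for g in grad]        # unit normal at P, parallel to L

ON_geometric = dot(x, normal)            # distance from O to the tangent plane at P
ON_formula = math.sqrt(two_T) / math.sqrt(dot(L, L))
print(on_ellipsoid, ON_geometric, ON_formula)
```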

The fact that both $\mathbf{L}$ and the invariable plane are fixed in space has the following
implication. Since the invariable plane has to meet tangentially the inertial ellipsoid
at some or other point, and since this point of contact determines the direction of the
instantaneous angular velocity of the body (that is, the $\boldsymbol{\omega}$ vector), and since $\boldsymbol{\omega}$ changes
with time, this point of contact on the surface of the inertial ellipsoid ought to change with
time. However, we know that the invariable plane has to remain fixed in space for all time,
and therefore the point of contact can change only if the inertial ellipsoid itself changes its
orientation with time (keeping its origin fixed). Further, since $\boldsymbol{\omega}$ changes continuously with
time, and since the point of contact lies on the instantaneous axis of rotation and is therefore
momentarily at rest, the inertial ellipsoid has to roll over the invariable plane without slipping. Note that
every point on the inertial ellipsoid corresponds to a different direction of $\boldsymbol{\omega}$ with respect
to the body frame.

As the ellipsoid of inertia rolls without slipping over the invariable plane, the point of
contact P between the ellipsoid of inertia and the invariable plane indicating the direction
of instantaneous $\boldsymbol{\omega}$ with respect to the body as well as the fixed plane, traces a curve on
the ellipsoid of inertia, called a polhode. Similarly, a curve is also traced on the invariable
plane and is called a herpolhode.

12.13.1 Nature of Polhodes and Herpolhodes 

The ellipsoid of inertia has got three principal axes. If the principal moments of inertia are
labeled as A < B < C, the corresponding principal axes $x_1$, $x_2$, $x_3$ would correspond to
the semiaxes satisfying a > b > c for its ellipsoid of inertia. Now if the body is rotating
with some angular velocity $\boldsymbol{\omega}$ about the principal axis $x_1$, then $\boldsymbol{\omega}$ must pass through $x_1$
and the ellipsoid of inertia must touch the invariable plane at the tip of $x_1 = a$. Since
$x_1 = a$ is the farthest point on the ellipsoid from its centre, and since the invariable plane is
fixed, the ellipsoid of inertia cannot move without detaching itself from the invariable plane.
Hence only one solution is permissible, which is that $\boldsymbol{\omega}$ will never shift from the direction
of its principal axis $x_1$. Also at this point the normal to the invariable plane coincides with
the radius vector so that $\mathbf{L}$ is parallel to $\boldsymbol{\omega}$. Thus, the body rotates about its principal
axis corresponding to the smallest principal moment of inertia, in quite a stable manner.
A similar argument is also valid for the third principal axis having the largest principal
moment of inertia (and hence the shortest semiaxis for its ellipsoid of inertia). The rotation
about this axis is also stable, with the direction of $\mathbf{L}$ and $\boldsymbol{\omega}$ remaining coincident all the
time. The rotation about the intermediate principal axis is, however, very unstable.
Having the length of its semiaxis intermediate between the two extreme ones, the point
of contact can freely pass through this axis and the direction of $\boldsymbol{\omega}$ can change both with
respect to the body and with respect to any outside inertial frame.

If the direction of the instantaneous angular velocity, $\boldsymbol{\omega}$, is somewhat away from the




first and the third axes, the polhode is a closed curve encircling the nearest pole of the 
principal axes. Only at the pole of the intermediate principal axis, and at no other point, 
can two polhodes intersect. Polhodes are depicted in Fig. 12.11 on the surface of the inertial 
ellipsoid. 





Fig. 12.11 Polhodes drawn on the surface of the inertial ellipsoid 


Herpolhodes are drawn on the invariable plane as loci of the tip of the instantaneous $\boldsymbol{\omega}$
vector. From Fig. 12.11 one can see that

$$NP^2 = OP^2 - ON^2 = \omega^2 - \frac{4T^2}{L^2}$$

Since $L$ and $T$ are constants, and $\omega$ is a measure of the radial distance from the centre to
a point on the inertial ellipsoid, $\omega^2$ and hence $NP^2$ are bounded. Therefore, herpolhodes
are in general bounded rather than closed curves (see Fig. 12.12).

12.14 ANALYTICAL METHOD OF EULER FOR FREE ROTATION AND 
THE THIRD INTEGRAL OF MOTION 

In this method one usually starts with Euler’s equation for rotational motion (Eq. (12.51))
which provides us with the two Eulerian integrals of motion, namely $L^2 = $ constant and
$2T = \mathbf{L}\cdot\boldsymbol{\omega} = $ constant, for any rigid body freely rotating about a fixed point, $\boldsymbol{\omega}$ and $\mathbf{L}$
being expressed in terms of the unit vectors of any suitable body frame of reference.

Euler’s equations of motion for a freely rotating rigid body referred to a body frame are
given by

$$\frac{d\mathbf{L}}{dt} + \boldsymbol{\omega}\times\mathbf{L} = 0 \tag{12.51}$$

If we choose the principal axes of the rigid body about the origin as the rectangular Cartesian
axes of the body frame and the principal moments of inertia as $A \neq B \neq C$, then with
$\boldsymbol{\omega} = (\omega_1, \omega_2, \omega_3)$ and $\mathbf{L} = (A\omega_1, B\omega_2, C\omega_3)$ with respect to the principal axes, Euler’s





Fig. 12.12 Herpolhodes drawn on the invariable plane


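equations of motion become

$$A\dot{\omega}_1 = (B - C)\,\omega_2\omega_3$$

$$B\dot{\omega}_2 = (C - A)\,\omega_3\omega_1 \tag{12.56}$$

$$C\dot{\omega}_3 = (A - B)\,\omega_1\omega_2$$

supplemented by the two Eulerian integrals of motion

$$L^2 = A^2\omega_1^2 + B^2\omega_2^2 + C^2\omega_3^2 \tag{12.57}$$

and

$$2T = A\omega_1^2 + B\omega_2^2 + C\omega_3^2 \tag{12.58}$$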

One can eliminate $\omega_3$ from Eqs (12.57) and (12.58) and from the resulting equation one
can write $\omega_2$ as a function of $\omega_1$. Substituting this $\omega_2 = \omega_2(\omega_1)$ in either of Eqs (12.57)
and (12.58) one obtains $\omega_3$ as a function of $\omega_1$, that is, $\omega_3 = \omega_3(\omega_1)$. Plugging these
expressions for $\omega_3$ and $\omega_2$ in the first of Eqs (12.56) one gets a first order differential
equation in $\omega_1$ only, the general solution of which is available in the form of an elliptic
integral. Repeating similar procedures for $\omega_2$ and $\omega_3$ one can find the complete solutions
for $\omega_1$, $\omega_2$ and $\omega_3$ as functions of $A$, $B$, $C$, $L^2$, $T$ and the constants of integration or the
initial value of $\boldsymbol{\omega}$.

Without going through the details of the solutions, one can, however, see that the rotations
about the principal axes corresponding to the largest and smallest principal moments of
inertia are stable and the remaining one is unstable. Let $\omega_1$, $\omega_2$ and $\omega_3$ correspond
to the components along the axes for which we have the principal moments of inertia $A <
B < C$ respectively. A particular solution such as $\omega_1 = \omega_2 = 0$ and $\omega_3 \neq 0$ satisfies
Eqs (12.56), which then give $\dot{\omega}_3 = 0$ or $\omega_3 = $ const. The stability of such a solution can,
however, be judged by allowing a small perturbation in $\omega_1$ and $\omega_2$ and noting how the
perturbation evolves with time. For $A < B < C$, it can be shown that the perturbation
grows with time for the initial conditions $\omega_1 \to 0$, $\omega_3 \to 0$ and $\omega_2 \neq 0$, that is, for
rotation about the intermediate axis.
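
The stability statement and the two Eulerian integrals can be checked numerically. The following is only a minimal sketch, not part of the original text: it integrates Eqs (12.56) with a hand-written fourth-order Runge-Kutta step, and the function names (`euler_rhs`, `rk4_step`, `integrals`, `run`), the moments $A, B, C = 1, 2, 3$ and the perturbation size are all illustrative assumptions.

```python
import math

def euler_rhs(w, A, B, C):
    """Time derivatives of (w1, w2, w3) from Euler's free-rotation equations (12.56)."""
    w1, w2, w3 = w
    return ((B - C) * w2 * w3 / A,
            (C - A) * w3 * w1 / B,
            (A - B) * w1 * w2 / C)

def rk4_step(w, dt, A, B, C):
    """Advance w by one classical fourth-order Runge-Kutta step."""
    def shifted(k, f):
        return tuple(x + f * dt * kx for x, kx in zip(w, k))
    k1 = euler_rhs(w, A, B, C)
    k2 = euler_rhs(shifted(k1, 0.5), A, B, C)
    k3 = euler_rhs(shifted(k2, 0.5), A, B, C)
    k4 = euler_rhs(shifted(k3, 1.0), A, B, C)
    return tuple(x + dt * (a + 2 * b + 2 * c + d) / 6.0
                 for x, a, b, c, d in zip(w, k1, k2, k3, k4))

def integrals(w, A, B, C):
    """Return (L^2, 2T) of Eqs (12.57) and (12.58) for angular velocity w."""
    w1, w2, w3 = w
    L2 = (A * w1) ** 2 + (B * w2) ** 2 + (C * w3) ** 2
    twoT = A * w1 ** 2 + B * w2 ** 2 + C * w3 ** 2
    return L2, twoT

def run(w0, A, B, C, axis, dt=1e-3, steps=20000):
    """Integrate from w0; return the final w and the largest off-axis deviation.

    `axis` is 0, 1 or 2; the deviation is the norm of the other two components.
    """
    w, worst = tuple(w0), 0.0
    for _ in range(steps):
        w = rk4_step(w, dt, A, B, C)
        off = math.sqrt(sum(x * x for i, x in enumerate(w) if i != axis))
        worst = max(worst, off)
    return w, worst

if __name__ == "__main__":
    A, B, C = 1.0, 2.0, 3.0      # illustrative moments with A < B < C
    eps = 1e-3                   # small initial perturbation (assumed value)
    for axis in range(3):
        w0 = [eps, eps, eps]
        w0[axis] = 1.0
        _, worst = run(w0, A, B, C, axis)
        print(f"rotation near axis {axis + 1}: largest off-axis component = {worst:.4f}")
```

With these assumed values the off-axis components should stay of the order of the initial perturbation for the first and third axes and grow to order unity for the intermediate axis, while $L^2$ and $2T$ stay constant to within the integration error.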

Therefore it is clear that if there are only two integrals of motion, they alone cannot give 




solutions to $\omega_1$, $\omega_2$ and $\omega_3$ simultaneously. One then has to solve Euler’s equations of
motion quite explicitly in order to obtain the full solution. This is the reason why it is
so important to have a third integral of motion for motions of the rigid bodies. It requires
introduction of further symmetries to the problem. As we shall see later, even in the presence
of gravitational torques, the rotation of symmetric tops about a fixed point in the body has
two obvious integrals of motion. Long searches for the third integral of motion revealed
the existence of the Poincaré integral (1892), Hess integral (1890), Kowalevski integral
(1889), Tshapliguine integral (1901), and so on, under different restrictions imposed upon
the symmetry of the body and in the choice of the fixed point about which the rotation
should take place. Some of these are left as exercises (see problem 12.12).

In the next section we are planning to use the above analytical method for studying the 
free rotation of a symmetrical rigid body, such as the earth, for which the principal moments 
of inertia are A = B < C. The c-axis, which is the axis of the largest moment of inertia, 
is approximately the same as the axis of its geometrical symmetry. We show that the torque 
free rotation of the earth leads to a kind of wobbling in space, historically known as the 
Chandler wobbling of the earth, duly named after S. C. Chandler, who did the pioneering 
observational work on this phenomenon about a hundred years ago. 


12.15 CHANDLER WOBBLING OF THE EARTH 

The free rotation of a symmetric body such as the earth having its principal moments of 
inertia A = B < C is described by Euler’s equations of motion (Eq. (12.56)). Defining 
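$k = (C - A)/A$, these equations become,

$$\dot{\omega}_1 + k\omega_3\omega_2 = 0, \qquad \dot{\omega}_2 - k\omega_3\omega_1 = 0 \qquad \text{and} \qquad \dot{\omega}_3 = 0 \tag{12.59}$$

where $\omega_1$, $\omega_2$ and $\omega_3$ are the components of $\boldsymbol{\omega}$ along the principal axes of the MI ellipsoid.
The third equation is easily integrable to

$$\omega_3 = \text{const.}$$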

that is, the component of the angular velocity in the direction of the inertial symmetry axis 
of the earth does not change with time. 

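Using the fact that $\omega_3$ is constant and differentiating the first two of Eqs (12.59) with
respect to time we get

$$\ddot{\omega}_1 = -k\omega_3\dot{\omega}_2 = -(k^2\omega_3^2)\,\omega_1$$

$$\ddot{\omega}_2 = k\omega_3\dot{\omega}_1 = -(k^2\omega_3^2)\,\omega_2 \tag{12.60}$$

The general solution of Eqs (12.60) can be written in the form

$$\omega_1 \pm i\omega_2 = a_\omega\, e^{\pm i(k\omega_3 t + \delta)} \tag{12.61}$$

Here $a_\omega$ and $\delta$ are two constants of integration. Since

$$\omega_1^2 + \omega_2^2 = (\omega_1 + i\omega_2)(\omega_1 - i\omega_2) = a_\omega^2$$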




we have

$$\omega^2 = \omega_1^2 + \omega_2^2 + \omega_3^2 = a_\omega^2 + \omega_3^2 = \text{const.} \tag{12.62}$$

because $a_\omega$ and $\omega_3$ are both constants of motion. Therefore, the tip of $\boldsymbol{\omega}$ (whose magnitude
remains constant in time) is precessing about the inertial symmetry axis of the earth with
$\omega_3$ remaining constant.

Let us define

$$\tan\alpha_b = \frac{\sqrt{\omega_1^2 + \omega_2^2}}{\omega_3} \tag{12.63}$$

Obviously $\alpha_b$ is the angle between $\boldsymbol{\omega}$ and the symmetry axis. Since this angle remains
constant, the direction of the vector $\boldsymbol{\omega}$ must describe a cone of constant semiangle $\alpha_b$ about
the inertial symmetry axis of the earth.

From Eq. (12.61), the angular velocity of precession of $\boldsymbol{\omega}$ about the symmetry axis of
the body is given by

$$\Omega_b = k\omega_3 = \left(\frac{C - A}{A}\right)\omega_3 \tag{12.64}$$

For the earth, it is known that $A = B = 0.329591\,Ma^2$, $C = 0.330674\,Ma^2$, giving
$k = (C - A)/A \simeq 1/304.4$. This gives, with the knowledge of $\omega_3$ for earth as $7.292115
\times 10^{-5}$ rad/s, for $\Omega_b$ a value of $2.39529 \times 10^{-7}$ rad/s, which corresponds to a period of
about 303.6 days.
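
The arithmetic behind Eq. (12.64) can be reproduced in a few lines. This is a sketch only, not part of the original text: the function name `free_wobble` and the conversion of the period into days of 86400 s are assumptions, while the numerical inputs are the values quoted above.

```python
import math

# Values quoted in the text, in units of M a^2 and rad/s.
A_OVER_MA2 = 0.329591
C_OVER_MA2 = 0.330674
OMEGA3 = 7.292115e-5

def free_wobble(A, C, omega3):
    """Return (k, Omega_b, period in days) for a symmetric body with A = B < C."""
    k = (C - A) / A                    # defined just before Eq. (12.59)
    omega_b = k * omega3               # Eq. (12.64)
    period_days = 2.0 * math.pi / omega_b / 86400.0   # assumes days of 86400 s
    return k, omega_b, period_days

if __name__ == "__main__":
    k, omega_b, period = free_wobble(A_OVER_MA2, C_OVER_MA2, OMEGA3)
    print(f"1/k = {1.0 / k:.1f}")
    print(f"Omega_b = {omega_b:.5e} rad/s")
    print(f"period = {period:.1f} days")
```

Small differences from the figures in the text arise only from rounding of $k$.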

Thus from the solution (12.61) it is apparent that the tip of the earth’s angular velocity
of rotation should describe a conical motion about the geometrical symmetry axis of the
earth and that each revolution should take about 304 days. The sense of rotation of the
tip is the same as that of the product $k\omega_3$. Since both $k$ and $\omega_3$ are positive in the case
of the earth, the tip of $\boldsymbol{\omega}$ should move in an anticlockwise manner about the inertial or
geometrical north pole of the earth when viewed from above the north pole.

In the second half of the last century, S. C. Chandler from Cambridge, U.S.A. collected
the observational evidence for such a free wobbling of the earth. He found that the earth’s
axis of rotation is precessing about the geometrical pole of the earth at a separation of only
a few meters and that the period of wobbling is not 304 days but much longer, about 435
days. For this measurement he did not have to go to the north or south pole of the earth.
What he had to do was to measure very carefully the variation of the geographical latitude 
(same as the local altitude of the pole) of any place over a few years. From very accurate 
astronomical observations of the altitude of reference stars he found that the amplitude of 
wobbling in the latitude of a place was about 0.1 to 0.3 seconds of arc, a typical plot for the 
polar wandering being shown in Fig 12.13. The wobbling does not seem to be very regular 
and the average period over the past one hundred years has now been found to be about 
433 days. Nevertheless the observed period of wobbling grossly disagrees with the expected 
period. 


This discrepancy in the period of wobbling is believed to be due to the plasticity of the 
earth’s interior, seasonal movement of the atmospheric masses, oceanic currents, response 
of the whole earth to the tidal forces due to the sun and the moon, etc., each causing a 
semi-irregular change in the moment of inertia of the earth. A proper quantitative analysis 




Fig. 12.13 Small wandering of the true inertial north pole of the earth about
its mean position $P_0$


of all these effects is such a formidable task that even today we cannot claim that this 
discrepancy has been exactly explained. 

12.16 MOTION OF $\boldsymbol{\omega}$ IN SPACE FOR FREE ROTATION

We have already found that, with respect to an observer fixed to the body, the instantaneous
$\boldsymbol{\omega}$ describes a cone about the inertial symmetry axis of the body with the semiangle

$$\alpha_b = \cos^{-1}\left(\frac{\omega_3}{\omega}\right)$$

and with a period

$$\frac{2\pi}{\Omega_b} = \frac{2\pi}{\omega_3}\,\frac{A}{C - A}$$

This cone may be termed the body cone as it is traced on the rotating body.

Now let us see how an observer fixed in space would view this motion of $\boldsymbol{\omega}$. In a fixed
inertial frame, the vector angular momentum $\mathbf{L}$ is fixed, as the net applied torque is zero.
We also know that

$$2T = \boldsymbol{\omega}\cdot\mathbf{L} = \text{const.}$$




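it is easy to check from Eq. (12.63) that

$$\tan\alpha = \frac{A}{C}\,\tan\alpha_b$$

This means that if both $\alpha$ and $\alpha_b$ are acute angles, then

$$\alpha > \alpha_b \quad \text{for} \quad C < A$$

and

$$\alpha < \alpha_b \quad \text{for} \quad C > A$$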



Fig. 12.14 Motion of body cones about the fixed space cones, showing the precession of both
$\boldsymbol{\omega}$ and $\hat{k}$ about the direction of $\mathbf{L}$, for two cases (a) prolate spheroid ($C < A$) and (b) oblate
spheroid


So for a prolate spheroid, $C < A$, and hence $\alpha = \alpha_b + \alpha_s$, that is, the two cones
must touch externally as shown in Fig. 12.14a. On the other hand, for an oblate spheroid,
such as the earth or a coin, $C > A$, and therefore, $\alpha = \alpha_b - \alpha_s$, that is, the space
cone being smaller than the body cone should appear to touch on the inside surface of the
body cone (Fig. 12.14b). However, it is practically impossible to insert the solid space
cone inside the solid body cone. So, for Fig. 12.14b, a possible configuration would be to
have the body cone as a hollow cone and the solid space cone touching from the hollow
side of the body cone. Such a hollow body cone must have its axis pointing downward
if, by definition, a solid space cone has its axis pointing upward ($\mathbf{L}$). With this revised
configurational interpretation, now $\alpha$ can have only one choice, namely, $\alpha = \alpha_b + \alpha_s$,




with $\alpha_s$ always acute angled, because physically the direction of the angular momentum
vector $\mathbf{L}$ can never be very far from the direction of $\boldsymbol{\omega}$. So $\alpha_b$ can remain acute angled
for Fig. 12.14a but has to be obtuse angled for Fig. 12.14b. These rearrangements are
perfectly consistent with the formula $\tan^2\alpha = (A^2/C^2)\tan^2\alpha_b$, which implies $\alpha > \alpha_b$,
or $\alpha = \alpha_b + \alpha_s$, for both the cases of $C < A$ and $C > A$. In the latter case, both
$\alpha$ and $\alpha_b$ are obtuse angled (see Fig. 12.14b). The solid arrows near the circumference of
the body cones represent the sense of its actual rotation and the dotted arrows indicate the
sense of actual motion of $\boldsymbol{\omega}$ along the periphery of the respective body cones. The space
cone remains fixed all the time and the body cone merely rolls on it without slipping. When
$C < A$, the precessional motion of $\boldsymbol{\omega}$ around $\mathbf{L}$ is anticlockwise if the sense of rotation
of the body cone about its own inertial axis of symmetry $\hat{k}$ is anticlockwise. Hence the
motion is called direct. But when $C > A$, this is just the opposite and the sense of the
precessional motion of $\boldsymbol{\omega}$ around $\mathbf{L}$ becomes retrograde.

Obviously, for the free rotation of the earth it would be a case of retrograde precessional
motion. Since the general direction of both $\boldsymbol{\omega}$ and $\mathbf{L}$ for the earth is towards the north pole,
and both $\alpha$ and $\alpha_b$ have to be obtuse angled, the inertial symmetry axis $\hat{k}$ of the earth
should be directed along the south pole rather than the conventional north pole. From this
point of view, the earth’s axial rotation with respect to the above $\hat{k}$ direction is clockwise.
In Fig. 12.14b, the clockwise rotation of the body cone about the $\hat{k}$ direction would give
an anticlockwise precession of $\boldsymbol{\omega}$ around both the geometrical north pole and $\mathbf{L}$.

One can now see the conceptual correspondence between the set of polhode and herpolhode
for free rotations of symmetric tops, and the set of the body and the space cones. It is
the body cone that is responsible for the tracing of the polhode on the inertial ellipsoid, and
the space cone is responsible for the tracing of the herpolhode on the invariable plane. Since
the space cones and the body cones are closed circles, the polhodes and the herpolhodes are
also the closed circular tracings for symmetric tops. As the inertial ellipsoid rolls without
slipping on the invariable plane with $\boldsymbol{\omega}$ passing through the point of contact, the body cone
also rolls on the surface of the space cone with $\boldsymbol{\omega}$ passing through their common tangent.


12.17 WHY SHOULD A FREELY ROTATING BODY PRECESS AT ALL? 


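Mathematically speaking, for a free rotation of a rigid body, one has

$$\left.\frac{d\mathbf{L}}{dt}\right|_{\text{inertial frame}} = \boldsymbol{\Gamma}_{\text{external}} = 0 = \left.\frac{d\mathbf{L}}{dt}\right|_{\text{body frame axes}} + \left.\boldsymbol{\omega}\times\mathbf{L}\right|_{\text{body frame axes}}$$

This extra $\boldsymbol{\omega}\times\mathbf{L}$ term, which may be regarded as a pseudotorque arising out of the
noninertialness of the rotating body frame, is found to be responsible for the precession of
the freely rotating rigid body. This is mathematically secured through keeping $L^2$ and $\mathbf{L}\cdot\boldsymbol{\omega}$
constant but neither of the vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ conserved, as seen from the body frame.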

We know that with respect to any rotating frame, a number of pseudoforces arise, namely 
the centrifugal force due to the circular motion of the particle tied rigidly to the rotating 




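body, the Coriolis force due to any motion of the particle with respect to the rotating body
frame and the Eulerian force due to any inertial change in $\boldsymbol{\omega}$ with time. Thus the particles
of the rotating rigid body must experience the centrifugal force and the Eulerian force but
not the Coriolis force because the particles are fixed in position with respect to the body
frame. The total centrifugal torque experienced by the whole body is therefore given by

$$\boldsymbol{\Gamma}_{\text{centrifugal}} = \sum \mathbf{r}\times[m(\boldsymbol{\omega}\times\mathbf{r})\times\boldsymbol{\omega}]
= \sum m(\mathbf{r}\cdot\boldsymbol{\omega})(\boldsymbol{\omega}\times\mathbf{r}) - \sum m[\mathbf{r}\cdot(\boldsymbol{\omega}\times\mathbf{r})]\,\boldsymbol{\omega}$$

The second term on RHS vanishes because $\mathbf{r}$ is perpendicular to $\boldsymbol{\omega}\times\mathbf{r}$. Therefore,

$$\boldsymbol{\Gamma}_{\text{centrifugal}} = \sum m(\mathbf{r}\cdot\boldsymbol{\omega})(\boldsymbol{\omega}\times\mathbf{r}) \tag{12.67}$$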

where the sum extends over all particles of the body. 

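In order to study the nature of this centrifugal pseudotorque, we choose the z-axis of the
body frame to be along the instantaneous $\boldsymbol{\omega}$ vector, that is, $\boldsymbol{\omega} = \omega_3\hat{k}$ in which case, from
Eq. (12.67), $\Gamma_3 = 0$, but

$$\Gamma_1 = -\sum m\,r_2 r_3\,\omega_3^2 = I_{23}\,\omega_3^2$$

$$\Gamma_2 = \sum m\,r_3 r_1\,\omega_3^2 = -I_{31}\,\omega_3^2$$

Therefore,

$$\boldsymbol{\Gamma}_{\text{centrifugal}} = \omega_3^2\,(I_{23}\,\hat{i} - I_{31}\,\hat{j}) \tag{12.68}$$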

This shows that the centrifugal torque does not vanish if the products of inertia of the body 
with respect to the chosen body frame axes (that defines the k axis along the instantaneous 
axis of rotation) do not vanish. Or in other words, the rotations of the body about any of 
the principal axes will lead to the vanishing of the centrifugal pseudotorque. Hence such 
rotations being torque free, are expected to be in steady states, as we have seen earlier. 
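Now, consider

$$\left.\mathbf{L}\times\boldsymbol{\omega}\right|_{\text{body frame}} = \sum(\mathbf{r}\times\mathbf{p})\times\boldsymbol{\omega}
= \sum[m\,\mathbf{r}\times(\boldsymbol{\omega}\times\mathbf{r})]\times\boldsymbol{\omega}
= -\sum m\,\boldsymbol{\omega}\times\{\mathbf{r}\times(\boldsymbol{\omega}\times\mathbf{r})\}$$

$$= \sum m(\boldsymbol{\omega}\cdot\mathbf{r})(\boldsymbol{\omega}\times\mathbf{r}) - \sum m[\boldsymbol{\omega}\cdot(\boldsymbol{\omega}\times\mathbf{r})]\,\mathbf{r}
= \sum m(\boldsymbol{\omega}\cdot\mathbf{r})(\boldsymbol{\omega}\times\mathbf{r})$$

which is the centrifugal pseudotorque of Eq. (12.67). Identifying

$$\boldsymbol{\Gamma}_{\text{centrifugal}} = \left.\frac{d\mathbf{L}}{dt}\right|_{\text{body frame}}$$

we can write

$$\boldsymbol{\Gamma}_{\text{centrifugal}} + \left.\boldsymbol{\omega}\times\mathbf{L}\right|_{\text{body frame axes}} = 0$$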




or

$$\left.\frac{d\mathbf{L}}{dt}\right|_{\text{body frame axes}} + \left.\boldsymbol{\omega}\times\mathbf{L}\right|_{\text{body frame axes}} = 0 \tag{12.51}$$

which gives back Euler’s equations of motion for a freely rotating rigid body. One can also
verify that the total pseudotorque due to the Eulerian force, $\sum m\,\mathbf{r}\times(\mathbf{r}\times\dot{\boldsymbol{\omega}})$, is precisely
equal to $-d\mathbf{L}/dt$ in the body frame; hence the sum of the two pseudotorques vanishes in
the absence of any external torque.
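
The vector identities used above can be spot-checked on an arbitrary collection of point masses. The sketch below is illustrative only and not part of the original text: the helper names (`centrifugal_torque`, `angular_momentum`, `product_of_inertia`, `random_body`) and the randomly generated body are assumptions, and the products of inertia follow the sign convention $I_{ij} = -\sum m\,r_i r_j$ implied by Eq. (12.68).

```python
import random

def cross(a, b):
    """Vector product a x b of two 3-tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    """Scalar product of two 3-tuples."""
    return sum(x * y for x, y in zip(a, b))

def centrifugal_torque(particles, w):
    """Sum over particles of m (r.w)(w x r), as in Eq. (12.67)."""
    total = [0.0, 0.0, 0.0]
    for m, r in particles:
        wxr = cross(w, r)
        rw = dot(r, w)
        for i in range(3):
            total[i] += m * rw * wxr[i]
    return tuple(total)

def angular_momentum(particles, w):
    """L = sum of m r x (w x r) about the fixed point."""
    total = [0.0, 0.0, 0.0]
    for m, r in particles:
        term = cross(r, cross(w, r))
        for i in range(3):
            total[i] += m * term[i]
    return tuple(total)

def product_of_inertia(particles, i, j):
    """Off-diagonal inertia element I_ij = -sum m r_i r_j (i != j)."""
    return -sum(m * r[i] * r[j] for m, r in particles)

def random_body(n, seed=0):
    """A hypothetical rigid body: n point masses at random positions."""
    rng = random.Random(seed)
    return [(rng.uniform(0.5, 2.0),
             tuple(rng.uniform(-1.0, 1.0) for _ in range(3)))
            for _ in range(n)]

if __name__ == "__main__":
    body = random_body(20, seed=1)
    w = (0.3, -0.7, 1.1)
    print("centrifugal torque:", centrifugal_torque(body, w))
    print("L x omega         :", cross(angular_momentum(body, w), w))
```

The two printed vectors should agree to rounding error, and choosing $\boldsymbol{\omega}$ along the z-axis should reproduce the components of Eq. (12.68).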

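Similarly one can calculate the total amount of the centrifugal force that is experienced
by the particles of the rotating rigid body.

$$\mathbf{F}_{\text{centrifugal}} = \sum m(\boldsymbol{\omega}\times\mathbf{r})\times\boldsymbol{\omega}
= -\sum m\,\boldsymbol{\omega}\times(\boldsymbol{\omega}\times\mathbf{r})
= \sum m\,\omega^2\,\mathbf{r} - \sum m(\boldsymbol{\omega}\cdot\mathbf{r})\,\boldsymbol{\omega}
= \omega^2 M\mathbf{R}_{cm} - M(\boldsymbol{\omega}\cdot\mathbf{R}_{cm})\,\boldsymbol{\omega}$$

Therefore, $\mathbf{F}_{\text{centrifugal}}$ vanishes if $\mathbf{R}_{cm} = 0$, that is, if the rotation is taking
place about the centre of mass of the rigid body (or, more generally, if $\mathbf{R}_{cm}$ lies along $\boldsymbol{\omega}$).
Otherwise, if pure rotation is taking place about a point other than the centre of mass, then

$$\mathbf{F}_{\text{centrifugal}} = \sum m(\boldsymbol{\omega}\times\mathbf{r})\times\boldsymbol{\omega}
= \sum m\,\mathbf{v}\times\boldsymbol{\omega} = \mathbf{P}_{cm}\times\boldsymbol{\omega}
= -\boldsymbol{\omega}\times\mathbf{P}_{cm}$$

Therefore we get, remembering that

$$\mathbf{F}_{\text{centrifugal}} = \left.\frac{d\mathbf{P}_{cm}}{dt}\right|_{\text{body frame}}$$

$$\frac{d\mathbf{P}_{cm}}{dt} + \boldsymbol{\omega}\times\mathbf{P}_{cm} = 0 \tag{12.69}$$

which is, again, Euler’s equation for the motion of the centre of mass of any rotating rigid
body, expressed in the unit vectors of the rotating body frame.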




12.18 STEADY PRECESSION OF A UNIAXIAL BODY (SYMMETRIC TOP) 
UNDER THE ACTION OF AN EXTERNAL TORQUE 

Let $\hat{k}$ be the unit vector along the symmetry axis of a uniaxial body, say a ‘top’ (see Fig.
12.15), which makes an angle $\alpha$ with the inertial z-axis, the latter being aligned in the local
vertical direction. O is the bottom tip of the body about which the body is rotating with
an instantaneous angular velocity $\boldsymbol{\omega}$. The distance of the centre of mass of the body (which
is the same as centre of gravity) from O is $h$.





Fig. 12.15 A symmetric top under the action of gravitational torque about 
a fixed point O in the body 


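If the body’s symmetry axis, that is, the direction of $\hat{k}$ is changing with time with respect
to the inertial frame, the tip of $\hat{k}$ has a motion with respect to the inertial frame, given by
(see Eq. (3.2))

$$\frac{d\hat{k}}{dt} = \boldsymbol{\omega}\times\hat{k}$$

so that

$$\hat{k}\times\frac{d\hat{k}}{dt} = \hat{k}\times(\boldsymbol{\omega}\times\hat{k}) = \boldsymbol{\omega} - (\boldsymbol{\omega}\cdot\hat{k})\,\hat{k} = \boldsymbol{\omega} - \omega_3\hat{k}$$

Therefore, the angular velocity of the body $\boldsymbol{\omega}$ is given by

$$\boldsymbol{\omega} = \hat{k}\times\frac{d\hat{k}}{dt} + \omega_3\hat{k} \tag{12.70}$$

Hence the angular momentum $\mathbf{L}$ is

$$\mathbf{L} = A\,\hat{k}\times\frac{d\hat{k}}{dt} + C\omega_3\hat{k} \tag{12.71}$$

and the total torque about the origin

$$\boldsymbol{\Gamma} = \frac{d\mathbf{L}}{dt} = A\,\hat{k}\times\frac{d^2\hat{k}}{dt^2} + C\omega_3\frac{d\hat{k}}{dt} + C\dot{\omega}_3\hat{k} \tag{12.72}$$

The angular momentum, torque and the angular velocity are all referred to the inertial
frame.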





Fig. 12.16 Roll, pitch and yaw motions of an aeroplane

Roll is the rotation of the body about the longitudinal direction of the motion. For 
example, while swimming straight ahead if one decides to turn one’s posture sidewise from 
supine to prone and back to supine, one has to perform a complete roll. 

Pitch is the rotation about the transverse horizontal axis of the body. When one decides 
to dive up and down, so that one’s nose goes up and down, or say a car wants to climb up 
or down on a hilly road, the rotation involved would be a pitch. 

Yaw is the rotation about the transverse vertical axis of the body, so that the nose of the 
aircraft or swimmer can move sideways. 

All these three basic types of rotations are essential for manoeuvring the orientation of 
any aircraft or kite, etc. with respect to the general direction of its translational motion. 
Since these three basic rotations are mutually independent of one another, and by Euler’s
theorem one requires three degrees of freedom for general rotation, Euler had defined the
so-called three Eulerian rotations in terms of roll, pitch and yaw. Thus any arbitrary
spatial orientation of any rigid body can be thought of as a succession of yaw, pitch
and roll about the centre of mass, or any given fixed point in the body.

A similar statement is also valid for the instantaneous angular velocity of any rigid body, 
that is, the latter can be thought of as a linear combination of the angular velocities due to 
yaw, pitch and roll. In order to see this, we prove in the following section a basic theorem 
on the vector addition of two angular velocities about a given point. 


12.20 ADDITION OF TWO ANGULAR VELOCITIES 

If a frame $S_2$ has an angular velocity $\boldsymbol{\omega}_2$ with respect to $S_1$ about their common origin O,
and the frame $S_1$ has an angular velocity $\boldsymbol{\omega}_1$ with respect to an inertial frame $S_0$ about
the same common origin, then $S_2$ has an angular velocity of $\boldsymbol{\omega} = \boldsymbol{\omega}_1 + \boldsymbol{\omega}_2$ with respect
to the inertial frame $S_0$.

To see this, let P be a point rigidly fixed to $S_2$. The velocity of P in $S_2$ is obviously zero,




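that is, $\mathbf{v}_2 = 0$. The velocity of P in $S_1$ is

$$\mathbf{v}_1 = \mathbf{v}_2 + \boldsymbol{\omega}_2\times\mathbf{OP} = \boldsymbol{\omega}_2\times\mathbf{OP}$$

The velocity of P in $S_0$ is

$$\mathbf{v}_0 = \mathbf{v}_1 + \boldsymbol{\omega}_1\times\mathbf{OP}
= \boldsymbol{\omega}_2\times\mathbf{OP} + \boldsymbol{\omega}_1\times\mathbf{OP}
= (\boldsymbol{\omega}_1 + \boldsymbol{\omega}_2)\times\mathbf{OP}$$

This must be equal to $\boldsymbol{\omega}\times\mathbf{OP}$, if $\boldsymbol{\omega}$ is defined to be the angular velocity of P in $S_0$. Since
P is an arbitrary point, we must have

$$\boldsymbol{\omega} = \boldsymbol{\omega}_1 + \boldsymbol{\omega}_2 \tag{12.79}$$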


12.21 EULERIAN ANGLES 

Let us consider a fixed inertial frame $S_0$ with its $z_0$-axis pointing vertically upward. First
we give a yaw $\phi$ about the $z_0$-axis so that the frame $S_0$ changes to $S_1$ with their $z_0$-$z_1$
axes common and the axes $x_0$ and $y_0$ are rotated by an angle $\phi$ to assume the position of
$x_1$ and $y_1$ axes in $S_1$ (see Fig. 12.17). Then we give a pitch $\theta$ about the new $x_1$-axis so
that the frame $S_2$ now has $x_1$-$x_2$ as the common axis, called the line of nodes, and $y_2$-axis
separates from $y_1$-axis and $z_2$-axis from $z_1$-axis, each by the same angle $\theta$. Then we give a
roll (or spin) about the $z_2$-axis by an angle $\psi$, keeping $z_3$ and $z_2$ axes coincident, but the $x_2$
and $y_2$ axes rotated by $\psi$, in their common plane, to the new $x_3$ and $y_3$ axes respectively.
This forms the frame $S_3$. Usually the frame $S_3$ is identified with the body frame and $S_0$
with the inertial frame. To compare them with the spherical polar coordinates $(r, \theta, \phi)$ one
can easily see the total resemblance between the two $\theta$’s and the two $\phi$’s, and the radial line
($r$) having a roll about itself by an angle $\psi$, which cannot be realised in any usual spherical
polar coordinate system. It is indeed a replacement of the length-like $r$-coordinate of the
spherical polar coordinate system by an angular coordinate $\psi$ in the Eulerian description.

Now consider the position vector of any given point P having coordinates $(x, y, z)$ with
respect to a rectangular Cartesian coordinate system. Suppose this coordinate system is
rotated about its x-axis through an angle $\theta$, so that with respect to the rotated coordinate
frame the coordinates of the same point P become $(x', y', z')$. Denoting $X = (x, y, z)$ and
$X' = (x', y', z')$ we have the relation in matrix notation

$$X' = R_x(\theta)\,X$$

where

$$R_x(\theta) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{pmatrix} \tag{12.80}$$





Fig. 12.17 Eulerian angles $(\phi, \theta, \psi)$ as angles of elementary rotations of a rectangular Cartesian
frame of reference in the sequence $S_0 \to S_1 \to S_2 \to S_3$


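If fully expanded, this transformation would look like

$$x_1' = x_1$$

$$x_2' = x_2\cos\theta + x_3\sin\theta$$

$$x_3' = -x_2\sin\theta + x_3\cos\theta$$

Similarly, if we made a rotation by an angle $\theta$ about the y-axis instead of about the x-axis,
we would have got

$$X' = R_y(\theta)\,X$$

with

$$R_y(\theta) = \begin{pmatrix} \cos\theta & 0 & -\sin\theta \\ 0 & 1 & 0 \\ \sin\theta & 0 & \cos\theta \end{pmatrix} \tag{12.81}$$

and the rotation about the z-axis through an angle $\theta$ would give

$$X' = R_z(\theta)\,X$$

with

$$R_z(\theta) = \begin{pmatrix} \cos\theta & \sin\theta & 0 \\ -\sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix} \tag{12.82}$$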




We can easily write the transformation matrices for the three Eulerian angles as

$$R_z(\phi) = \begin{pmatrix} \cos\phi & \sin\phi & 0 \\ -\sin\phi & \cos\phi & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

$$R_x(\theta) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{pmatrix} \tag{12.83}$$

$$R_z(\psi) = \begin{pmatrix} \cos\psi & \sin\psi & 0 \\ -\sin\psi & \cos\psi & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

The first one obviously corresponds to a rotation by $\phi$ about the $z_0$-axis, the second one to
a rotation by $\theta$ about the first intermediate or the nodal $x_1$-axis, and finally the third one
to a rotation by $\psi$ about the body’s $z_3$-axis.

When these three Eulerian rotations are performed in the sequence described above to
give a final rotated frame $S_3$, the components of a vector in the $S_0$ and $S_3$ frames are
related by

$$X''' = R(\phi, \theta, \psi)\,X \tag{12.84}$$

where the Eulerian rotation matrix,

$$R(\phi, \theta, \psi) = R_z(\psi)\cdot R_x(\theta)\cdot R_z(\phi)$$

$$= \begin{pmatrix}
\cos\psi\cos\phi - \cos\theta\sin\phi\sin\psi & \cos\psi\sin\phi + \cos\theta\cos\phi\sin\psi & \sin\psi\sin\theta \\
-\sin\psi\cos\phi - \cos\theta\sin\phi\cos\psi & -\sin\psi\sin\phi + \cos\theta\cos\phi\cos\psi & \cos\psi\sin\theta \\
\sin\theta\sin\phi & -\sin\theta\cos\phi & \cos\theta
\end{pmatrix}$$

Since all the matrices are orthogonal, their inverses are simply equal to their transposes.
We can therefore write

$$X = \tilde{R}(\phi, \theta, \psi)\,X''' \tag{12.85}$$

where $\tilde{R}(\phi, \theta, \psi)$ is the transpose of $R(\phi, \theta, \psi)$.

It is easy to see that the elementary Eulerian rotations are independent of each other in
the sense that none of these can be effected by using the other two. We have seen that a
rigid body has exactly three rotational degrees of freedom. Thus the three Eulerian angles
$(\phi, \theta, \psi)$ can be used as generalised coordinates to fix the orientation of a rigid body with
respect to a given coordinate frame. In other words, if a coordinate frame with a set of
rectangular Cartesian coordinates $(x''', y''', z''')$ is obtained by rotating a frame having the
rectangular Cartesian coordinates $(x, y, z)$ about any arbitrary direction passing through the
origin and through an arbitrary angle, it is possible to find a unique set of three Eulerian
angles $(\phi, \theta, \psi)$ such that the components of any given instantaneous position vector in both
the frames are mutually related through the transformations (12.84) and (12.85).
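
The composition of the three elementary rotations can be verified numerically. The following sketch is illustrative and not part of the original text: it multiplies the matrices of Eq. (12.83) in the order $R_z(\psi)\,R_x(\theta)\,R_z(\phi)$ and compares the product with the closed form of the Eulerian rotation matrix. The function names are assumptions introduced for the example.

```python
import math

def rot_z(a):
    """Passive rotation of the axes about z by angle a, as in Eq. (12.82)."""
    c, s = math.cos(a), math.sin(a)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(a):
    """Passive rotation of the axes about x by angle a, as in Eq. (12.80)."""
    c, s = math.cos(a), math.sin(a)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]

def matmul(P, Q):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(P):
    """Transpose of a 3x3 matrix stored as nested lists."""
    return [list(row) for row in zip(*P)]

def euler_matrix(phi, theta, psi):
    """R(phi, theta, psi) built as R_z(psi) R_x(theta) R_z(phi)."""
    return matmul(rot_z(psi), matmul(rot_x(theta), rot_z(phi)))

def euler_matrix_closed_form(phi, theta, psi):
    """The same matrix written out entry by entry."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [[cp * cf - ct * sf * sp, cp * sf + ct * cf * sp, sp * st],
            [-sp * cf - ct * sf * cp, -sp * sf + ct * cf * cp, cp * st],
            [st * sf, -st * cf, ct]]

if __name__ == "__main__":
    R = euler_matrix(0.4, 1.1, -0.7)
    for row in R:
        print(["%+.4f" % x for x in row])
```

Multiplying `euler_matrix(...)` by its transpose should return the unit matrix, which is the orthogonality property used in Eq. (12.85).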






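$$\hat{i}_1 = \hat{i}_2 \qquad \hat{j}_1 = \cos\theta\,\hat{j}_2 - \sin\theta\,\hat{k}_2 \qquad \hat{k}_1 = \sin\theta\,\hat{j}_2 + \cos\theta\,\hat{k}_2$$

Between $S_2$ and $S_3$:

$$\hat{i}_3 = \cos\psi\,\hat{i}_2 + \sin\psi\,\hat{j}_2 \qquad \hat{j}_3 = -\sin\psi\,\hat{i}_2 + \cos\psi\,\hat{j}_2 \qquad \hat{k}_3 = \hat{k}_2$$

$$\hat{i}_2 = \cos\psi\,\hat{i}_3 - \sin\psi\,\hat{j}_3 \qquad \hat{j}_2 = \sin\psi\,\hat{i}_3 + \cos\psi\,\hat{j}_3 \qquad \hat{k}_2 = \hat{k}_3$$

These are simply the vectorial representations of the transformation matrices given in Eq.
(12.83).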

Now suppose we have a rectangular Cartesian frame $S$, and rotate the frame by a finite
angle $\chi$ about an axis that is represented by the unit vector $\hat{n}$ and that passes through
the origin to obtain it as a frame $S'$. Any arbitrary position vector $\mathbf{r}$ in the frame $S$ would
read in the frame $S'$ as

$$\mathbf{r} \to \mathbf{r}' = \mathbf{r}\cos\chi + \hat{n}(\mathbf{r}\cdot\hat{n})(1 - \cos\chi) + (\mathbf{r}\times\hat{n})\sin\chi \tag{12.89}$$

This equation was possibly first derived by Gibbs in 1901. Here the angle $\chi$ represents,
by Euler’s theorem, the single composite angle of rotation for the operation of the three
elementary Eulerian rotations in succession represented by the single vector Eq. (12.84),
where $X$ corresponds to $\mathbf{r}$ and $X'''$ to $\mathbf{r}'$. One can solve uniquely for $\chi$, $n_1$ and $n_2$ from
these equations in terms of $\phi$, $\theta$ and $\psi$, and of course, $n_3$ will be determined from the
normalisation condition $n_1^2 + n_2^2 + n_3^2 = 1$, where $\hat{n} = (n_1, n_2, n_3)$.
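
As a quick check of Eq. (12.89), the sketch below (illustrative only, not part of the original text) applies the formula for a rotation of the frame about the z-axis, where it should reproduce the matrix $R_z$ of Eq. (12.82), and confirms that the length of the vector is unchanged for an arbitrary axis. The function name `rotate_frame` and the test vectors are assumptions.

```python
import math

def dot(a, b):
    """Scalar product of two 3-tuples."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product a x b of two 3-tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def rotate_frame(r, n, chi):
    """Components of r in the frame rotated by chi about the unit vector n, Eq. (12.89)."""
    c, s = math.cos(chi), math.sin(chi)
    rn = dot(r, n)
    rxn = cross(r, n)
    return tuple(c * ri + (1.0 - c) * rn * ni + s * xi
                 for ri, ni, xi in zip(r, n, rxn))

if __name__ == "__main__":
    r = (1.0, 2.0, 3.0)
    chi = 0.3
    # Rotation of the axes about z: compare with x' = x cos + y sin, y' = -x sin + y cos.
    print(rotate_frame(r, (0.0, 0.0, 1.0), chi))
    print((r[0] * math.cos(chi) + r[1] * math.sin(chi),
           -r[0] * math.sin(chi) + r[1] * math.cos(chi),
           r[2]))
```

The two printed triples should coincide, since the $(\mathbf{r}\times\hat{n})\sin\chi$ term carries the sign appropriate to a rotation of the frame rather than of the vector.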


12.22 MOTION OF A HEAVY SYMMETRIC TOP ROTATING ABOUT A 
FIXED POINT IN THE BODY UNDER THE ACTION OF GRAVITY 

Let the principal axes about the fixed point of rotation of a symmetric top be chosen as the
body frame axes (see Fig. 12.15) so that

$$I_{11} = A = I_{22} = B \quad \text{and} \quad I_{33} = C$$

In terms of the Eulerian angles and their time derivatives, $\dot{\phi}$, $\dot{\theta}$ and $\dot{\psi}$, the instantaneous
angular velocity of the top about the fixed point of rotation is given by

$$\boldsymbol{\omega} = \omega_1\hat{i} + \omega_2\hat{j} + \omega_3\hat{k}$$

where $\hat{i}$, $\hat{j}$ and $\hat{k}$ are the unit vectors along the principal axes (that is, the body frame
axes) and therefore, $\omega_1$, $\omega_2$ and $\omega_3$ are given by Eq. (12.88).

There are two ways of setting up the equations of motion. 

12.22.1 The Lagrangian Method 

One constructs the Lagrangian of the system from the expressions for kinetic and potential 




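energies. We have, for the kinetic energy,

$$T = \tfrac{1}{2} I_{ij}\,\omega_i\omega_j
= \tfrac{1}{2}A\omega_1^2 + \tfrac{1}{2}A\omega_2^2 + \tfrac{1}{2}C\omega_3^2
= \tfrac{1}{2}A\left(\dot{\theta}^2 + \dot{\phi}^2\sin^2\theta\right) + \tfrac{1}{2}C\left(\dot{\phi}\cos\theta + \dot{\psi}\right)^2 \tag{12.90}$$

and the potential energy

$$V = Mgh\cos\theta \tag{12.91}$$

where $h$ is the distance of the CG from the point of support (the origin).

Therefore, the Lagrangian of this conservative system is given by

$$L = \tfrac{1}{2}A\left(\dot{\theta}^2 + \dot{\phi}^2\sin^2\theta\right) + \tfrac{1}{2}C\left(\dot{\phi}\cos\theta + \dot{\psi}\right)^2 - Mgh\cos\theta \tag{12.92}$$

This Lagrangian has three cyclic variables, namely $t$, $\phi$ and $\psi$. Hence the total energy
$E$ and two canonical momenta $p_\phi$ and $p_\psi$ are constants of motion. These are

$$E = T + V = \tfrac{1}{2}A\left(\dot{\theta}^2 + \dot{\phi}^2\sin^2\theta\right) + \tfrac{1}{2}C\left(\dot{\phi}\cos\theta + \dot{\psi}\right)^2 + Mgh\cos\theta \tag{12.93}$$

$$p_\psi = \frac{\partial L}{\partial\dot{\psi}} = C\left(\dot{\phi}\cos\theta + \dot{\psi}\right) = \text{const.} = C\omega_3 \tag{12.94}$$

and

$$p_\phi = \frac{\partial L}{\partial\dot{\phi}} = A\dot{\phi}\sin^2\theta + C\left(\dot{\phi}\cos\theta + \dot{\psi}\right)\cos\theta = \text{const.} = D \ \text{say} \tag{12.95}$$

Using Eqs (12.94) and (12.95), $\dot{\phi}$ and $\dot{\psi}$ can be eliminated from the expression for the
energy $E$ given by Eq. (12.93), leaving it as a function of $\theta$ and $\dot{\theta}$. Thus Eq. (12.93)
reduces to a first order total differential equation in $\theta$. Its solution $\theta(t)$, and hence the
solutions $\phi(t)$ and $\psi(t)$ obtainable from the expressions for $\dot{\phi}$ and $\dot{\psi}$ in terms of $\theta$ and $\dot{\theta}$,
constitute the complete solution to the above problem.

12.22.2 The Eulerian Method

Alternatively, we can use Euler’s equations of motion, namely,

$$\frac{d\mathbf{L}}{dt} + \boldsymbol{\omega}\times\mathbf{L} = \boldsymbol{\Gamma} \tag{12.48}$$

where $\mathbf{L}$ and $\boldsymbol{\Gamma}$ are the angular momentum of the top and the external torque applied on
it, both expressed in terms of the unit vectors of the body frame. Here

$$\boldsymbol{\Gamma} = \mathbf{r}\times\mathbf{F} = h\hat{k}\times(-Mg\,\hat{z})$$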




In terms of the Eulerian intermediate frames defined in the previous section ($S_1$, $S_2$, $S_3$)
we have

$$\hat{k} = \hat{k}_3 = (\hat{k} \text{ in the } S_3 \text{ frame})$$

$$\hat{z} = \hat{k}_1 = (\hat{k} \text{ in the } S_1 \text{ frame})$$

Therefore,

$$\boldsymbol{\Gamma} = -Mgh\,(\hat{k}_3\times\hat{k}_1)
= -Mgh\,(\hat{k}_2\times\hat{k}_1)
= -Mgh\,(-\sin\theta\,\hat{i}_2)
= Mgh\sin\theta\,(\cos\psi\,\hat{i}_3 - \sin\psi\,\hat{j}_3) \tag{12.96}$$

Since $\Gamma_3 = 0$, the third of Euler’s equations is

$$C\dot{\omega}_3 = \Gamma_3 = 0$$

which leads to

$$C\omega_3 = C\left(\dot{\phi}\cos\theta + \dot{\psi}\right) = \text{const.}$$

Again, since $\boldsymbol{\Gamma}$ is perpendicular to $\hat{k}_1$, we have $\Gamma_z = 0$ leading correspondingly to
$p_\phi = $ constant. And finally the energy $E$ can also be shown to be a constant of motion by
differentiating Eq. (12.93) with respect to time and plugging Euler’s equations of motion
in it which leads to a cancellation of all the terms. These facts may then be used to solve
the dynamical problem completely, as outlined at the end of the Lagrangian method. So
once again one can see the superiority of the Lagrangian method over the Eulerian one, the
latter being essentially a Newtonian scheme based on the concept of forces, torques, etc.
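
The reduction to a one-dimensional problem in $\theta$, outlined at the end of the Lagrangian method, can be sketched numerically. The code below is a hedged illustration, not part of the original text: it eliminates $\dot{\phi}$ through $p_\phi = D$, integrates the $\theta$ equation obtained from the Lagrangian (12.92), namely $A\ddot{\theta} = A\dot{\phi}^2\sin\theta\cos\theta - C\omega_3\dot{\phi}\sin\theta + Mgh\sin\theta$, and monitors the energy of Eq. (12.93). All function names and the parameter values in the demonstration are assumptions chosen for a fast top.

```python
import math

def p_phi(theta, phidot, A, C, omega3):
    """The constant D of Eq. (12.95), evaluated from a given state."""
    return A * phidot * math.sin(theta) ** 2 + C * omega3 * math.cos(theta)

def phidot(theta, A, C, omega3, D):
    """Precession rate obtained by solving Eq. (12.95) for the phi velocity."""
    return (D - C * omega3 * math.cos(theta)) / (A * math.sin(theta) ** 2)

def theta_accel(theta, A, C, Mgh, omega3, D):
    """Second derivative of theta from the theta Lagrange equation."""
    pd = phidot(theta, A, C, omega3, D)
    s, c = math.sin(theta), math.cos(theta)
    return (A * pd * pd * s * c - C * omega3 * pd * s + Mgh * s) / A

def energy(theta, thetadot, A, C, Mgh, omega3, D):
    """Total energy E of Eq. (12.93), written with the constants omega3 and D."""
    pd = phidot(theta, A, C, omega3, D)
    return (0.5 * A * (thetadot ** 2 + pd ** 2 * math.sin(theta) ** 2)
            + 0.5 * C * omega3 ** 2 + Mgh * math.cos(theta))

def integrate(theta0, thetadot0, A, C, Mgh, omega3, D, dt=1e-3, steps=5000):
    """Fourth-order Runge-Kutta on (theta, thetadot); returns the list of states."""
    def f(y):
        return (y[1], theta_accel(y[0], A, C, Mgh, omega3, D))
    y = (theta0, thetadot0)
    path = [y]
    for _ in range(steps):
        k1 = f(y)
        k2 = f((y[0] + 0.5 * dt * k1[0], y[1] + 0.5 * dt * k1[1]))
        k3 = f((y[0] + 0.5 * dt * k2[0], y[1] + 0.5 * dt * k2[1]))
        k4 = f((y[0] + dt * k3[0], y[1] + dt * k3[1]))
        y = (y[0] + dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0,
             y[1] + dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0)
        path.append(y)
    return path

if __name__ == "__main__":
    # Illustrative values only: a fast top released with no initial precession.
    A, C, Mgh, omega3, theta0 = 1.0, 0.5, 1.0, 10.0, 0.5
    D = p_phi(theta0, 0.0, A, C, omega3)
    path = integrate(theta0, 0.0, A, C, Mgh, omega3, D)
    thetas = [y[0] for y in path]
    print("theta range:", min(thetas), max(thetas))
```

With these assumed initial conditions $\theta$ should oscillate between its starting value and a slightly larger one, which is the bounded motion in $\theta$ that the following section analyses.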


12.23 DETAILED STUDY OF THE MOTION OF A SYMMETRIC TOP 

Let us now initiate the procedure outlined in the last paragraph of section 12.22.1. Eliminating
$\dot{\phi}$ in Eq. (12.93) by using Eq. (12.95), we get

$$A\dot{\theta}^2 + A\left(\frac{D - C\omega_3\cos\theta}{A\sin^2\theta}\right)^2\sin^2\theta + 2Mgh\cos\theta + C\omega_3^2 = 2E$$

Putting $2E - C\omega_3^2 = E'$,

$$A\sin^2\theta\,\dot{\theta}^2 + A\left(\frac{D - C\omega_3\cos\theta}{A}\right)^2 + 2Mgh\cos\theta\,\sin^2\theta = E'\sin^2\theta \tag{12.97}$$

The equation of motion expressed in Eq. (12.97) corresponds to a one dimensional motion
in $\theta$. One may regard $\tfrac{1}{2}A\dot{\theta}^2$ as the kinetic energy of the $\theta$ motion for which the effective
potential energy $V_{\text{eff}}(\theta)$ is given by the rest of the terms, that is,

$$V_{\text{eff}} = \tfrac{1}{2}A\left(\frac{D - C\omega_3\cos\theta}{A\sin^2\theta}\right)^2\sin^2\theta + Mgh\cos\theta = \tfrac{1}{2}E' - \tfrac{1}{2}A\dot{\theta}^2$$




Fig. 12.18 Finding roots of the cubic equation $f(z) = 0$


Let us denote these three distinct real roots by $z_1, z_2, z_3$ with $z_1 < z_2 < z_3$. The
change of sign of $f(z)$ between $z = +1$ and $z = +\infty$ implies either all the three or at
least one root must lie above $z = +1$.

Since $f(z)$ must be $\geq 0$ over the allowed range of $z$, namely $|z| \leq 1$, it is imperative
that there exist a range, namely $z_1 \leq z \leq z_2$, for the allowed motion, implying that the
motion must be bounded in $\theta$ between, say, $\theta_1 \geq \theta \geq \theta_2$ corresponding to $z_1 \leq z \leq z_2$,
so that only the third root $z_3$ can lie above $z = +1$, as shown in Fig. 12.18.

However, if the initial conditions are such that $D = C\omega_3$, then $f(z_2) = 0$ for $z_2 = 1$ or
$\theta_2 = 0$, meaning that the top becomes vertically aligned at some stage of its motion.

Again, since the top is spinning on a flat table or ground, the value of $\theta$ can never exceed
$\pi/2$, hence $z$ can never be negative (unless the top is hung like a gyroconical pendulum),
so that the actual allowed range of $\theta$ is $0 \leq \theta \leq \pi/2$, or $1 \geq z \geq 0$.

Let us further introduce another function defined by

$$g(z) = \frac{f(z)}{1 - z^2}$$

so that

$$g(z) = A\dot{\theta}^2 = E' - 2Mghz - \frac{(D - C\omega_3 z)^2}{A(1 - z^2)} \qquad (12.99)$$

As $z \to \pm 1$, $g(z) \to -\infty$ and the allowed solutions must have positive values of $g(z)$
(see Fig. 12.19) with, as before, only two possible zeroes between $z = \pm 1$.
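
The root structure described above can be checked numerically. The following is a minimal Python sketch, not part of the original treatment. It assumes that $f(z)$ is the cubic $f(z) = (1 - z^2)\,g(z)$ implied by Eq. (12.99); the helper names (`f`, `bisect`, `real_roots`) and all parameter values are illustrative assumptions. The sample top is released at $\theta = 60^\circ$ with $\dot{\theta} = \dot{\phi} = 0$.

```python
import math

def f(z, A, C, w3, D, Ep, Mgh):
    """Assumed cubic f(z) = (E' - 2Mgh z)(1 - z^2) - (D - C w3 z)^2 / A."""
    return (Ep - 2.0 * Mgh * z) * (1.0 - z * z) - (D - C * w3 * z) ** 2 / A

def bisect(func, lo, hi, tol=1e-12):
    """Root of func in [lo, hi], assuming a sign change between lo and hi."""
    flo = func(lo)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        fmid = func(mid)
        if flo * fmid <= 0.0:
            hi = mid
        else:
            lo, flo = mid, fmid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

def real_roots(func, lo, hi, n=20000):
    """Scan [lo, hi] on a uniform grid and refine every sign change."""
    roots = []
    step = (hi - lo) / n
    a, fa = lo, func(lo)
    for i in range(1, n + 1):
        b = lo + i * step
        fb = func(b)
        if fb == 0.0:
            roots.append(b)              # grid point happens to be a root
        elif fa != 0.0 and fa * fb < 0.0:
            roots.append(bisect(func, a, b))
        a, fa = b, fb
    return roots

# Illustrative values (assumptions): A = 1, C = 0.5, w3 = 10, Mgh = 1.
A, C, w3, Mgh = 1.0, 0.5, 10.0, 1.0
theta_start = math.radians(60.0)
D = C * w3 * math.cos(theta_start)       # p_phi when phi-dot = 0 initially
Ep = 2.0 * Mgh * math.cos(theta_start)   # E' when theta-dot = phi-dot = 0
z1, z2, z3 = real_roots(lambda z: f(z, A, C, w3, D, Ep, Mgh), -1.0, 15.0)
print(z1, z2, z3)
```

For these values the two smaller roots bound the nutation in $z$, while the third root lies above $z = +1$ and is unphysical, in line with Fig. 12.18.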





Fig. 12.19 Graphical representation of the function $g(z)$ showing the roots
of the equation $g(z) = 0$

We now study in detail various cases of top motion.

Case I: $D \neq C\omega_3$ (The Rise and Fall of the Top)

In this case $\theta_1$ and $\theta_2$ correspond to the two distinct roots of $g(z) = 0$, so that for
$\theta = \theta_1$ and $\theta = \theta_2$, $\dot{\theta} = 0$. Graphically this corresponds to three possible subcases as
depicted in Fig. 12.20, depending on whether the sign of $\dot{\phi}$ changes or not.


Fig. 12.20 Three cases, (a), (b) and (c), of a precessing and nutating symmetric top. The rate
of precession can be faster than, as in (a), equal to, as in (b), or slower than, as in (c), a
critical value depending on initial conditions


In all the three subcases the variations of $\theta$ are drawn in such a way that the value of $\dot{\theta}$
at $\theta_1$ and $\theta_2$ is zero. So there will be a wobbling in the $\theta$ motion. Any periodic wobbling
in $\theta$ motion is by definition called a nutation.

In the subcase (a), it can be seen from Fig. 12.20a that when $\dot{\theta}$ vanishes at the extremities
of $\theta$, $\dot{\phi}$ does not vanish and $\dot{\phi} > 0$. So the free upper end of the top describes a curve like
the one shown in Fig. 12.20a.

In the subcase (b), the initial conditions are such that when $z$ is at the lower extremity,
that is, at the upper extremity for $\theta$, $\dot{\phi} > 0$ at $\theta = \theta_1$, and $\dot{\phi} < 0$ for $\theta = \theta_2$. This
will make the motion of the top wobbling like the one shown in Fig. 12.20b.

In the subcase (c), the initial conditions are given in such a balanced way that when $\dot{\theta}$
vanishes at $\theta = \theta_2$, the value of $\dot{\phi}$ also exactly vanishes, but at the other extremity of $\theta$,
namely at $\theta = \theta_1$, the value of $\dot{\phi}$ is positive. Under this situation, when the top reaches
the uppermost point the kinetic energy associated with the motion of its axis is reduced to
zero, because it moves neither in $\theta$ nor in $\phi$. So the top must momentarily fall vertically,
forming a cusp at the topmost point. This is shown in Fig. 12.20c.

The required conditions on $M$, $h$, $D$, $C$, $A$, $\omega_3$ and $E'$ for satisfying any one of the
above cases can in principle be derived, but the derivations are quite tedious. A book in four
volumes was devoted to the various possible cases of top motion by Felix Klein and Arnold
Sommerfeld, who pioneered the systematic study of these motions.

Case II: The initial conditions are such that $D = C\omega_3$ and $\dot{\theta} \neq 0$ at $\theta = 0$ (or $z = 1$)


For this case,

$$g(z) = E' - 2Mghz - \frac{C^2\omega_3^2}{A}\,\frac{1 - z}{1 + z} = A\dot{\theta}^2 \qquad (12.100)$$

The equation $g(z) = 0$ is now quadratic in $z$ with one root $> 1$, because at $z = 1$,

$$g(1) = E' - 2Mgh = A\dot{\theta}^2 > 0 \qquad (12.101)$$

by the chosen initial condition. Therefore, the allowed range of $z$ is between $z = z_1$ (say)
and $z = 1$ (see Fig. 12.21).

Hence, the axis of the top may periodically pass through the vertical and can come down
as far as $z = z_1$, or $\theta = \theta_1$. As seen from above, the axis of the top may appear to trace
a curve as shown in Fig. 12.22.
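
As a numerical illustration of Case II, the short Python sketch below multiplies Eq. (12.100) by $(1 + z)$ and solves the resulting quadratic in $z$. This is only a sketch under stated assumptions: the function name `case_two_roots` and the sample values of $A$, $C$, $\omega_3$, $Mgh$ and $E'$ are hypothetical, chosen so that $E' > 2Mgh$ as required by Eq. (12.101).

```python
import math

def case_two_roots(A, C, w3, Mgh, Ep):
    """Roots of (1 + z) g(z) = 0 for D = C*w3, written as a z^2 + b z + c = 0.

    (E' - 2Mgh z)(1 + z) - (C^2 w3^2 / A)(1 - z) = 0 expands to
    -2Mgh z^2 + (E' - 2Mgh + k) z + (E' - k) = 0 with k = C^2 w3^2 / A.
    """
    k = (C * w3) ** 2 / A
    a = -2.0 * Mgh
    b = Ep - 2.0 * Mgh + k
    c = Ep - k
    disc = math.sqrt(b * b - 4.0 * a * c)
    r1 = (-b + disc) / (2.0 * a)
    r2 = (-b - disc) / (2.0 * a)
    return min(r1, r2), max(r1, r2)

# Illustrative values (assumptions): E' = 3 exceeds 2Mgh = 2.
z_low, z_high = case_two_roots(A=1.0, C=0.5, w3=10.0, Mgh=1.0, Ep=3.0)
theta_1 = math.degrees(math.acos(z_low))   # largest tilt reached by the axis
print(z_low, z_high, theta_1)
```

One root lies above $z = 1$ and is discarded; the other gives the lowest point $z_1$ reached by the axis.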

Case III: Sleeping Top (the initial conditions are such that $D = C\omega_3$, and $\dot{\theta} = 0$
when $\theta = 0$)

If $\dot{\theta} = 0$ when $z = 1$, we get

$$g(z = 1) = 0 = E' - 2Mgh \qquad (12.102)$$




(see Fig. 12.23). 



Fig. 12.23 Admissible solutions to $g(z) \geq 0$ with one solution at $z = 1$, for the cases:
(a) $g'(z) \neq 0$ within $(-1, 1]$, (b) $g'(z) = 0$ at $z = 1$, and (c) $g'(z) = 0$ in
the interval $(-1, 1)$


Now let us see what slope the function $g(z)$ has at $z = 1$. We have,

$$g'(z) = -\left[2Mgh - \frac{C^2\omega_3^2}{A(1 + z)}\right] + (1 - z)\,\frac{C^2\omega_3^2}{A(1 + z)^2} \qquad (12.104)$$

Therefore,

$$g'(z = 1) = -2Mgh + \frac{C^2\omega_3^2}{2A} = \frac{C^2\omega_3^2 - 4MghA}{2A} \qquad (12.105)$$

Depending on whether $g'(z = 1) > 0$, $g'(z = 1) = 0$, or $g'(z = 1) < 0$, one can think of three
subcases (a), (b) and (c) shown in Fig. 12.23 with the equivalent required conditions

$C^2\omega_3^2 > 4MghA$,
$C^2\omega_3^2 = 4MghA$ and
$C^2\omega_3^2 < 4MghA$ respectively.


The upright vertical posture of the top as a possible solution is assured in all these three
subcases (a), (b) and (c), with varying degrees of freedom to deviate from this upright
posture. When the top can continue to spin, keeping its spin axis upright, the top in this
spinning state is called a sleeping top. For subcase (a),

$$C^2\omega_3^2 > 4MghA$$

and the sleeping top is stable; it simply cannot fall down due to the lack of any other possible
value of $\theta$ as a solution. Sometimes a top in this state is also called a strong top. The above
condition can be rewritten for this case as

$$\frac{1}{2}C\omega_3^2 > 2\left(\frac{A}{C}\right)Mgh$$

This means that the amount of rotational kinetic energy is more than sufficient to raise the
centre of mass of the top by a height $2h$. In fact, the stable static equilibrium posture of a
top corresponds to a state of a freely hanging top, that is, $\theta = \pi$, and its CM has to be
raised by a total height of $2h$ in order to make it upright or sleeping. Also in the sleeping
state, the externally applied gravitational torque disappears, and therefore, the strong top
continues in its sleeping state for an indefinite period. This is an example of a dynamically
stable equilibrium state of motion.

For subcase (b) as depicted in Fig. 12.23, the spin must have reduced from its value
suitable for subcase (a) and now satisfies the condition

$$C^2\omega_3^2 = 4MghA$$

The top becomes critically unstable for maintaining its sleeping state any longer.

Due to unavoidable frictional loss of energy the spin rate continues to decrease further
and case (c) is realised with the condition

$$C^2\omega_3^2 < 4MghA$$

The top now begins to slip down from its initial upright sleeping state. Such a top is called
a weak top, which cannot precess at a value of $\theta$ less than $\theta_0$ given by the condition

$$C^2\omega_3^2 = 4MghA\cos\theta_0$$

Since $g(z) = E' - 2V_{\text{eff}}(\theta)$, the maximum of the function $g(z)$ must correspond to the
minimum of $V_{\text{eff}}(\theta)$ and therefore, a weak top must wobble about some mean value of $\theta$ at
which the maximum of the function $g(z)$ occurs.

Then with time, more and more frictional losses of the rotational kinetic energy of the
top make it wobble with a bigger and bigger amplitude in $\theta$, until finally the top falls on
to the ground and begins to roll on the ground instead of rotating about the fixed point on
its apex.
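
The three subcases of the sleeping top can be told apart by comparing $C^2\omega_3^2$ with $4MghA$. The Python sketch below illustrates this classification and the limiting angle $\theta_0$ of a weak top. The function names and the numerical values are assumptions made only for illustration.

```python
import math

def critical_spin(A, C, Mgh):
    """Spin at which C^2 w3^2 = 4 Mgh A, i.e. the boundary of subcase (b)."""
    return 2.0 * math.sqrt(Mgh * A) / C

def classify_sleeping_top(A, C, w3, Mgh):
    """Return 'strong', 'critical' or 'weak' for subcases (a), (b), (c)."""
    lhs = (C * w3) ** 2
    rhs = 4.0 * Mgh * A
    if math.isclose(lhs, rhs, rel_tol=1e-12):
        return "critical"
    return "strong" if lhs > rhs else "weak"

def weak_top_limit_angle(A, C, w3, Mgh):
    """theta_0 (radians) from C^2 w3^2 = 4 Mgh A cos(theta_0); weak tops only."""
    ratio = (C * w3) ** 2 / (4.0 * Mgh * A)
    if ratio > 1.0:
        raise ValueError("not a weak top: C^2 w3^2 exceeds 4 Mgh A")
    return math.acos(ratio)

# Illustrative values (assumptions): A = 1, C = 0.5, Mgh = 1.
w_c = critical_spin(1.0, 0.5, 1.0)
label_fast = classify_sleeping_top(1.0, 0.5, 10.0, 1.0)
label_slow = classify_sleeping_top(1.0, 0.5, 2.0, 1.0)
theta_0_deg = math.degrees(weak_top_limit_angle(1.0, 0.5, 2.0, 1.0))
print(w_c, label_fast, label_slow, theta_0_deg)
```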

Case IV: Steady Precessional Motion

It means that there is no nutation or wobbling in $\theta$, that the motion is conical about the local
vertical with a nonzero constant value of $\theta$, and that the motion in $\phi$ along the surface of
the conical precession is uniform. So mathematically, this case corresponds to $\dot{\theta} = \ddot{\theta} = 0$
for some value of $\theta = \theta_0 \neq 0$ and the precessional frequency $\dot{\phi} = \Omega$ = constant.


In terms of $g(z) = A\dot{\theta}^2$, this means that both $g(z)$ and $g'(z)$ must vanish for some
value of $z = z_0$ (see Fig. 12.24). For the condition $g'(z_0) = 0$, we get

$$Mgh - \frac{(D - C\omega_3 z_0)C\omega_3}{A(1 - z_0^2)} + \frac{z_0 (D - C\omega_3 z_0)^2}{A(1 - z_0^2)^2} = 0 \qquad (12.106)$$




condition $C^2\omega_3^2 \gg 4MghAz_0$ in the exact expression for both $\Omega_1$ and $\Omega_2$ respectively,
getting

$$\Omega_1 \simeq \frac{Mgh}{C\omega_3} \qquad (12.110)$$

and

$$\Omega_2 \simeq \frac{C\omega_3}{A\cos\theta_0} = \Omega_f \qquad (12.111)$$

This is the same $\Omega_f$ as obtained in Eq. (12.75).

The above results are quite interesting. If any fast spinning symmetric top is found to
execute a steady precessional motion, the top must initially be set into motion with one of
the two possible precessional rates given by Eqs (12.110) and (12.111) and then the top will
continue its motion in that given steady precessional mode. The frequency of the slower
mode is found to be independent of $\theta_0$, hence for any chosen value of $\theta_0$, the required
precessional frequency is the same. The relation between the signs of spin and precession
is obtained by substituting $\omega_3 = \Omega_1\cos\theta_0 + \dot{\psi}$ in Eq. (12.110) and then solving for
$\Omega_1 = \Omega_1(\dot{\psi})$. One should remember that $\dot{\psi}$, not $\omega_3$, stands for the spin angular velocity.


The frequency of the faster mode, on the other hand, is independent of the gravity $g$, but
depends on $\theta_0$, which is the angle between the axis of precession and the body's symmetry
axis. $\Omega_2$ is the same as that for a freely precessing symmetric object, namely $\Omega_f$ (see
Eq. (12.75)). This is an extremely fast precession, almost as fast as $\omega_3$, if not exceeding
$\omega_3$. The value of $\dot{\psi}$ or the spin angular velocity may become negligible compared to $\Omega_2$.
Practically the whole of the top's angular momentum is due to the precessional frequency,
leaving hardly any room for spinning about the symmetry axis ($\Omega_2\cos\theta_0 \simeq C\omega_3/A$). Also
$\Omega_2$ changes sign as $\theta_0$ crosses $\pi/2$, and $\Omega_2 \to \infty$ as $\theta_0 \to \pi/2$. In a gravity-free
condition, only $\Omega_2 = \Omega_f$ exists and the required spin angular velocity is

$$\dot{\psi} = \frac{A - C}{C}\,\Omega_f\cos\theta_0$$

However, in the presence of gravity, the slower mode is selected in most natural phenomena
involving steady precessions.
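
To see how good the fast-top approximations of Eqs (12.110) and (12.111) are, one can compare them with the exact steady rates. The Python sketch below assumes that the exact rates are the two roots of $A z_0 \Omega^2 - C\omega_3\Omega + Mgh = 0$, which is what Eq. (12.106) reduces to when $D - C\omega_3 z_0 = A\Omega(1 - z_0^2)$ is substituted. The function name and the sample parameters are hypothetical.

```python
import math

def steady_precession_rates(A, C, w3, Mgh, theta0):
    """Exact slow and fast steady precession rates for tilt theta0 (radians).

    Solves A*z0*Omega^2 - C*w3*Omega + Mgh = 0 with z0 = cos(theta0).
    Assumes theta0 != pi/2 and C^2 w3^2 >= 4 Mgh A z0.
    """
    z0 = math.cos(theta0)
    disc = (C * w3) ** 2 - 4.0 * Mgh * A * z0
    if disc < 0.0:
        raise ValueError("spin too small for steady precession at this tilt")
    root = math.sqrt(disc)
    slow = 2.0 * Mgh / (C * w3 + root)       # cancellation-free form of the small root
    fast = (C * w3 + root) / (2.0 * A * z0)
    return slow, fast

# Illustrative fast top (assumed values): C*w3 = 50, Mgh = 1, A = 1, theta0 = 30 deg.
A, C, w3, Mgh = 1.0, 0.5, 100.0, 1.0
theta0 = math.radians(30.0)
slow, fast = steady_precession_rates(A, C, w3, Mgh, theta0)
slow_approx = Mgh / (C * w3)                    # Eq. (12.110)
fast_approx = C * w3 / (A * math.cos(theta0))   # Eq. (12.111)
print(slow, slow_approx, fast, fast_approx)
```

For this choice the exact and approximate rates agree to better than one part in a thousand, as expected when $C^2\omega_3^2 \gg 4MghAz_0$.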

We now wish to study the stability of the above steady precessional motion. 


Stability Analysis of Steady Precessional Motion: We know that

$$A\dot{\theta}^2 = g(z)$$

Differentiating with respect to time $t$,

$$2A\dot{\theta}\ddot{\theta} = g'(z)\dot{z} = g'(z)(-\sin\theta\,\dot{\theta}) = -\sqrt{1 - z^2}\,g'(z)\,\dot{\theta}$$

Therefore,

$$2A\ddot{\theta} = -\sqrt{1 - z^2}\,g'(z) \qquad (12.112)$$

For a steady precessional motion of the top, $g(z_0) = g'(z_0) = 0$ as $\dot{\theta} = \ddot{\theta} = 0$ at
$\theta = \theta_0$.



To see how a small perturbation in $\theta$ grows around $\theta = \theta_0$, we take

$$\theta = \theta_0 + \epsilon \qquad (12.113)$$

where $\epsilon$ is small compared to $\theta_0$, so that $z$ becomes $z_0 + \delta$, $\delta$ being small and given by

$$z = z_0 + \delta = \cos(\theta_0 + \epsilon) \simeq \cos\theta_0 - \epsilon\sin\theta_0$$

or

$$\delta = -\epsilon\sin\theta_0 = -\epsilon\sqrt{1 - z_0^2} \qquad (12.114)$$

Substituting Eqs (12.113) and (12.114) in Eq. (12.112) we get

$$2A\ddot{\epsilon} = -\sqrt{1 - z_0^2}\,g''(z_0)\,\delta = (1 - z_0^2)\,g''(z_0)\,\epsilon \qquad (12.115)$$

Here we have used

$$g'(z_0 + \delta) = g'(z_0) + g''(z_0)\,\delta$$

which is justified because $\delta \ll z_0$.

Since $z_0^2 < 1$ and $g(z)$ is maximum at $z_0$ (see Fig. 12.23), $g''(z_0) < 0$ and
$(1 - z_0^2) > 0$. Therefore,

$$(1 - z_0^2)\,g''(z_0) < 0$$

Let us now define

$$p^2 = -\frac{(1 - z_0^2)\,g''(z_0)}{2A} \qquad (12.116)$$

so that Eq. (12.115) becomes

$$\ddot{\epsilon} + p^2\epsilon = 0 \qquad (12.117)$$

with the solution

$$\epsilon = \epsilon_0\cos(pt + \alpha) \qquad (12.118)$$

Thus we see that a small perturbation $\epsilon$ in the steady precessional angle $\theta_0$ leads to small
oscillations in $\theta$ around $\theta_0$. Thus the steady precession obtained by requiring $g(z_0) =
g'(z_0) = 0$ is stable. The small oscillation around $\theta_0$ with a frequency $p$ is called
nutation.


We now calculate the nutational frequency $p$, defined by Eq. (12.116), for the slow mode of
the steady precession. Differentiating $g'(z)$ once more (cf. Eq. (12.106)) we get

$$g''(z) = -\frac{2C^2\omega_3^2}{A(1 - z^2)} - \frac{2(D - C\omega_3 z)^2}{A(1 - z^2)^2} + \frac{8C\omega_3 z\,(D - C\omega_3 z)}{A(1 - z^2)^2} - \frac{8z^2 (D - C\omega_3 z)^2}{A(1 - z^2)^3} \qquad (12.119)$$

Now we put $z = z_0$ in Eq. (12.119) and use Eq. (12.108) to get

$$p^2 = \frac{(C\omega_3 - 2A\Omega z_0)^2 + A^2\Omega^2(1 - z_0^2)}{A^2} \qquad (12.120)$$

Finally putting Eq. (12.110) in Eq. (12.120) we get

$$p \simeq \frac{C}{A}\,\omega_3 \qquad (12.121)$$

which is the required nutational frequency of the top motion. For $C \simeq A$, the nutational
frequency is practically as high as $\omega_3$.

Both $\Omega_1$ and $p$ are found to be independent of $\theta_0$, but in order to have a steady precession
possible the condition $C^2\omega_3^2 \geq 4MghAz_0$ has to be satisfied, that is, the top must spin at
least as fast as it would require to have its rotational kinetic energy exceeding the difference
of potential energy between the states for $-z_0$ and $+z_0$. For a sleeping top, this amounts
to $2Mgh < \frac{1}{2}C\omega_3^2$, or $C\omega_3^2 > 4Mgh$. Except for the factor $C/A$, the nutational
frequency is roughly the same as the component of the total angular velocity along the
symmetry axis of the top.
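
The closed form of Eq. (12.120) can be cross-checked against the definition in Eq. (12.116) by differentiating $g(z)$ numerically. The Python sketch below is only an illustration: it assumes $D = A\Omega(1 - z_0^2) + C\omega_3 z_0$ for the steady state, uses a central finite difference for $g''(z_0)$, and all names and parameter values are hypothetical.

```python
import math

def g(z, A, C, w3, D, Ep, Mgh):
    """g(z) of Eq. (12.99)."""
    return Ep - 2.0 * Mgh * z - (D - C * w3 * z) ** 2 / (A * (1.0 - z * z))

def p_squared_closed_form(A, C, w3, Omega, z0):
    """Nutation frequency squared from Eq. (12.120)."""
    return ((C * w3 - 2.0 * A * Omega * z0) ** 2
            + (A * Omega) ** 2 * (1.0 - z0 * z0)) / A ** 2

def p_squared_numeric(A, C, w3, Omega, z0, Mgh, h=1e-4):
    """p^2 = -(1 - z0^2) g''(z0) / (2A), with g'' from a central difference."""
    D = A * Omega * (1.0 - z0 * z0) + C * w3 * z0   # assumed steady-state p_phi
    Ep = 0.0                                        # E' drops out of g''
    args = (A, C, w3, D, Ep, Mgh)
    g2 = (g(z0 + h, *args) - 2.0 * g(z0, *args) + g(z0 - h, *args)) / h ** 2
    return -(1.0 - z0 * z0) * g2 / (2.0 * A)

# Illustrative fast top (assumed values), slow precession mode.
A, C, w3, Mgh = 1.0, 0.5, 100.0, 1.0
z0 = math.cos(math.radians(30.0))
Omega = 2.0 * Mgh / (C * w3 + math.sqrt((C * w3) ** 2 - 4.0 * Mgh * A * z0))
p_exact = math.sqrt(p_squared_closed_form(A, C, w3, Omega, z0))
p_check = math.sqrt(p_squared_numeric(A, C, w3, Omega, z0, Mgh))
p_approx = C * w3 / A                               # Eq. (12.121)
print(p_exact, p_check, p_approx)
```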

Case V: Rising Top

Sometimes a top becomes upright due to the couple produced by the frictional force acting
on the hinge of the top. This couple has a vertical component which is responsible for
causing an extra precession of the symmetry axis towards the vertical. The top, therefore,
gradually becomes upright.

The effect of friction is twofold: the friction with the air, which is responsible for slowing
the spin down, and that due to the contact of the peg with the ground, which makes the top
rise towards the vertical. We study these two effects separately.

(i) The effect of the forces of friction with the air over the spinning surface of the top:
Assuming that the top is spinning about its centre of mass and that the force of friction
is linear in velocity (Stokes' law: $\mathbf{f} = -6\pi\eta r\mathbf{v}$), the couple provided due to the force of
friction with air is

$$\boldsymbol{\Gamma}_a = \sum_{\text{surface}} \mathbf{r} \times \mathbf{f} = -\lambda \sum_{\text{surface}} \mathbf{r} \times (\boldsymbol{\omega} \times \mathbf{r}) = -\lambda \sum_{\text{surface}} r^2\boldsymbol{\omega} + \lambda \sum_{\text{surface}} (\mathbf{r}\cdot\boldsymbol{\omega})\,\mathbf{r} \qquad (12.122)$$

where $\lambda = 6\pi\eta$, $\eta$ being the viscosity of air.

By Euler's third equation, namely

$$\frac{dL_3}{dt} = (\Gamma_a)_3,$$

we can write

$$C\dot{\omega}_3 = -\lambda \sum_{\text{surface}} r^2\omega_3 + \lambda \sum_{\text{surface}} (\mathbf{r}\cdot\boldsymbol{\omega})(\mathbf{r}\cdot\hat{k})$$

$$C\dot{\omega}_3 = -\lambda \sum r^2\omega_3 + \lambda \sum \omega_3(\mathbf{r}\cdot\hat{k})^2 + \lambda \sum (\boldsymbol{\omega}_\perp\cdot\mathbf{r}_\perp)\,r_\parallel$$

Due to the symmetry about the $\hat{k}$-axis all the terms in the last sum will cancel pairwise,



As a result the top rises towards the vertical. This phenomenon is called the rising top.

As the top rises, its potential energy increases at the expense of the rotational kinetic
energy; so the rotational speed decreases. If the top is initially given a very high spin, a
rising top will finally settle as a sleeping top, as in this state $\mathbf{r} \times \mathbf{f}_s$ vanishes and the
precession stops. However, the loss of spin rate due to the other components of frictional
torque continues to act. On the other hand, if the slipping of the top on the ground
stops before the top becomes vertical, the rolling of the top takes over.


12.24 EXAMPLES OF TOPS AND THEIR ANALOGUES 
12.24.1 Tippe Top (Topsy-turvy Top) 

This delightful toy top has the fascinating property that, given sufficient spin about its axis 
of symmetry in the statically stable orientation, it will turn itself upside down and then 
behave like a sleeping top. No matter what the orientation of the top was with respect to 
the initial vertical spin, it will end up standing on its leg. People like Sir William Thomson 
and Niels Bohr were interested in this problem, but the first correct explanation came to 
light in the early 1950’s by C. M. Braams and W. A. Pliskin. 

Tippe top has the shape of a part of a sphere with a small stem added to it (Fig. 12.26). 
When this top is spun on its head (Fig. 12.26a), such that the centre of mass C lies below 
the centre O of the spherical part, it gradually proceeds to flip over and finally it rotates 
on its stem (Fig. 12.26b). If we compare the initial and the final positions of the top, we 
notice that 

1. the CM rises indicating an increase in the P.E. at the final position, 

2. the sense of rotation of the top with respect to an axis (Z axis in the figure) fixed in 
the body of the top changes in the process of flipping, so as to keep the direction of angular 
momentum unchanged. 

The former point implies that there must be a corresponding decrease in the K.E., that
is, a decrease in the vertical component of the angular momentum. The decrease must be
caused by an external vertical torque which can be nothing other than the torque due to
the force of sliding friction (air friction is neglected).

Having argued qualitatively about the cause of toppling, let us now try to get a picture
of how the frictional force actually does the job. To do so, we fix a rectangular Cartesian
coordinate system $XYZ$ in the body of the top with the origin at $C$. Let $Z_0$ be a vertical
axis.

Consider the top at any instant during the motion so that $CX$ is perpendicular to the $ZCP$
plane as shown in Fig. 12.26c. The forces acting on the system are

1. force ($\mathbf{W} = -W\hat{z}_0$) of gravity through $C$, which produces no torque about $C$,

2. force ($\mathbf{F}_n = W\hat{z}_0$) of reaction vertically upward (along $\hat{z}_0$) at the point of contact
($P$). This produces a torque $\mathbf{N}_n$ about $C$ given by,

$$\mathbf{N}_n = \mathbf{r} \times \mathbf{F}_n = (a\hat{z} - R\hat{z}_0) \times (W\hat{z}_0) = -Wa\sin\theta\,\hat{x}$$

3. force ($\mathbf{F}_f$) of friction acting at $P$ opposite to the instantaneous velocity of the top at




the top topples slowly, so that the Eulerian angle $\theta$ changes slowly, that is,

$$\frac{d\theta}{dt} = \omega_x \ll \omega_y,\ \omega_z$$

so that

$$\boldsymbol{\omega} \simeq \omega\hat{z}_0$$

This means

$$\omega_z = \omega\cos\theta \quad\text{and}\quad \omega_y = \omega\sin\theta, \qquad \frac{\omega_z}{\omega_y} = \cot\theta$$

The expression for the frictional torque reveals that

(a) for $\theta < \cos^{-1}(a/R)$, the y-component of the torque is positive. Hence $\omega_y$ increases.
On the other hand $\omega_z$ decreases due to the negative torque. The last equation above implies
that $\theta$ must increase in this case.

(b) for $\theta > \cos^{-1}(a/R)$, $\omega_y$ decreases, but in practical cases $\omega_z$ decreases at a faster
rate than $\omega_y$. So, the angle $\theta$ still goes on increasing until the stem touches the ground.

As soon as the stem touches the ground, the force of reaction on the stem gradually
increases, whereas it decreases on the head. Ultimately, the top rests on the stem. A similar
analysis, however, can easily show that the top should now rise on its stem (exactly like a
rising top) provided it has sufficient kinetic energy.


12.24.2 Wobbling of the Christmas Tree Toy 

A hollow cone-shaped Christmas tree is divided into four separate vertical sections, each 
section hinged at the bottom to a plastic circular base and held together with a single rubber 
band connected internally to each section (see Fig. 12.27). The tree is set in rotation about 
its axis of symmetry. All these tree sections widen out about the hinge point at the bottom, 
due to centrifugal action of rotation, and as a result a hiding Santa Claus becomes visible 
from within. Eventually as the rotation slows down, the tree sections begin to close, but 
interestingly enough, they do not close monotonically; the effect of nutation is distinctly 
visible. During the last stage of closing, the final round of the nutational motion allows 
the tree sections to close completely first and then reopen to complete the nutational cycle, 
letting the Santa Claus out for a brief final view. 


The analysis of this symmetric top motion can be done by the Euler-Lagrange method.
The input expressions are
the kinetic energy $T = 2I(\dot{\theta}^2 + \sin^2\theta\,\dot{\phi}^2)$
the gravitational PE $V_1 = 2MghR\cos\theta$ and
the spring PE $V_2 = 4kb^2\sin^2\theta$

where $I$ is the moment of inertia of each section about the point $O$. One may use the
effective potential for the $\theta$ motion to study the stability of the precessional motion.





Fig. 12.28 Longitudinal and transverse sections of a boomerang 

of curvature. So one can combine two straight boomerangs into one X- or V-shaped one 
and decrease the radius of curvature of its path by a factor of two. 

12.24.4 Manoeuvre of the Motion of a Motorcycle 

The wheels of a motorcycle at high speed may be regarded as two fast-rotating flywheels
with their axes of rotation pointing horizontally to the left of the rider. Let us now work
out the sequence of operations that the rider has to follow in order to take a left turn. From
experience, we know that it involves two steps. First, the handle bars are twisted
clockwise, that is, to the right (the wrong way!) in order to make the vehicle incline to the
left; the vehicle then automatically begins to take the left turn on the road. The rider allows
it to continue till the entire curve is negotiated, just by keeping the handle bars in the normal
position. Second, when the left turn is over, the vehicle is to be returned to its upright
posture, for which a leftward twist on the handle bars (again the wrong way!) is applied.

The initial twist to the right produces a vertically downward couple acting on the flywheel. 
Since the initial angular momentum was to the left (horizontal) of the rider, the tip of L 
moves a bit vertically downward. This makes the vehicle lean towards the left. As the vehicle 
leans towards the left, the gravitational couple begins to act on the system. The direction 
of the gravitational couple is in the backward (horizontal) direction of the instantaneous 
motion. Hence the tip of L must change in that direction, so that the plane of the motorcycle 
wheels keeps on turning to the left. This is how the left turn is accomplished by the vehicle. 
Once the negotiation of the curve is over, the vehicle has to be made upright for which the 
tip of the L vector is to be pushed upward. This can be effected simply by exerting a 
leftward twist on the handle bars. 


12.25 FORCED PRECESSION OF THE EARTH'S AXIS OF ROTATION

As the flattened body of the earth revolves around the sun in a fixed plane called the ecliptic,
with its axis of rotation inclined to the ecliptic by 66° 34', the equatorial bulge of the earth
experiences a couple that acts perpendicular to the plane formed by the earth's axis of
rotation and the normal to the plane of the ecliptic. Such a torque results in a slow but
steady precession of the earth's axis of rotation (about the normal to the ecliptic), keeping,
of course, the angle between the axis of rotation of the earth and the plane of the ecliptic
constant. The orbit of the moon also lies very close to the plane of the ecliptic. The moon
also exerts a similar torque, but of somewhat greater magnitude than that of the sun.



Fig. 12.29 Geometrical construction for finding the gravitational potential of the oblate- 
spheroid-shaped earth due to a point mass located outside 


In order to study this effect, one must first know the potential energy that the oblate
earth acquires due to any distant point mass at, say, $P(x, y, z)$ with respect to the origin at
the centre of the earth $O$ (see Fig. 12.29). Let the $z$-axis be the symmetry axis of the earth
and $P'$ be any arbitrary point inside the earth having position vector $\mathbf{r}'$. From Fig. 12.29,
we see that

$$\mathbf{r} = \mathbf{R} - \mathbf{r}' \quad\text{and}\quad |\mathbf{r}| = \sqrt{R^2 + r'^2 - 2\mathbf{r}'\cdot\mathbf{R}} \qquad (12.125)$$

The potential energy of the earth due to a point mass $M$ at $P$ is therefore

$$V(\mathbf{R}) = -GM \iiint \frac{\rho(x', y', z')}{|\mathbf{r}|}\,dx'\,dy'\,dz' \qquad (12.126)$$

The integral in Eq. (12.126) is to be taken over the total volume of the earth. To proceed
further, we expand $|\mathbf{r}|^{-1} = |\mathbf{R} - \mathbf{r}'|^{-1}$ in a Taylor series

$$\frac{1}{|\mathbf{r}|} = \frac{1}{|\mathbf{R} - \mathbf{r}'|} = \frac{1}{R}\left[1 + \frac{\mathbf{r}'\cdot\mathbf{R}}{R^2} + \frac{3(\mathbf{r}'\cdot\mathbf{R})^2 - r'^2R^2}{2R^4} + \cdots\right] \qquad (12.127)$$
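
The accuracy of the truncated expansion in Eq. (12.127) is easy to check numerically. The following Python sketch compares the exact value of $1/|\mathbf{R} - \mathbf{r}'|$ with the series kept to second order in $r'/R$. The function names and the sample vectors are illustrative assumptions.

```python
import math

def dot(u, v):
    """Scalar product of two 3-vectors given as tuples."""
    return sum(a * b for a, b in zip(u, v))

def inverse_distance_exact(R, rp):
    """Exact 1/|R - r'|."""
    diff = tuple(a - b for a, b in zip(R, rp))
    return 1.0 / math.sqrt(dot(diff, diff))

def inverse_distance_series(R, rp):
    """Right-hand side of Eq. (12.127), truncated after the quadratic term."""
    R2 = dot(R, R)
    d = dot(rp, R)
    rp2 = dot(rp, rp)
    return (1.0 / math.sqrt(R2)) * (
        1.0 + d / R2 + (3.0 * d * d - rp2 * R2) / (2.0 * R2 * R2)
    )

# Illustrative vectors (assumptions): |R| = 130, |r'| of order 2.
R = (30.0, 40.0, 120.0)
rp = (1.0, -2.0, 0.5)
exact = inverse_distance_exact(R, rp)
series = inverse_distance_series(R, rp)
monopole = 1.0 / math.sqrt(dot(R, R))
rel_err_series = abs(series - exact) / exact
rel_err_monopole = abs(monopole - exact) / exact
print(rel_err_series, rel_err_monopole)
```

The residual error of the truncated series is of third order in $r'/R$, which is why the terms beyond the quadratic one can be dropped for a distant perturbing mass.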




Substituting Eq. (12.127) in Eq. (12.126) we get

$$V(\mathbf{R}) = -\frac{GM}{R} \iiint \rho(x', y', z')\left[1 + \frac{1}{R^2}(x'x + y'y + z'z) + \frac{3}{2R^4}(x'^2x^2 + y'^2y^2 + z'^2z^2 + 2x'xy'y + 2z'zx'x + 2y'yz'z) - \frac{r'^2}{2R^2} + \cdots\right] dx'\,dy'\,dz' \qquad (12.128)$$

Assuming that the earth's density distribution is symmetric about the origin, all terms
containing a single $x'$, $y'$ or $z'$ are integrated to zero, as the integrand becomes an odd function
of its arguments. Thus we are left with

$$V(\mathbf{R}) = -\frac{GM}{R} \iiint \rho(x', y', z')\left[1 + \frac{1}{2R^4}\left\{(3x'^2 - r'^2)x^2 + (3y'^2 - r'^2)y^2 + (3z'^2 - r'^2)z^2\right\} + \cdots\right] dx'\,dy'\,dz'$$

$$= -\frac{GMm}{R} - \frac{GM}{2R^5}(I_3 - I_1)(R^2 - 3z^2) + \cdots \qquad (12.129)$$

Here we have taken $I_1 = I_2$ and $m$ as the mass of the earth.

Now, with respect to an observer on the earth, the sun is moving in a nearly circular
orbit fast enough compared to the precessional rate of the earth's rotation axis that we are
interested in. Therefore, the sun may be treated as a ring distribution of mass around the
earth. The sun's angular frequency of revolution around the earth is

$$n = \sqrt{\frac{GM}{R^3}} \qquad (12.130)$$

From Fig. 12.30, we see that the instantaneous position of the sun is given by

$$x = R\cos(nt + \phi)$$
$$y = -R\cos\theta\,\sin(nt + \phi) \qquad (12.131)$$
$$z = R\sin\theta\,\sin(nt + \phi)$$

Substituting Eq. (12.131) into Eq. (12.129) we get

$$V(\mathbf{R}) = -\frac{GMm}{R} - \frac{GM}{2R^3}(I_3 - I_1)\left[1 - 3\sin^2(\phi + nt)\sin^2\theta\right] \qquad (12.132)$$

If we now take a time average of $V(\mathbf{R})$ over a sufficiently long period of time compared
to $2\pi/n$, the factor $\sin^2(\phi + nt)$ averages to 1/2, so that

$$\langle V(\mathbf{R})\rangle = -\frac{GMm}{R} - \frac{GM}{2R^3}(I_3 - I_1)\left[1 - \frac{3}{2}\sin^2\theta\right] \qquad (12.133)$$

The torque on the earth resulting from $\langle V(\mathbf{R})\rangle$ in Eq. (12.133) is

$$\Gamma_x = \left[\mathbf{R} \times (-\nabla V(\mathbf{R}))\right]_x = -\frac{3GM}{2R^3}(I_3 - I_1)\sin\theta\cos\theta \qquad (12.134)$$




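having neglected the first term, which does not produce any precession. Now the Lagrangian
for the earth's motion is

$$L = \frac{1}{2}\left[I_1(\dot{\theta}^2 + \sin^2\theta\,\dot{\phi}^2) + I_3(\dot{\phi}\cos\theta + \dot{\psi})^2\right] + \frac{GM}{2R^3}(I_3 - I_1)\left(1 - \frac{3}{2}\sin^2\theta\right) + \frac{GMm}{R} \qquad (12.135)$$

Since $\phi$ and $\psi$ are cyclic,

$$p_\phi = \frac{\partial L}{\partial\dot{\phi}} = (I_1\sin^2\theta + I_3\cos^2\theta)\dot{\phi} + I_3\cos\theta\,\dot{\psi} = \text{const.}$$

$$p_\psi = \frac{\partial L}{\partial\dot{\psi}} = I_3(\dot{\phi}\cos\theta + \dot{\psi}) = \text{const.} \qquad (12.136)$$

Since $\dot{\phi} \ll \dot{\psi} = \omega$ = the angular velocity of the earth due to its diurnal rotation, we have

$$p_\phi = I_3\omega\cos\theta \quad\text{and}\quad p_\psi = I_3\omega \qquad (12.137)$$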

implying that 


Using Eq. (12.135) and neglecting $\dot{\phi}$ in comparison with $\dot{\psi}$, the equation of motion in $\theta$
reads as

$$I_1\ddot{\theta} + I_3\omega\dot{\phi}\sin\theta + \frac{3GM}{2R^3}(I_3 - I_1)\sin\theta\cos\theta = 0 \qquad (12.138)$$

If the precession is uniform with $\dot{\phi}$ = constant, then $\ddot{\theta} = 0$ so that Eq. (12.138) becomes

$$\dot{\phi} = -\frac{3GM}{2R^3}\,\frac{I_3 - I_1}{I_3}\,\frac{\cos\theta}{\omega} = -\frac{3}{2}\,\Omega_{\text{sun}}^2\left(\frac{I_3 - I_1}{I_3}\right)\frac{\cos\theta}{\omega_{\text{earth}}}$$

where, as seen from Fig. 12.30, $\theta$ is the obliquity of the ecliptic, and $\Omega_{\text{sun}}$ = the angular
velocity of the apparent revolution of the sun around the earth.

For the total precessional angular velocity of the earth's axis of rotation, we get (since
the moon is also almost on the ecliptic)

* = *». + + «Lo.) (^-^ L ) ^ < 12 - 139 ) 

This gives a steady precessional rate of 50.29 arc seconds per year and is retrograde in
nature. Thus at this rate one complete revolution of the equinox along the ecliptic takes
about 25,800 years. The earth's axis of rotation does a conical precession in space with
respect to the stars and completes one revolution in the same number of years, while the
plane of the ecliptic remains almost fixed except for small perturbations due to Jupiter.
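
An order-of-magnitude check of the quoted rate can be made with a few lines of Python. This is a rough sketch under stated assumptions, not the book's calculation: it takes $(I_3 - I_1)/I_3 \approx 0.0032738$, an obliquity of 23.44°, and, for the moon, assumes that $GM_{\text{moon}}/R_{\text{moon}}^3$ equals $\Omega_{\text{moon}}^2 M_{\text{moon}}/(M_{\text{earth}} + M_{\text{moon}})$ from Kepler's third law with a moon-to-earth mass ratio of 0.0123. The inclination of the lunar orbit is ignored. All names and constants below are assumptions introduced for this illustration.

```python
import math

ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi

# Assumed constants for the illustration.
DYNAMICAL_ELLIPTICITY = 0.0032738        # (I3 - I1) / I3 of the earth
OBLIQUITY = math.radians(23.44)
SIDEREAL_YEAR_DAYS = 365.25636
SIDEREAL_MONTH_DAYS = 27.32166
ROTATIONS_PER_YEAR = 366.25636           # sidereal days in one sidereal year
MOON_TO_EARTH_MASS = 0.0123

def precession_rates_arcsec_per_year():
    """Magnitudes of the solar and lunar precession rates, arc seconds per year."""
    n_sun = 2.0 * math.pi                               # rad per year
    omega_earth = 2.0 * math.pi * ROTATIONS_PER_YEAR    # rad per year
    common = 1.5 * DYNAMICAL_ELLIPTICITY * math.cos(OBLIQUITY) / omega_earth
    sun = common * n_sun ** 2
    n_moon = n_sun * SIDEREAL_YEAR_DAYS / SIDEREAL_MONTH_DAYS
    mass_factor = MOON_TO_EARTH_MASS / (1.0 + MOON_TO_EARTH_MASS)
    moon = common * mass_factor * n_moon ** 2
    return sun * ARCSEC_PER_RADIAN, moon * ARCSEC_PER_RADIAN

sun_rate, moon_rate = precession_rates_arcsec_per_year()
total_rate = sun_rate + moon_rate
period_years = 360.0 * 3600.0 / total_rate
print(sun_rate, moon_rate, total_rate, period_years)
```

With these inputs the lunar contribution comes out roughly twice the solar one, and the total is close to the rate quoted above.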

Since the moon does not lie exactly on the ecliptic, it sets in a nutational motion in $\theta$ of
amplitude 9.3 arc seconds over a period of about 18.6 years. This periodicity matches with
the period of revolution of the node of the moon's orbit around the earth (see Fig. 12.31).



Fig. 12.31 Forced precession and nutation of the earth’s axis 
of rotation 




12.26 FOUCAULT’S GYROSCOPE 

A gyroscope is a heavy symmetrical top spinning very fast about its axis of symmetry, the 
top being mounted on a set of mutually rotating frames so as to allow the axis of rotation 
of the top to have any arbitrary orientation in space. Circular frames are hierarchically 
pivoted in such a way that the pivots on the fixed frame enable the whole system internal 
to it (top1, top2, top3) to be able to freely rotate about the vertical line. Then the inner
frame which is pivoted on top1 (see Fig. 12.32) allows top2 and top3 to rotate freely about
a horizontal axis, called the nodal line. Finally, top3 pivoted to top2 allows the former to
rotate about the nodal line. Top3 is a symmetric flywheel or a symmetric top which can
spin about a direction perpendicular to the nodal line. Hence the flywheel can execute all
the angular motions corresponding to the three Eulerian angles $\phi$, $\theta$ and $\psi$. A gyroscope
in its steady operation, that is, its flywheel spinning very fast and at a constant rate $\dot{\psi}$, has
got the property that the spin axis always points towards a fixed direction in space even 
if the entire set up of the gyroscope is allowed to move slowly, in any manner one likes. 
This property is a consequence of the fact that the spin axis of the flywheel coincides with 
a principal axis of the moment of inertia tensor of the flywheel about its centre of mass. 

The famous French experimentalist Léon Foucault used this property of the gyroscope
in order to demonstrate the rotation of the earth. The precessing simple pendulum under 
the action of the Coriolis force due to earth’s rotation was the first experiment of its kind 
to demonstrate the rotation of the earth in a laboratory in 1851. This classic experiment 
was reasonably successful, and was performed at the Pantheon in Paris, and also in the 
cathedrals of Amiens and Rheims. Very soon Foucault devised a second method (in 1852) 
based on the principle of a gyrostat which was not an immediate success though. It had, 
nevertheless, the potential of demonstrating the rotation of the earth experimentally in just 
a few minutes as opposed to his day long ordinary pendulum experiment. Not only would 
one save time this way but also reduce the size of the instrument and eliminate the problem 
of maintenance of free motion for long enough periods. 

Foucault's gyroscope can have two versions, of which only one is described here. It is
simply a fast spinning symmetric top, the axis of which is constrained to move in a vertical
plane passing through the local meridian (that is, through the zenith and the poles). At
any instant, its spin axis (the z-axis) makes an angle $\theta$ with the earth's axis of rotation.
We define the x-axis to be in the vertical plane, perpendicular to the z-axis, and the y-axis
perpendicular to both the x and z axes so that the y-axis can point in the east direction
(see Fig. 12.33). On the surface of the earth this triad x-y-z is a rotating frame rotating
with an angular velocity $\boldsymbol{\Omega}'$ which is close to that of the earth. This rotating frame is not
the body frame of the flywheel. However, Euler's equations of motion are still applicable to
this (first) frame of reference where $\boldsymbol{\omega}$ has now to be replaced by $\boldsymbol{\Omega}'$ in Eq. (12.48), as
$\boldsymbol{\Omega}'$ is the rotational angular velocity in space (see last paragraph of section 12.10). Such a
replacement of $\boldsymbol{\omega}$ by $\boldsymbol{\Omega}'$ can work simply because, as the flywheel rotates about its own axis,
having a geometrical symmetry about it, the principal moments of inertia $A$ and $B$ remain
identical for any frame arbitrarily rotating about the symmetry axis. In this modified form,




Fig. 12.33 Orientation of the axis of rotation of Foucault's gyroscope (axes labelled X and Y (East))

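and the modified Euler’s equations become 

$$T_x = -A\Omega\cos\theta\,\dot\theta + C\omega_0\dot\theta - A\Omega\cos\theta\,\dot\theta$$

$$T_y = A\ddot\theta - A\Omega^2\sin\theta\cos\theta + C\Omega\omega_0\sin\theta$$

$$T_z = C\dot\omega_0$$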


Now this gyroscope is constrained to move in the vertical plane. This constraint on the 
spin axis z produces a couple only in the x direction, hence $T_y = T_z = 0$, giving 

$$\omega_0 = \text{const.}$$

and 

$$\ddot\theta + \frac{C}{A}\,\Omega\omega_0\sin\theta - \Omega^2\sin\theta\cos\theta = 0$$

The $\Omega^2$ term is negligibly small compared to the $\omega_0\Omega$ term. Hence 

$$\ddot\theta + \frac{C}{A}\,\Omega\omega_0\sin\theta = 0$$

For small $\theta$ this equation gives the motion of an SHM so that the gyroscope will perform 
oscillations about a line parallel to the earth’s axis of rotation with a period 

$$T = 2\pi\sqrt{\frac{A}{C\,\Omega\,\omega_0}}$$
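As a rough numerical illustration of the small-oscillation period $T = 2\pi\sqrt{A/(C\Omega\omega_0)}$, the following Python sketch evaluates it for a hypothetical flywheel. The spin rate of 200 revolutions per second and the thin-disc ratio C/A = 2 are assumptions chosen only for illustration; they are not values from the text.

```python
import math

def gyro_period(c_over_a, spin_rate, earth_rate):
    """Small-oscillation period T = 2*pi*sqrt(A / (C * Omega * omega_0)) in seconds."""
    return 2.0 * math.pi * math.sqrt(1.0 / (c_over_a * earth_rate * spin_rate))

# Sidereal rotation rate of the earth in rad/s (one turn in about 86164 s).
OMEGA_EARTH = 2.0 * math.pi / 86164.0

# Hypothetical flywheel: thin disc (C = 2A) spinning at 200 rev/s.
omega_0 = 2.0 * math.pi * 200.0
T = gyro_period(2.0, omega_0, OMEGA_EARTH)
print(f"oscillation period = {T:.1f} s")
```

Under these assumed values the period comes out at some tens of seconds, which is the sense in which the gyroscope shortens Foucault's day-long pendulum experiment.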

If $\theta = 0$, that is, if the axis of the gyroscope coincides with that of the earth, there will be 
no oscillations and the axis of the gyroscope will point steadily towards the earth’s axis of 





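Therefore, 

$$\frac{\partial \mathbf{L}'}{\partial \dot q_s} = \sum m\left(\mathbf{r}\times\frac{\partial \dot{\mathbf{r}}}{\partial \dot q_s}\right) = \sum m\left(\mathbf{r}\times\frac{\partial \mathbf{r}}{\partial q_s}\right)$$

$$\frac{d}{dt}\left(\frac{\partial \mathbf{L}'}{\partial \dot q_s}\right) = \sum m\,\dot{\mathbf{r}}\times\frac{\partial \mathbf{r}}{\partial q_s} + \sum m\,\mathbf{r}\times\frac{d}{dt}\left(\frac{\partial \mathbf{r}}{\partial q_s}\right) = \sum m\,\dot{\mathbf{r}}\times\frac{\partial \mathbf{r}}{\partial q_s} + \sum m\,\mathbf{r}\times\frac{\partial \dot{\mathbf{r}}}{\partial q_s}$$

and 

$$\frac{\partial \mathbf{L}'}{\partial q_s} = \sum m\,\frac{\partial \mathbf{r}}{\partial q_s}\times\dot{\mathbf{r}} + \sum m\,\mathbf{r}\times\frac{\partial \dot{\mathbf{r}}}{\partial q_s}$$

thus giving 

$$\frac{d}{dt}\left(\frac{\partial \mathbf{L}'}{\partial \dot q_s}\right) - \frac{\partial \mathbf{L}'}{\partial q_s} = 2\sum m\,\dot{\mathbf{r}}\times\frac{\partial \mathbf{r}}{\partial q_s} = 2\sum m\left(\frac{\partial \mathbf{r}}{\partial q_p}\times\frac{\partial \mathbf{r}}{\partial q_s}\right)\dot q_p = \mathbf{B}_{ps}\,\dot q_p$$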

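Clearly from the above definition of $\mathbf{B}_{ps}$, it is an antisymmetric (axial) vector. For constant 
$\boldsymbol{\Omega}$, the Lagrangian equations of motion 

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot q_s}\right) - \frac{\partial T}{\partial q_s} = -\frac{\partial V}{\partial q_s} + Q'_s$$

in terms of the quantities defined in the rotating frame for s = 1, ..., n − 1 generalised 
coordinates take the form 

$$\frac{d}{dt}\left(\frac{\partial T'}{\partial \dot q_s}\right) - \frac{\partial T'}{\partial q_s} + \boldsymbol{\Omega}\cdot\left[\frac{d}{dt}\left(\frac{\partial \mathbf{L}'}{\partial \dot q_s}\right) - \frac{\partial \mathbf{L}'}{\partial q_s}\right] - \frac{1}{2}\frac{\partial}{\partial q_s}\left(I_{ij}\Omega_i\Omega_j\right) = -\frac{\partial V}{\partial q_s} + Q'_s$$

or 

$$\frac{d}{dt}\left(\frac{\partial T'}{\partial \dot q_s}\right) - \frac{\partial T'}{\partial q_s} + (\boldsymbol{\Omega}\cdot\mathbf{B}_{ps})\,\dot q_p = -\frac{\partial}{\partial q_s}\left(V - \frac{1}{2}I_{ij}\Omega_i\Omega_j\right) + Q'_s \qquad (12.145)$$

where $V = V(q_s)$ is the ordinary potential energy due to externally applied conservative 
forces and $Q'_s$ are nonpotential force components, if any. 

So these equations of motion differ from the case with $\boldsymbol{\Omega} = 0$ in two respects: first by 
the presence of the gyroscopic terms $(\boldsymbol{\Omega}\cdot\mathbf{B}_{ps})\,\dot q_p$ and second by $V(q_s) - \frac{1}{2}I_{ij}\Omega_i\Omega_j$ 
replacing the potential energy function $V(q_s)$. 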


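The equation of motion for the nth generalised coordinate, that is $q_n$, is given by 

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot q_n}\right) - \frac{\partial T}{\partial q_n} = G \quad\text{giving}\quad \frac{dL_n}{dt} = G \qquad (12.146)$$

where G is the couple needed to maintain the constant angular velocity $\boldsymbol{\Omega}$. 

When the system is in equilibrium, that is, $\dot q_1 = \ldots = \dot q_{n-1} = 0$, 

$$\frac{\partial}{\partial q_s}\left(V - \frac{1}{2}I_{ij}\Omega_i\Omega_j\right) = Q'_s$$

and if there are no nonpotential forces, the condition for stable equilibrium in a uniformly 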




for head and neck : trunk : hands : legs ratios for an average male athlete are 0.09 : 0.33 : 
0.20 : 0.38 for the surface area, and 0.07 : 0.57 : 0.09 : 0.27 for the mass; whereas for female 
athletes these ratios are 0.09 : 0.31 : 0.17 : 0.43 and 0.07 : 0.52 : 0.07 : 0.34 respectively. 
For the total surface area, the formula is the same for women except for the constant for 
normalisation which replaces 30 kg by 33 kg. 

(iii) Principal moments of inertia of the body about its centre of mass ($I_{xx}$, $I_{yy}$, $I_{zz}$) for 
different symmetric configurations. The body frame axes x, y, z are defined as follows: 
x-axis: back to front (horizontal forward) about which cartwheeling is performed 
y-axis: right to left (horizontal sidewise) about which somersault is performed 
z-axis: bottom to top (vertically upward) about which twist is performed. 

Table 12.3 shows the principal moments of inertia of an average athlete of 60 kg body 
weight. 


Table 12.3 Moments of Inertia of Human Body under Different Configurations 

Configuration     Layout,         Layout,          Layout,     Layout,          Relaxed
                  arms at sides   arms overhead    arms out    twists thrown
-----------------------------------------------------------------------------------------
I_xx (kg m^2)     13.5            17.9             16.6        15.0             10.8
I_yy (kg m^2)     12.0            16.0             13.3        13.5             10.3
I_zz (kg m^2)      1.5             1.5              3.5         1.3              4.4


(iv) Body’s rate of expenditure of energy (K in Cal/hr/kg of bodyweight). There are 
basically two types of food as well as replenishable material in our body that are capable of 
delivering energy to or out from the body. One is fat (oily substances) with its calorie value 
7700 Cal/kg (1 Cal = 1000 cal = 4200 J) and the other is carbohydrate with its calorie value 
3500 Cal/kg. It means that if you eat 100 gm of any oil/fat or 220 gm of sugar/carbohydrate 
you gain about 770 Cal, and that if you work physically worth 770 Cal you will lose 220 gm 
of body’s sugar content or 100 gm of body’s stored fat in case sugar is not readily available. 
Here is a table for the energy expenditure rate ( K ) for various activities in Cal/hr/kg of 
bodyweight (see Table 12.4). 
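The bookkeeping implied by the rate K can be sketched in a few lines of Python. The K values below are copied from a few entries of Table 12.4 and the calorie value of fat is the 7700 Cal/kg quoted above; the 60 kg body mass and the daily schedule are hypothetical and serve only as an illustration.

```python
# Energy expenditure rates K in Cal/hr/kg of bodyweight (a few entries of Table 12.4).
K = {
    "sleeping": 1.00,
    "sitting still": 1.30,
    "walking (4.5 km/hr)": 3.3,
    "going up steps": 15.0,
}

FAT_CAL_PER_KG = 7700.0  # calorie value of fat quoted in the text


def energy_spent(body_mass_kg, hours_by_activity):
    """Total energy in Cal for a dict {activity: hours}."""
    return body_mass_kg * sum(K[name] * hours for name, hours in hours_by_activity.items())


def fat_equivalent_kg(energy_cal):
    """Mass of stored fat that would supply the given energy."""
    return energy_cal / FAT_CAL_PER_KG


# Hypothetical day for a 60 kg person.
day = {"sleeping": 8.0, "sitting still": 14.0, "walking (4.5 km/hr)": 2.0}
total = energy_spent(60.0, day)
print(f"{total:.0f} Cal, equivalent to {1000.0 * fat_equivalent_kg(total):.0f} g of fat")
```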


Table 12.4 Power Consumption in various Human Activities 

Slow Activities        K (Cal/hr/kg)    Fast Activities             K (Cal/hr/kg)
---------------------------------------------------------------------------------
Sleeping               1.00             Walking (4.5 km/hr)          3.3
Sitting still          1.30             Carpentry                    3.8
Standing relaxed       1.55             Active exercise              4.2
Sewing by hand         1.65             Fast walking (6.5 km/hr)     4.4
Dressing/undressing    1.75             Going down steps             5.0
Singing                1.85             Loading heavy objects        5.5
Typewriting            2.00             Heavy exercise               6.0
Washing dishes         2.10             Tennis play/swimming         7.2
Sweeping               2.20             Very heavy exercise          9.0
Light exercise         2.75             Going up steps              15.

In total, a typically hard-working person accounts for a daily expenditure rate of energy 
of about 30 Cal/kg of bodyweight on various items listed in the above table. The other 
major losses are due to required supply of heat of evaporation of sweat and the heat loss of 
the body to the surroundings in order to maintain the temperature difference between the 
body and the immediate surrounding. 

12.28.2 Strolling or Leisurely Walking 

When one walks in a leisurely fashion, one does not exert much effort consciously. The 
process of such walking can be approximated to the natural pendulum-like oscillations of 
the legs about the respective hip joints. If the centre of mass of the legs of length $L$ lies 
a distance $d$ below the hip joint and the two legs make a maximum angle $2\theta$ at the apex 
during walking, the half period of oscillation of the legs is simply 

$$T_0 = \pi\sqrt{\frac{d}{g}} \qquad (12.147)$$



and during this time, the person moves through a distance $x_0$, the length of stride, 

$$x_0 = 2L\sin\theta \qquad (12.148)$$

Thus the average speed of walking leisurely is 

$$v_0 = \frac{x_0}{T_0} = \frac{2L\sin\theta}{\pi}\sqrt{\frac{g}{d}} = \frac{2}{\pi}\sqrt{\frac{gL}{\lambda}}\,\sin\theta \qquad (12.149)$$

where $\lambda$ is defined through $d = \lambda L$, $\lambda < 0.5$. 

Therefore, (i) the longer the legs the faster is the speed of walking, (ii) the bigger the 
stride the faster is the speed, (iii) the speed is independent of the weight of the body, and 
(iv) walking on the surface of the moon will be about 2.5 times slower than that on earth, 
because of the reduced $g$-factor. 

Taking $\lambda = 0.4$, $L = 0.8$ m, and small $\theta$, one gets 

$$v_0 = 2.8\,\theta \ \text{m/s} \simeq 10\,\theta \ \text{km/hr}$$

If $\theta \simeq 15°$, $v_0 \simeq 2.6$ km/hr. 
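The numbers quoted for leisurely walking can be checked with a short Python sketch. It assumes, consistently with the numerical values in the text, that the half period is that of a simple pendulum of length d, $T_0 = \pi\sqrt{d/g}$; the function name and the value g/6 used for lunar gravity are illustrative assumptions.

```python
import math

def stroll_speed(leg_length, lam, theta, g=9.8):
    """Average strolling speed x_0 / T_0 in m/s, with d = lam * leg_length."""
    d = lam * leg_length
    half_period = math.pi * math.sqrt(d / g)      # T_0
    stride = 2.0 * leg_length * math.sin(theta)   # x_0, Eq. (12.148)
    return stride / half_period

v_earth = stroll_speed(0.8, 0.4, math.radians(15.0))
v_moon = stroll_speed(0.8, 0.4, math.radians(15.0), g=9.8 / 6.0)
print(f"earth: {3.6 * v_earth:.2f} km/hr, slower on the moon by a factor {v_earth / v_moon:.2f}")
```

The ratio of the two speeds is simply $\sqrt{6}$, which is the factor of about 2.5 quoted in item (iv).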

Now when the two legs are farthest apart ($2\theta$), the centre of gravity of the body is lowered 




joints vary alternately between zero and a maximum value of $I\omega_0^2/2$, where $I$ is the average 
MI of the leg about either end and $\omega_0$ is the maximum angular velocity of both the legs in 
each stride. The constant supply of this energy comes from the work done by the thrust of 
reaction acting on the forward leg that rests on the ground and bends at the knee, and the 
force acts over a push-off length $s$ of the bent leg. If the maximum force available in the 
form of the thrust of reaction from the ground is $\mu$ times the body’s weight $Mg$, 
the energy equation simply gives 

$$2\times\frac{1}{2}I\omega_0^2 \le \mu M g s \qquad (12.152)$$

where $\omega_0$ is given by 

$$\omega_0 = \frac{v_0}{L}$$

$v_0$ being the maximum speed of the runner and $L$ being the length of each leg. 

Here the uncertain parameters are $I$, $\mu$ and $s$. However, one can perhaps take 

$$I = \frac{1}{3}m_l L^2, \qquad \mu = 1.5 \qquad\text{and}\qquad \frac{M}{m_l} = 7 - 7.5$$

where $m_l$ is the mass of each leg. For an estimate of $s$, which is approximately the separation 
between a fully extended leg and a bent one, or the distance through which the CM of the 
body moves while the leg is on the ground, one can assume the location of the knee to be 
at the middle point of the leg and the maximum bending of the leg at the knee is as large 
as 90°, giving 

$$s \le \frac{2-\sqrt{2}}{2}\,L \simeq 0.3\,L$$

So finally we get, for an average athlete, 

$$v_0 = \sqrt{3\left(1-\frac{1}{\sqrt{2}}\right)\mu g L\left(\frac{M}{m_l}\right)} \simeq 8.7 \ \text{m/s} \qquad (12.153)$$

In order to increase the value of $v_0$ further, a professional runner must have longer ($L \simeq 1$ 
m) but lighter legs ($M/m_l \simeq 8$), relatively longer forelegs compared to thigh (for achieving 
larger value of $s$) and be strong enough to produce larger thrust ($\mu \simeq 1.5$) so that 

$$v_0 \simeq 10.2 \ \text{m/s}$$

or in other words, for a 100 m run, the runner would take about 9.8 s. 
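A short Python sketch of Eq. (12.153) reproduces both estimates. The leg length of 0.8 m for the average athlete is an assumption carried over from the walking example (the text does not restate it here), and the mass ratio 7.25 is simply the midpoint of the quoted range 7 to 7.5.

```python
import math

def max_running_speed(mu, leg_length, mass_ratio, g=9.8):
    """Maximum running speed from Eq. (12.153); mass_ratio is M/m_l."""
    bend_factor = 1.0 - 1.0 / math.sqrt(2.0)   # s/L for a knee bent through 90 degrees
    return math.sqrt(3.0 * bend_factor * mu * g * leg_length * mass_ratio)

v_average = max_running_speed(mu=1.5, leg_length=0.8, mass_ratio=7.25)
v_professional = max_running_speed(mu=1.5, leg_length=1.0, mass_ratio=8.0)
print(f"average athlete: {v_average:.1f} m/s")
print(f"professional: {v_professional:.1f} m/s, 100 m in {100.0 / v_professional:.1f} s")
```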

However, using certain drugs such as steroids, one can illegally increase the strength 
factor $\mu$, which was the reason for Mr. Ben Johnson’s earning notoriety in the 100 m men’s 
running event, in the 1988 Olympics. 

The work done by the legs during each stride is $W_s = \mu M g s$, and from Eq. (12.153) the final kinetic 
energy of the runner becomes 

$$\frac{1}{2}Mv_0^2 = \frac{3}{2}\left(\frac{M}{m_l}\right)W_s \qquad (12.154)$$

It means that the runner can accelerate himself/herself to his/her maximum speed of running 
at the end of the first 10 to 12 strides. This is universally true for all runners, amateur or 
professional, as it depends simply on the ratio of $M$ and $m_l$. 


12.28.5 Maximum Range of Long Jump 

We have seen that at the end of about a dozen strides, the athlete gains the maximum speed 
of horizontal running. For a long jump (also called broad jump) the runner has to throw 
himself off the ground with an angle of elevation, say $\theta_l$, in order to increase the range of his 
jump. In the absence of any drag, the horizontal range of the runner as a projectile is 

$$R_l = \frac{V_0^2}{g}\sin 2\theta_l \qquad (12.155)$$

where $V_0$ is the speed just before taking off the ground. Obviously, $R_l$ is maximum for 
$\theta_l = \pi/4$, giving $R_l = V_0^2/g$. However note that $dR_l/d\theta_l = 0$ for $\theta_l = \pi/4$, so 
the value of $R_l$ near $\theta_l = \pi/4$ is relatively insensitive to $\theta_l$. For a range 
of $\theta_l$ between 40° and 50°, the variation of $R_l$ remains within about 2%. 

Putting $V_0 = 10$ m/s, $R_l = 10$ m, and even if $\theta_l = 30°$, $R_l$ reduces only to 8.7 m. 

It seems that the best runner would also succeed as the best champion of broad jump. 
But this is not so simple. The art of this game is to produce the right take off angle which 
should be no less than 30°. The entire linear momentum of the body has to change its 
direction during the last stride only. It requires a special technique so that the last push 
off becomes sufficiently long. The vertical component of the momentum generated in the 
final push off has to be comparable to its horizontal component, the one that has been 
achieved in about 10 strides. Usually the runner lowers the level of his/her CG and thrusts 
himself/herself up and the duration of the last push $\tau$ is made to be the longest possible 
one. 

Now, after $n$ ($n < 12$) strides on the ground before the final one, the horizontal speed 
becomes 

$$v_\parallel = \sqrt{2n\mu g s} = \sqrt{(2-\sqrt{2})\,n\mu g L}$$

During the last stride the gain in the vertical component of the velocity can be approximated 
to 

$$v_\perp = \mu g\tau = \mu g\,\frac{\beta L}{v_\parallel} \qquad (12.156)$$

where $\beta$, defined through the above relation, corresponds to an effective length of the final 
stride. We can treat $\beta$ as a constant only if $v_\parallel \gg 0$, which is true for broad jumps but 
not for high jumps. Since $\tan\theta_l = v_\perp/v_\parallel$ and $V_0^2 = v_\parallel^2 + v_\perp^2$, we can substitute it back 
in Eq. (12.155), and get 

$$R_l = 2\mu\beta L$$

and 

$$\theta_l = \tan^{-1}\left[\frac{\beta}{(2-\sqrt{2})\,n}\right] \qquad (12.157)$$


The extra parameter $\beta$ plays a crucial role in determining $R_l$ apart from $\mu$ and $L$. The 
current world record of long jump $R_l \simeq 8.9$ m requires a value of $\beta \simeq 3$, and hence 
$\theta_l \simeq 27°$ for $n \simeq 10$. 
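The long jump estimate can be cross-checked numerically. The sketch below builds the take-off velocity from its two components and feeds it into the projectile range of Eq. (12.155). It assumes that the horizontal speed after n strides follows from $n\mu Mgs = \frac{1}{2}Mv_\parallel^2$ with $s = (2-\sqrt{2})L/2$, and it uses $\mu = 1.5$ and $L = 1$ m, the professional values quoted in the running subsection.

```python
import math

def long_jump(mu, beta, leg_length, n_strides, g=9.8):
    """Return (range in m, take-off angle in degrees) for the long jump model."""
    v_par = math.sqrt((2.0 - math.sqrt(2.0)) * n_strides * mu * g * leg_length)
    v_perp = mu * g * beta * leg_length / v_par          # Eq. (12.156)
    theta = math.atan2(v_perp, v_par)
    v0_squared = v_par ** 2 + v_perp ** 2
    jump_range = v0_squared * math.sin(2.0 * theta) / g  # Eq. (12.155)
    return jump_range, math.degrees(theta)

jump_range, angle = long_jump(mu=1.5, beta=3.0, leg_length=1.0, n_strides=10)
print(f"range = {jump_range:.2f} m at a take-off angle of {angle:.1f} degrees")
```

The range returned by the projectile formula coincides with the closed form $2\mu\beta L$, which is a useful consistency check on the algebra.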

12.28.6 Maximum Height of Vertical Jump 

In this game, it is the centre of gravity of the athlete that has to be lifted as high as possible 
by generating the maximum amount of the vertical component of the impulse using the 
maximum possible thrust received from the ground. The separation between the lifted CG 
and the reference horizontal bar ($h'$) is also to be minimised. We know that the CG of the 
upright body lies inside the body, but if the body is given a shape of an inverted ‘V’, the CG 
will not only come out of the body but also remain in a low position. It is known that one 
can topple over a horizontal fencing keeping the CG all the time slightly below the upper 
boundary of the fencing. So an athlete can in practice achieve $h' < 0$ in high jumps. 

Suppose just before the final take off the athlete brings his/her CG down by a height 
$h_0\delta$ ($\delta < 1$) from its usual original height $h_0$. Then the athlete exerts maximum possible 
thrust on the ground so that the force of reaction $\mu Mg$ begins to act on the body. It can 
continue to act so long as the feet remain on the ground, that is, the CG does not go off 
above its normal height $h_0$. Since the work done on the whole body is $\mu Mgh_0\delta$, after the 
take off the CG will continue to rise a further height of $\mu h_0\delta$. Hence the total height of the 
CG above the ground at its peak of climbing becomes 

$$H + h' = h_0(1 + \mu\delta) \qquad (12.157)$$

where $H$ is the height of the reference bar from the level of the ground. 

For a typical athlete, $h_0 = 1$ m, $\delta = 0.6$, $\mu = 1.5$ and $h' = 0$, thus giving 

$$H = 1.9 \ \text{m}$$

By stretching all the parameters to the extreme, one can perhaps achieve $h_0 = 1.1$ m, 
$\delta = 0.75$, $\mu = 1.6$, $h' = -0.05$ m in which case $H = 2.47$ m, the present world record 
of high jump being 2.44 m, held by Javier Sotomayor of Cuba. 
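Both numerical estimates follow directly from $H + h' = h_0(1+\mu\delta)$, as the following minimal Python sketch shows; the function and argument names are illustrative only.

```python
def bar_height(h0, delta, mu, h_prime=0.0):
    """Height H of the bar cleared, from H + h' = h0 * (1 + mu * delta)."""
    return h0 * (1.0 + mu * delta) - h_prime

typical = bar_height(h0=1.0, delta=0.6, mu=1.5)
extreme = bar_height(h0=1.1, delta=0.75, mu=1.6, h_prime=-0.05)
print(f"typical athlete: {typical:.2f} m, extreme case: {extreme:.2f} m")
```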

The interesting point to note is that in Eq. (12.157) the value of $g$ scales out from both 
sides, and therefore, the statistics of high jump will hardly improve on any other celestial 
object where the value of $g$ is markedly different, except that the value of the $\mu$-factor 
might be substantially higher in conditions of lower gravity. 

12.28.7 Throwing 

In throwing a javelin, discus or shot put, the muscular strength of the athlete’s hands 
becomes the most important factor. From weightlifting statistics, it is known that the 
highest limit of lifting a dead load $W_d$ lies somewhere between 4 and 5.2 times the body 
weight, depending on the weight class of the weightlifters, the ratio being highest for 
weightlifters having body weights 60 to 70 kg. Since the weights are lifted by hands, the limits strongly depend on 
the cross section of the weakest joint in the arm ($A_w$) and the total surface area of the feet 
that supports the reaction, apart from the strength of the muscle fibres of the arms and 
legs. Hence the muscular strength of the hands for sustaining a maximum pressure can be 
expressed by the ratio $W_d/A_w$, where $A_w$ is the cross-section of the wrist. 




Now in all kinds of throwing processes the hand makes a final swing before the projectile 
is thrown out in the air. Hence the entire arm, due to the circular motion of the final swing, 
must experience an outward centrifugal force and the shoulder joint has to experience a 
tremendous amount of centrifugal pressure. This happens even when one attempts to throw 
a small stone chip, let alone a javelin, discus or shot put. During weightlifting, the outward 
pressure is maximum at the point of smallest cross-section, that is the wrist joint, but for 
throwing, the outward centrifugal pressure becomes maximum at the shoulder joint, which 
is given by 

$$P_{cf} = \frac{1}{A_0}\int_0^{L_h} A(l)\,\rho(l)\,\omega^2\,l\,dl$$

where $L_h$ is the length of the hand, $A(l)$ and $\rho(l)$ are respectively the cross-section and 
mass density at a distance $l$ below the shoulder joint, $\omega$ is the approximately constant 
angular speed of the arm at a given instant and $A_0$ is the cross-section at the shoulder joint 
($l = 0$). So 

$$P_{cf} \simeq \frac{m_h v^2}{2L_h A_0} \qquad (12.158)$$

where $m_h$ is the mass of the hand and $v$ is the speed of swing of the projectile held in the 
hand. 

At the time of release, the maximum speed of throw $v_{max}$ can reach high enough to 
satisfy the following equation 

$$P_{cf} = \frac{W_d}{2A_w}$$

as the weightlifting is done by using both the hands. Taking $W_d = \mu' Mg$, we get 

$$v_{max}^2 = \frac{L_h A_0 W_d}{m_h A_w} = \mu'\left(\frac{M}{m_h}\right)\left(\frac{A_0}{A_w}\right) g L_h \equiv \gamma g L_h$$

where $\gamma$ and $\mu'$ are defined through the above relations. 

It is desirable that the thrower have the largest possible value of $\gamma$. We can take $\mu' = 5$, 
$(M/m_h) = 20$ and $(A_0/A_w) = 1.5$, giving $\gamma = 150$. This means that a swinging hand 
can sustain a maximum outward centrifugal acceleration of the hand of about 150 times the 
value of $g$. 

Combining Eqs (12.155) and (12.158), it is now easy to find out the maximum possible 
range of throwing a stone chip, for example (without of course imparting any CM motion 
of the body to the projectile), which becomes 

$$R_{sc} = \gamma L_h \qquad (12.159)$$

or about 150 times the length of the hand. If $L_h = 0.65$ m, the maximum range of throwing 
a stone chip without running becomes a little less than 100 m. If by just one swing of hand 
you can throw up to a distance of say 50 m, you know that you can achieve a centrifugal 
acceleration on the order of 75 $g$, which also implies that after the act of throwing you would 
feel your hand to have become tired, as if it had lifted a dead load from the ground, weighing 
about 2.5 times your body weight. This proves not only that our arm muscles are generally 
very strong, but also that their actions are extremely swift. According to Eq. (12.158), a 
good thrower should have a relatively lighter, longer and stronger arm. 

Now if you run to bring the speed of your CM to $v_0$ before you throw the stone chip, the 
maximum range would become, using Eqs (12.155) and (12.158), 

$$R_m = \frac{(v_{max} + v_0)^2}{g}\sin 2\theta_l + R_0 \qquad (12.160)$$

the extra term $R_0$ being a small correction due to the nonzero height at the moment of 
the throw ($h_i$). Usually $R_0 \simeq h_i$. For $v_0 = 8$ m/s, $v_{max} = 30.9$ m/s and $R_0 = 1$ m, 
$R_m = 155$ m. 
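The chain of estimates for the stone chip (the factor $\gamma$, the release speed, and the two ranges) can be reproduced with a short Python sketch. It assumes the optimum take-off angle of 45° so that $\sin 2\theta_l = 1$; function names are illustrative.

```python
import math

G = 9.8  # m/s^2

def gamma_factor(mu_prime, body_to_hand_mass, shoulder_to_wrist_area):
    """gamma = mu' * (M/m_h) * (A_0/A_w)."""
    return mu_prime * body_to_hand_mass * shoulder_to_wrist_area

def release_speed(gamma, hand_length):
    """v_max from v_max^2 = gamma * g * L_h."""
    return math.sqrt(gamma * G * hand_length)

def throw_range(gamma, hand_length, run_speed=0.0, r0=0.0):
    """Eq. (12.160) with sin(2*theta) = 1."""
    v = release_speed(gamma, hand_length) + run_speed
    return v * v / G + r0

gamma = gamma_factor(5.0, 20.0, 1.5)
v_max = release_speed(gamma, 0.65)
standing = throw_range(gamma, 0.65)
running = throw_range(gamma, 0.65, run_speed=8.0, r0=1.0)
print(f"gamma = {gamma:.0f}, v_max = {v_max:.1f} m/s")
print(f"standing throw = {standing:.1f} m, running throw = {running:.1f} m")
```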


(i) Throwing a Javelin 

The javelin is basically a long but light rod pointed at one end. Since it is long and held in 
hand at the arm’s length, it will produce an extra centrifugal pressure at the hinge point of 
the swinging arm. One has therefore to add a term to our expression for $P_{cf}$ before equating 
to $W_d/2A_w$. The whole effect can be viewed as an increase in the effective mass of the hand 
$m_h$. An extra factor $\alpha_l$ by which the value of $m_h$ increases would depend on the mass and 
length of the javelin, the point where the javelin is held in hand and its orientation with 
respect to the straightened arm. Since one is allowed to run before throwing the javelin, 
one would have from Eqs (12.158) and (12.160) 

$$R_m = \frac{1}{g}\left(\sqrt{\frac{\gamma g L_h}{\alpha_l}} + v_0\right)^2 \sin 2\theta_l + R_0 \qquad (12.161)$$

which for $v_0 = 8$ m/s, $\alpha_l = 1.7$ gives $R_m = 104$ m, assuming that at the moment of 
throwing the initial height $R_0$ of the CG of the javelin was about 1.5 m above the ground. 
The current world record seems to stand at 104 m, set by Uwe Hohn of GDR. 


(ii) Throwing a Shot Put 

In this game a 16 lb lead shot is to be thrown from within a small marked circular zone on 
the ground without lifting at any time both the legs from the ground. With this heavy shot 
in one hand, the value of $\alpha_l$ in Eq. (12.161) becomes 4.8 or so, and the effective value of 
$\gamma$ does not quite reach 150 in just little more than half a swing. The thrower also attempts 
to achieve as high a forward centre of mass speed ($v_0$) as possible. This is done by turning 
back, then skidding backward on one leg (which makes him attain a forward CM motion), 
and finally making a half swing of the body as well as of the shot which follows a calculated 
spiral orbit before release. At the same time much effort is spent to raise the CM of the 
body so as to impart maximum possible vertical component of velocity to the shot. The 
typical values are $v_0 \simeq 2$ m/s, $\gamma \simeq 120$ and $R_0 = 1.5$ m, giving 

$$R_m \simeq 23.3 \ \text{m}$$

the current world record being about 22.9 m. 

(iii) Throwing a Discus 

A discus is not much heavier than a javelin, but because of its characteristic shape it requires 
a spin which endows it with a gyroscopic stability in its long route of directed motion through 
air. It being unlike a small cricket ball, adequate spin cannot be imparted just by the use of 
fingers and the thrower makes one or two complete turns about himself/herself before the 
discus is thrown. With $\alpha_l = 1.8$, $v_0 = 3$ m/s, $\gamma = 150$ and $R_0 = 1$ m, Eq. (12.161) 
gives $R_m = 70$ m. 
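The three estimates for the javelin, shot put and discus all come from the same expression, Eq. (12.161), with different parameters. The sketch below evaluates it under two assumptions that the text leaves implicit: the hand length $L_h = 0.65$ m used for the stone chip, and a take-off angle of 45° so that $\sin 2\theta_l = 1$.

```python
import math

G = 9.8      # m/s^2
L_H = 0.65   # hand length in m, assumed equal to the stone-chip example

def implement_range(gamma, alpha, run_speed, r0, hand_length=L_H):
    """Eq. (12.161) with sin(2*theta) = 1; alpha scales the effective hand mass."""
    v_release = math.sqrt(gamma * G * hand_length / alpha)
    return (v_release + run_speed) ** 2 / G + r0

javelin = implement_range(gamma=150.0, alpha=1.7, run_speed=8.0, r0=1.5)
shot_put = implement_range(gamma=120.0, alpha=4.8, run_speed=2.0, r0=1.5)
discus = implement_range(gamma=150.0, alpha=1.8, run_speed=3.0, r0=1.0)
print(f"javelin {javelin:.1f} m, shot put {shot_put:.1f} m, discus {discus:.1f} m")
```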

12.28.8 Cycling 

Cycling is performed by exerting muscular power of the leg on the pedals of a bicycle. In 
order to find the maximum speed of cycling achievable by a professional cyclist in a race, we 
must first know what maximum power is available from the thrust of the legs on the pedals. 
We can take this from the performance of an athlete in a 100 m race, for example. The 
configuration of legs during fast running suggests that the length of the strides is about 
$L/\sqrt{2}$, for the expression that we have used for $s$ in the subsection on running. Hence using 
Eqs (12.152) and (12.153), the maximum power spent running in a race per unit mass of 
the body is given by 

$$K_r = \frac{\mu g s\,v_0}{L/\sqrt{2}} = \frac{\sqrt{2}}{3}\left(\frac{m_l}{M}\right)\frac{v_0^3}{L} \qquad (12.162)$$

For an average athlete, $K_r \simeq 54$ W/kg (= 46 Cal/hr/kg for easy comparison with various 
entries in Table 12.4) of the body weight. Since running with such a high power lasts only 
for 10 to 100 sec, a 60 kg person shall not lose more than 8 to 80 Cal. The maximum power of 
leg thrusting on hard ground can thus become 2.7 kW for a 50 kg athlete, which is about 
3.6 HP! 
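The power figures can be checked with a few lines of Python. The sketch assumes the form $K_r = (\sqrt{2}/3)(m_l/M)\,v_0^3/L$, which follows from one push of work $\mu Mgs$ per stride of length $L/\sqrt{2}$ together with Eq. (12.153), and it uses the average-athlete values $v_0 = 8.7$ m/s, $L = 0.8$ m and $M/m_l = 7.25$; the conversion 1 HP = 746 W is also an assumption of the sketch.

```python
import math

JOULE_PER_CAL = 4200.0   # 1 Cal = 4200 J, as used in the text
WATT_PER_HP = 746.0      # assumed conversion factor

def running_power_per_kg(v0, leg_length, mass_ratio):
    """K_r in W/kg; mass_ratio is M/m_l."""
    return (math.sqrt(2.0) / 3.0) * v0 ** 3 / (mass_ratio * leg_length)

k_r = running_power_per_kg(v0=8.7, leg_length=0.8, mass_ratio=7.25)
k_r_cal = k_r * 3600.0 / JOULE_PER_CAL      # Cal/hr/kg
power_50kg = 50.0 * k_r                     # W
print(f"K_r = {k_r:.1f} W/kg = {k_r_cal:.1f} Cal/hr/kg")
print(f"50 kg athlete: {power_50kg / 1000.0:.2f} kW = {power_50kg / WATT_PER_HP:.1f} HP")
```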

If the bicycle racer keeps on feeding power at this rate to the cradle of the bicycle, the 
speed should keep on increasing endlessly. But as it gains speed the losses become more 
important. For a runner on the ground, the loss was mainly due to periodic acceleration and 
deceleration of the legs. We calculated the final steady speed of running by equating this 
rate of loss to the pumping rate from the thrusting of legs on the ground. In cycling too, 
the loss due to periodic acceleration and deceleration of the legs on the pedals is unavoidable, 
but it is now about 10 or 15 times smaller than the pumping rate (the exact factor depends 
on the diameter of the pedalling wheel, height of the bicycle, length of the legs and the speed 
ratio of the wheel to the cradle). The main loss is however due to combating the air drag. 
In the speed range we are interested in, the drag law is a quadratic one and is given by 

$$F_D = \frac{1}{2}C_D\rho_a A v^2 \qquad (12.163)$$

where $F_D$ is the drag force experienced by the moving system (= rider + bicycle), $A$ is 
the frontal cross-section of the whole system, $C_D = 0.9$, $\rho_a$ = density of air and $v$ is the 
speed of the system. The loss of power due to this drag is $F_D v$. Keeping a factor $\lambda$ (slightly 
greater than unity) in order to include the losses by the legs and a possible difference in the 
value of $\mu$ for thrusting on pedals rather than on ground, we can have for the maximum 
attainable speed $v_b$ in bicycle races 

$$v_b = \left(\frac{2\sqrt{2}\,m_l}{3\lambda C_D\rho_a A L}\right)^{1/3} v_0 \qquad (12.164)$$

where $v_0$ is the maximum speed of the athlete as a runner in, say, a 100 m race. The 
ratio of $v_b$ to $v_0$ seems to be fairly independent of the athlete’s capacities, as $m_l$ is nearly 
proportional to the product $AL$. For an athlete keeping his/her head vertically up while 
cycling, the net frontal area is approximately 



the numerical factor 0.5 becomes about 0.45 if both the trunk and the head lean sufficiently 
forward. Obviously such a posture is helpful in achieving higher speed as $A$ occurs in Eq. 
(12.164) in its denominator. For an average athlete the constant of proportionality in Eq. 
(12.164) becomes about 2.51 for $\lambda = 1.1$, thus giving 

$$v_b = 21.8 \ \text{m/s}$$

the present world record of maximum speed in bicycling being about 22 m/s. 

12.28.9 Swimming 

Swimming is a dynamical process in which the swimmer gains a net horizontal momentum 
by throwing parts of the body in and out while the body remains entirely supported by 
a pool of water. Since there is no component of gravity (the external force) that can act 
in the horizontal direction, the centre of mass of the body cannot acquire any horizontal 
component of linear momentum by merely performing any number of internal motions, such 
as throwing hands and legs, etc. So one has to transfer a net horizontal component of 
momentum to the surrounding medium, so that the body can receive an equivalent amount 
as a reaction. During running the momentum is transferred to the earth as an action and 
during swimming it is transferred to the surrounding water medium. In both the cases the 
body receives the reaction which is capable of producing the motion of the centre of mass 
of the body. 

There are four different styles of swimming depending on the mode of generating the 
required thrust on water. In freestyle, strokes by hands are performed longitudinally and 
alternately. In butterfly stroke, longitudinal strokes of hands are made in unison. In 
breaststroke, the hand strokes go sideways with hands remaining fully outstretched. In backstroke, 
the hand strokes are quite similar to those of the freestyle mode, but the body is now kept 
always in the supine position. Here we shall consider the butterfly mode for reasons of 
simplicity. 

Let us analyse a case when the strokes are produced both by hands and legs. The hand 
strokes begin with both the hands stretched in the forward direction. Let $\gamma g$ be the constant 
acceleration of the tip of the hand, produced by the muscles of the hand, and as a result, at 
time $t$, let $\theta$ be the angle that each arm makes with the forward direction. Assuming as before 
the hands to be uniform rods of length $L_h$, mass $m_h$ and longitudinal vertical cross-section 
$A_h$ (that effectively faces the force of drag from water), the centre of mass of each hand 
would experience a net horizontal force as the sum of $F_{ext} = m_h\gamma g/2$ and the drag force 
$F_D = -2C_D\rho_w A_h v_{cm}^2/3$, where $C_D$ = drag coefficient (for swimming speeds $C_D = 0.7$), 
$\rho_w$ = density of water, and $v_{cm}$ = the instantaneous speed of the CM of the hand. Since 
$F_D \propto v_{cm}^2$, it wins over the muscular force at some stage after which deceleration takes 
over and brings the hands to rest with respect to the shoulder joint, say at $t = t_0$ when 
$\theta = \theta_{max}$. Since both the final and initial values of $v_{cm}$ with respect to the body 
are zero, the hands must have transferred all the momentum they gained in between to the 
surrounding water. 


The net horizontal forward component of this momentum of each hand that goes to the 
water in each stroke is 

$$p_h = -\int_0^{t_0} m_h v_{cm}\sin\theta\,dt$$

The equation of motion can be exactly solved in order to get expressions for $p_h$, $\theta_{max}$ and 
$t_0$ in terms of $\gamma$, $g$, $m_h$, $C_D$, $A_h$ and $\rho_w$. The whole body receives the momentum $-p_h$ 
as a reaction and moves forward with a final speed $V_f$ (starting from zero, of course) at the 
end of $t = t_0$, given by 

= 55^71 (77) - 122 (lr) (12.165) 

and time taken 


to 


3^Sr l L k 8 mhX 

2V2 V 7 9 



where 9 t 


2y/3 m k 

CoPwAhLh 


(12.166) 


Note that the most uncertain parameter $A_h$ is eliminated using the expressions for the 
directly observable quantity $\theta_{max}$. The above expressions will be valid also for the leg 
strokes with appropriate values of the $\gamma$ factor, $L$ and $m_l$. Since a leg can exert a 
maximum force of $\mu Mg$, its lower tip, that is the foot, can produce a maximum acceleration of 
$2\mu Mg/m_l = \gamma' g \simeq 22\,g$. Also note that in the opening out phase of the legs, the body 
is retarded, while in the returning phase the body gains a forward momentum, if both are 
performed by thrusting water. 


The return strokes of the hands could retard the whole body almost exactly by the same 
amount as it had accelerated in their forward strokes, but the return strokes are carried out 
in air instead of in water, and therefore retardation of the body due to muscular efforts for 
resetting the hands back to the original positions are avoided. It is the drag force of water 
that acts on the whole body during both the onward and return parts of the strokes and 
continues to retard the motion of the CM. For maximum gain in thrust, the hands should 
open out in water, but the legs should open out in air. If the swimmer fails to do so, or 
does it the wrong way, the efforts of the legs will simply reduce the speed of swimming to a 
great extent. From Eq. (12.165), one can achieve $V_f \simeq 1.2$ m/s at the end of each stroke 
of the hands if $\theta_{max} = 30°$, and can go up to 1.7 m/s if $\theta_{max} = 60°$. 


It is obvious that the maximum average speed $V_c$ achieved by the swimmer will depend 
on the values of $V_f$, the total surface area of the body that experiences the drag and the 
relative lengths of the passive intervals between two successive active phases of the strokes. 
During each active part of the stroke by hands, the momentum of the whole body should 
follow the following equation of motion 

p ~ -* - 

where $A_0$ is the effective surface area of the whole body that produces the drag. And during 
the passive part of the strokes, 

p = -icwu(£)’ 

Under steady state conditions, the time average of $\dot{P}$ over the full period of a stroke must 
vanish. Since the solution for $p_h$ as a function of $t$ is given by 

the time average over a period $2t_0$ under the steady state conditions will give 

= \c oPw A„ <V*> (12.167) 

However in actual practice the return part of the stroke is made slightly longer than $t_0$, in 
which case, the coefficient 0.5 on the right hand side of Eq. (12.167) will effectively increase. 
As seen from the above expression for $p_h(t)$, the instantaneous velocity of the body $V$ will 
approximately vary sinusoidally with an amplitude $V_f$ over the average speed of swimming 
$V_c$, which would give, after averaging 

$$\langle V^2\rangle = V_c^2 + \frac{1}{2}V_f^2$$

Since both the legs and hands are producing the strokes, a more correct equation would 
be 

$$\langle V^2\rangle = V_c^2 + \frac{1}{2}\left(V_f^2\right)_{hand} + \frac{1}{2}\left(V_f^2\right)_{leg} \qquad (12.168)$$

Substituting Eq. (12.168) in Eq. (12.167) and using the expressions for the $V_f$’s, we get 


57 m h g I - _ 5^ /m*\ 2 / A 0 \ _ 5* 2 nrrn fm\ 2 f A p \1 
6 CopvAol 16 \MJ \A k ) 16 7T7U 


(12.169) 


neglecting other types of relatively minor contributions coming from the thrusts produced 
by the chest and head. 

Putting $A_0 = 0.6$ m², $A_h = 0.02$ m², $A_l = 0.09$ m², $M = 50$ kg, $\gamma = 150$, 
$M/m_l = 7.25$, $\gamma' = 22$, $M/m_h = 23$, $C_D = 0.7$, $\rho_w = 1000$ kg/m³, we get for 
the maximum attainable speed of swimming under most favourable cases 

$$V_c = 2.0 \ \text{m/s}$$

The Olympic records till the present suggest $V_c = 2.0$ m/s in the free style, 1.86 m/s in 
butterfly mode, 1.80 m/s in backstroke, and 1.59 m/s in the breaststroke. 




For different styles of swimming, the exact numerical coefficient 5/6 in the leading term 
of Eq. (12.169) would vary a great deal, apart from the changes in the values of $A_0$ and 
$A_h$. 

However, it should be noted that the treatment presented above is by no means complete. 
The physics of a complicated art like swimming has still remained a matter of research. 

12.28.10 Playing Tennis, Golf, Ping Pong and Base Ball 

Small and light weight balls are hit by hand-held tennis rackets for playing tennis, by 
hand-held golf sticks for playing golf, etc. The speed of the spinning balls often acquires a 
maximum exceeding 30 m/s. At such high speeds, two types of drag forces are experienced 
by the spinning balls. One is the usual quadratic law of drag force 

$$\mathbf{F}_D = -\frac{1}{2}C_D\rho_a A\,v\,\mathbf{v}$$

and the other, called the Magnus force of lift (after H. G. Magnus who discovered it in 1853), 
acts always perpendicular to the flying ball’s velocity vector $\mathbf{v}$ and its axis of spinning 
$\hat{\mathbf{n}} = \boldsymbol{\omega}/\omega$, 

$$\mathbf{F}_M = \frac{1}{2}C_L\rho_a A\,r\,(\boldsymbol{\omega}\times\mathbf{v})$$

where $\mathbf{v}$ is the instantaneous velocity of the ball, $r$ = radius of the ball, $A = \pi r^2$ = 
effective cross-section of the ball, $\boldsymbol{\omega}$ = angular velocity of rotation, $\rho_a$ = air density, 
$C_D$ = drag coefficient for linear speed, and $C_L$ = drag coefficient for the Magnus force. 
The last two quantities are given by the following approximate empirical formulae 

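$$C_D = 0.508 + \left[ 22.503 + 4.196 \left( \frac{v}{r\omega} \right)^{5/2} \right]^{-2/5}$$

$$C_L = \left[ 2.202 + 0.981 \left( \frac{v}{r\omega} \right) \right]^{-1}$$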

Hence the equation of motion of the centre of mass of the ball is simply

$$m \dot{\mathbf{v}} = m \mathbf{g} + \mathbf{F}_D + \mathbf{F}_M$$

Usually a constant spin angular speed is assumed for the entire trajectory, and the
equation of motion is integrated numerically, using the method of quadrature. It is found
that the effect of backspin is to increase the range of the ball. For attaining maximum
range, the backspin rate should be as high as possible, and the optimum launch angle
decreases as the initially given spin rate increases. Since the Magnus force always acts
perpendicular to the direction of motion, it cannot affect the speed, but the motion need
not remain confined to one plane: the ball swerves either to the right or to the left,
depending on the sign of the spin.
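
The numerical integration described above can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used in the text: the coefficients C_D and C_L are held constant at illustrative values, the ball parameters are roughly those of a tennis ball, and a fourth-order Runge-Kutta step is used in place of the quadrature mentioned above. The function names `acceleration` and `fly` are hypothetical.

```python
import math

RHO_AIR = 1.2   # air density in kg/m^3 (assumed value)
G = 9.81        # acceleration due to gravity in m/s^2

def cross(a, b):
    """Vector product a x b of two 3-tuples."""
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def acceleration(v, omega, m, r, c_d, c_l):
    """Acceleration from gravity (along -z), quadratic drag and Magnus force."""
    area = math.pi * r * r
    speed = math.sqrt(v[0]**2 + v[1]**2 + v[2]**2)
    k_d = 0.5 * c_d * RHO_AIR * area / m        # drag:   -k_d |v| v
    k_m = 0.5 * c_l * RHO_AIR * area * r / m    # Magnus: +k_m (omega x v)
    w_x_v = cross(omega, v)
    return (-k_d*speed*v[0] + k_m*w_x_v[0],
            -k_d*speed*v[1] + k_m*w_x_v[1],
            -G - k_d*speed*v[2] + k_m*w_x_v[2])

def fly(speed, angle_deg, omega, m=0.057, r=0.033, c_d=0.5, c_l=0.2, dt=1e-3):
    """Launch in the x-z plane with constant spin vector omega (rad/s).
    Returns the landing point (x, y) at launch height.
    The default m, r, c_d, c_l are illustrative assumptions."""
    a = math.radians(angle_deg)
    pos = (0.0, 0.0, 0.0)
    vel = (speed*math.cos(a), 0.0, speed*math.sin(a))
    while True:
        # RK4 step; the acceleration depends on the velocity only
        k1 = acceleration(vel, omega, m, r, c_d, c_l)
        v2 = tuple(vel[i] + 0.5*dt*k1[i] for i in range(3))
        k2 = acceleration(v2, omega, m, r, c_d, c_l)
        v3 = tuple(vel[i] + 0.5*dt*k2[i] for i in range(3))
        k3 = acceleration(v3, omega, m, r, c_d, c_l)
        v4 = tuple(vel[i] + dt*k3[i] for i in range(3))
        k4 = acceleration(v4, omega, m, r, c_d, c_l)
        new_pos = tuple(pos[i] + dt*(vel[i] + 2*v2[i] + 2*v3[i] + v4[i])/6
                        for i in range(3))
        new_vel = tuple(vel[i] + dt*(k1[i] + 2*k2[i] + 2*k3[i] + k4[i])/6
                        for i in range(3))
        if new_pos[2] < 0.0:
            # interpolate linearly to the instant of landing
            f = pos[2] / (pos[2] - new_pos[2])
            return (pos[0] + f*(new_pos[0] - pos[0]),
                    pos[1] + f*(new_pos[1] - pos[1]))
        pos, vel = new_pos, new_vel
```

For a ball launched along +x with z vertical, backspin corresponds to a spin vector along -y, for example `fly(30.0, 15.0, (0.0, -300.0, 0.0))`; comparing it with the spinless case `fly(30.0, 15.0, (0.0, 0.0, 0.0))` shows the longer range, while a spin about the vertical axis, `(0.0, 0.0, 300.0)`, gives a sideways drift in y.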

12.28.11 Playing Acrobatics 

Acrobatics involve basic rotations such as somersault, twist and cartwheeling of the body 





the moment of inertia tensor. Through various theorems on the moment of inertia and the 
properties of ellipsoid of inertia, the symmetries of the moment of inertia tensor are brought 
out. 

Since the moment of inertia of a body can remain steady only with reference to body 
frames, there is a necessity for developing a scheme for tackling dynamical problems with 
reference to body frames, which are generally rotating and are therefore grossly noninertial. 
The generalisations of the Newtonian schemes in retaining the torque angular momentum
relationship or developing the relevant equations of motion were successfully carried out by 
Euler. The addition of a centrifugal term in Euler’s equations of motion has been justified 
in the text in several ways. 

Free rotations of rigid bodies with Poinsot's geometrical interpretations in terms of
polhodes and herpolhodes, Euler's analytical solutions in terms of Eulerian angles, and a
combination of both in terms of body cones and space cones bring out all the subtle
features of free rotations.

The motions of symmetric tops under the action of gravitational torques about a fixed 
point in the rotating body are found to retain the same number of first integrals of motion 
as a freely rotating rigid body has. This property makes them ideally suited as the first 
step of extension towards understanding, as well as applying to, many natural examples of 
top motions in an analytic fashion. The introduction of the Eulerian angles helps interpret
the motions in terms of most fundamental rotations, such as precession, nutation and axial 
spin. 

Lastly, the motivation behind our consideration of a simplified dynamics of some sports 
and games has been to illustrate the richness of the motions of the human body as a rigid 
body. The physics of complicated events is usually not as complicated as it appears to
be. Through these examples we have also tried to demonstrate how one would go about
formulating a physical problem, be it as complicated as the process of running or
swimming.


PROBLEMS 

12.1 A sphere can roll without slipping on a given plane horizontal surface. Find the 
equations for the nonholonomic rolling constraint, if (i) the surface is at rest, and (ii) 
the surface is a uniformly rotating heavy platform. 

Show that in the second case, the ball will make circular orbits with respect to the 
outside inertial observers. 

12.2 A sphere of radius $a$ is pressed between two perfectly smooth parallel plates and
made to revolve with uniform angular speeds $\Omega_1$ and $\Omega_2$ about some fixed axes
perpendicular to their planes. Determine the motion of the sphere, and show that
the path of the centre of the sphere is a circle described with uniform speed.

12.3 A right circular cone of semivertical angle $\alpha$ rolls on a horizontal plane with its
vertex at O. A second cone of semivertical angle $\pi/2 - 2\alpha$ rotates with its axis
vertical and vertex at O, in rolling contact with the first cone. If the line of contact




of the first cone with the plane makes a complete circuit around the vertical in time 
T, determine the angular velocity of the second cone. 

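12.4 If $\mathbf{r}(t)$ denotes the position vector of any particle in a rigid body with respect to a
fixed point O in the body about which the body turns, show that the motion that
sends $\mathbf{r}(0)$ into $\mathbf{r}(t)$ may be reproduced as a single rotation of magnitude $\theta(t)$ about
an axis with unit vector $\hat{\mathbf{n}}(t)$, and that Rodrigues' finite rotation vector
$\boldsymbol{\beta}$ satisfies

$$\boldsymbol{\omega}(t) = \left(1 + \tfrac{1}{4}\beta^2\right)^{-1} \left(\dot{\boldsymbol{\beta}} + \tfrac{1}{2} \boldsymbol{\beta} \times \dot{\boldsymbol{\beta}}\right)$$

where $\boldsymbol{\omega}$ = instantaneous angular velocity vector, provided

$$\boldsymbol{\beta} = 2 \tan\left(\tfrac{1}{2}\theta\right) \hat{\mathbf{n}}$$

Also show that

$$\mathbf{r}(t) = \left(1 + \tfrac{1}{4}\beta^2\right)^{-1} \left[\left(1 - \tfrac{1}{4}\beta^2\right)\mathbf{r}(0) + \tfrac{1}{2}\left(\boldsymbol{\beta} \cdot \mathbf{r}(0)\right)\boldsymbol{\beta} + \boldsymbol{\beta} \times \mathbf{r}(0)\right]$$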

12.5 Prove the following theorems on centre of mass and moment of inertia 

(i) Lagrange's theorem (1783): If $\mathbf{R}$ be the position vector of the centre of mass at G
of any given object measured from any arbitrarily chosen point O in the body, then
prove that

$$M R^2 = \sum_{i=1}^{N} m_i r_i^2 - \frac{1}{2M} \sum_{i=1}^{N} \sum_{j=1}^{N} m_i m_j \left| \mathbf{r}_i - \mathbf{r}_j \right|^2$$

where $\mathbf{r}_i$ is the position vector of the ith particle, and M = total mass of the body.
Use this theorem to find out the centre of mass of a regular pyramid.

(ii) If x, y, z are three mutually perpendicular intersecting axes set in a given body,
prove that the sum of the MI about the three axes is given by

$$I_{xx} + I_{yy} + I_{zz} = 2 \sum_i m_i r_i^2$$

or in other words, prove that the trace of the moment of inertia tensor is invariant.
Use this theorem to show that the moment of inertia of a circular hoop of mass M
and radius R about an axis at 45° with respect to the symmetry axis is $3MR^2/4$.

(iii) Express the mass quadrupole tensor $D_{ij} = \sum m (3 x_i x_j - r^2 \delta_{ij})$ in terms of
the moment of inertia tensor $I_{ij}$.

12.6 An object of mass M and moment of inertia I is initially at rest on a frictionless 
surface. If F be a force of constant magnitude whose line of application is always 
at a distance d from the centre of mass and its orientation with respect to the body 
remains always the same, show that the trajectory of the centre of mass of the object 
is a Cornu’s spiral. 

12.7 What is the height-to-diameter ratio of a right circular cylinder such that the inertial 
ellipsoid at the centre of the cylinder is a sphere. Is it possible to have a suitable 




of inertia are harmonic functions of time

$$I_{zz} = \frac{2mr^2}{5}(1 + \epsilon \cos \omega t), \qquad I_{xx} = I_{yy} = \frac{mr^2}{5}(2 - \epsilon \cos \omega t)$$

where $\epsilon \ll 1$. The sphere is simultaneously rotating with angular velocity $\boldsymbol{\Omega}(t)$.
Show that the z-component of $\boldsymbol{\Omega}$ remains approximately constant. Show also that
$\boldsymbol{\Omega}(t)$ precesses around the z-axis with a precession frequency $\omega_p = (3\epsilon\Omega_z/2)\cos\omega t$
provided $\Omega_z \gg \omega$.


12.20 A symmetrical top is spinning with its vertex in contact with a rough horizontal
plane. Initially the axis of the top is at rest and makes an angle $\pi/3$ with the
upward vertical, the spin about the axis being $2\sqrt{AMgh}/C$. Prove that after the axis
is released its inclination to the vertical oscillates between $\pi/3$ and $\pi/2$, and that in
the latter position the angular velocity of precession is $\sqrt{Mgh/A}$.


12.21 If the outer surface of the earth is assumed to be an oblate spheroid (equatorial radius
$a$, polar radius $c$) with the principal moments of inertia about its centre $A$, $A$ and
$C$, and it rotates with an angular velocity $\Omega$ about the polar axis, show that the
expressions for the effective acceleration due to gravity $g_{\mathrm{eff}}$ at the poles, $g_p$, and at
the equator, $g_e$, are given by

$$g_e = \frac{GM}{a^2} \left( 1 + \frac{a - c}{a} - \frac{3\Omega^2 a^3}{2GM} \right)$$

and

$$g_p = \frac{GM}{a^2} \left( 1 + \frac{\Omega^2 a^3}{GM} \right)$$

where M is the mass of the earth and G the Newtonian constant of gravitation.


12.22 (i) Show that the condition for secular stability of a freely rotating rigid body (that
is, with the angular momentum L = constant) having a moment of inertia I is given
by the absolute minimum of the effective potential

$$V_{\mathrm{eff}} = V + \frac{L^2}{2I}$$

(ii) Consider the example given at the end of section 12.27. Show that for $\omega^2 < g/a$,
the effective potential has an absolute minimum at $\theta = 0$, but for $\omega^2 > g/a$, the
minimum shifts to $\theta = \theta_c = \cos^{-1}(g/\omega^2 a)$. Will the shot suddenly jump to the
new state of stable equilibrium or do it gradually, if the above critical limit in $\omega$ is
crossed from below? Solve the equation of motion and describe the path it follows.


12.23 Determine the period of small amplitude oscillations of a uniform hemisphere which 
lies on a smooth horizontal surface in the field of gravity, keeping its plane surface 
up. Consider also the more general case in which the sphere is arbitrarily cut by a 
horizontal plane and is made to oscillate about the symmetry axis. 


12.24 Do a better analysis of the power demand in walking. Suppose the power P is spent
not only in raising the CG of the body ($P_1$) in each stride but also in accelerating
and decelerating the legs ($P_2 = P - P_1$) in each stride. Consider also the effect of
finite foot length $a_f$ in considering the length of strides $a$. Find the functional form




for P as P(v,a) and show that P has a minimum for some intermediate length of 
strides. 

12.25 Show that a backspinning ball while bouncing off from a surface always experiences 
a greater decrease in both translational kinetic energy and in total energy than a 
forward spinning ball, so that the ball slows down to a great extent. This is why a 
backspinning cricket ball gives rise to a very slow catch within yards of the striker. 

12.26 (i) Show that the maximum range of a triple jump is approximately twice the
maximum range of the long jump.

(ii) If the maximum height of a high jump is H and the maximum attainable speed
in horizontal running is $v_0$, find the maximum height achieved in a pole vault.

12.27 (i) Why do you prefer to locate a spacious flat surface of a rock to sit upon, while 
during diving you prefer to use the minimum area of the water surface to dip in? 

(ii) Cats and monkeys jump from a height with any arbitrary initial configuration 
and always land safely on the ground having rotated their bodies in flight by the 
right amount. Does this violate the principle of conservation of angular momentum? 
If not, how? 

(iii) While walking, why do your hands swing in opposite directions? 

(iv) How will you explain the larger total surface area of the body of a man than 
that of a woman of the same body weight? 

12.28 A chain is suspended keeping both the ends at the same elevation. One end is 
released. Show that the free end can fall with an acceleration exceeding g. 

12.29 Prove the following theorem on elastic collision of two rigid bodies: 

The component of relative velocity of points of impact along the impulse direction is 
exactly reversed. 

12.30 It is known that a doubly asymmetric round pebble when left spinning on a flat 
surface about one of its asymmetric axes suddenly reverses its sense of spinning. 
This problem has been completely solved by Hermann Bondi only a few years ago. 
You can give it a try. 






Elasticity 


13.0 INTRODUCTION 

In reality no body is strictly rigid but many solid bodies undergo small deformations and 
return to their original size or configuration when the deforming agency is removed. Such 
bodies are known as elastic bodies and the property is known as elasticity. Thus a rigid 
body becomes a solid body when the rigidity constraints are minimally relaxed. But then, 
a solid body consisting of N particles must be assumed to have 3N degrees of freedom. Since
the displacement of each particle is small, the immediate neighbourhood of the constituent 
particles is minimally disturbed. In this chapter we study the elastic properties of any 
solid body assuming that the body is a continuous medium consisting of an infinite number 
of particles and therefore having infinite degrees of freedom. Thus each point in the body 
changes its position under deformation. The deformations are assumed to be generally small 
compared to the extent of the body. A knowledge of tensors is a prerequisite for reading 
this chapter. 

The first law of elastic displacements came from Robert Hooke (1635 - 1703). A weak boy 
from birth, he suffered throughout his life, from chronic inflammation of his frontal sinuses 
in childhood, to severe indigestion, giddiness and insomnia in later years. No portrait of his 
exists today. When he was 30, in a meeting of the Royal Society, someone described him 
as ‘one who is the most, and promises the least, of any man in the world that I ever saw’. 
At the age of 18, he entered Oxford University as a student, and assisted Robert Boyle in
designing air pumps. Boyle’s law was possibly due to Hooke! In 1660, he conceived the 
fundamental idea of a simple spring to control the oscillations of a balance wheel in a watch. 
A few years later, he invented the spiral spring, but lost to Huygens in the claim of priority. 
He published Lectures de Potentia Restitutiva, or of Spring, in which he stated his law of
elasticity and its implications. 

However, the major stalwarts of this field are once again Euler, Lagrange, Cauchy, Lamé,
Young and Poisson. Like Euler, Augustin-Louis Cauchy (1789 - 1857) was a prolific writer
of papers, producing more than 700 original papers, many of which are longer than 100 
pages. However, the modern treatment of elasticity in terms of tensor notations got its 
basic motivation from Albert Einstein, who used the concept of the stress-energy tensor in 
his formulation of the general theory of relativity. Landau and Lifshitz’s book on elasticity 
is strongly recommended for a detailed and complete exposition of the subject. 




13.1 DISPLACEMENT VECTOR AND THE STRAIN TENSOR 

Let us assume that the position vector of any given point of an elastic body, say
$\mathbf{r} = (x_1, x_2, x_3)$ before deformation, changes to a new position vector
$\mathbf{r}' = (x'_1, x'_2, x'_3)$ under the action of some deforming agent. The displacement
vector defined by (see Fig. 13.1)

$$\mathbf{u} = \mathbf{r}' - \mathbf{r} \tag{13.1}$$

can be regarded as a continuous vector field defined over the whole body, so that

$$u_i = x'_i - x_i = u_i(x_1, x_2, x_3) \tag{13.2}$$



Fig. 13.1 Displacement vector field $\mathbf{u}(\mathbf{r}) = \mathbf{r}' - \mathbf{r}$, defined over the
region of 3-D space occupied by the elastic body


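Let us now expand the displacement $u_i(x_1, x_2, x_3)$ at any point in the body with position
vector $\mathbf{r}$ in a Taylor series about the origin O. This gives

$$u_i(x_1, x_2, x_3) = u_i(0, 0, 0) + \left( \frac{\partial u_i}{\partial x_k} \right)_{\mathbf{r}=0} x_k + \frac{1}{2} \left( \frac{\partial^2 u_i}{\partial x_j \partial x_k} \right)_{\mathbf{r}=0} x_j x_k + \cdots \tag{13.3}$$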


We further assume that for sufficiently small elastic displacements, the linear terms
dominate the second and higher order terms. In that case we get

$$u_i(x_1, x_2, x_3) = u_i(0, 0, 0) + a_{ik} x_k \tag{13.4}$$

where

$$a_{ik} = \left( \frac{\partial u_i}{\partial x_k} \right)_{\mathbf{r}=0}$$

are constant coefficients, independent of $x_1, x_2, x_3$. It is now easy to see that apart from a




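constant, the elastic displacements are linearly homogeneous in $x_k$, that is,

$$u_i(x_1, x_2, x_3) = u_i(0, 0, 0) + \frac{\partial u_i}{\partial x_k} x_k \tag{13.5}$$

where

$$a_{ik} = \frac{\partial u_i}{\partial x_k} = \text{constant},$$

i.e. the $a_{ik}$'s are independent of $\mathbf{r}$.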


Thus, under this assumption, the displacement at any arbitrary point $(x_1, x_2, x_3)$ of the
body is the sum of the displacement at the origin of the coordinate frame and some linearly 
dependent expansion terms. The first term in Eq. (13.5) is just an additive constant which 
is the same for all points in the body. So one can interpret the first term in Eq. (13.5) as 
the uniform displacement of the whole body. Such a uniform displacement is nothing but a 
translation for the whole body, which cannot be regarded as a deformation. A deformation, 
by definition, implies displacement differing from point to point; otherwise, the configuration 
remains identical. Therefore, without loss of generality we can drop the first term from the 
expansion in Eq. (13.5) as we are interested in elastic deformations. Thus we get, for the 
displacement at any arbitrary point in the body, 


$$u_i(x_1, x_2, x_3) = \frac{\partial u_i}{\partial x_k} x_k = a_{ik} x_k \tag{13.6}$$


Here we have defined the set of quantities $a_{ik}$ for the purpose of the rest of the chapter.
These are the rates at which non-translational displacements change for a unit shift of the
position. Such quantities are by definition strains, the deformation over any unit distance.
Since $\mathbf{u}$ is a vector, $\mathbf{x}$ is another vector, and the $a_{ik}$ are constants (see Eq. (13.5)), the
quotient theorem of tensors suggests that $a_{ik}$ must be a tensor. Since the individual
components of $a_{ik}$ are nothing but strain-like quantities, the full set of nine quantities
$a_{ik}$, $i = 1, 2, 3$, $k = 1, 2, 3$, is called the strain tensor. So for homogeneous linear
deformations, Eq. (13.5) requires in total 12 quantities to be fixed. Hence such a body has
only 12 degrees of freedom, out of which 3 DOF are due to the translation of the whole
body.


Fig. 13.2 The components $u_i$ and $u_k$ of the displacement vector are shown to vary along
the $k$-axis, suggesting a possible tensor character of the coefficients $a_{ik}$


The individual component $a_{ik}$ is the strain produced in the ith component of the
deformation if one moves in the direction of the kth axis (see Fig. 13.2). From Eq. (13.6),




we can write 


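$$\begin{aligned}
u_i &= \tfrac{1}{2}(a_{ik} + a_{ki}) x_k + \tfrac{1}{2}(a_{ik} - a_{ki}) x_k \\
&= \frac{1}{2} \left( \frac{\partial u_i}{\partial x_k} + \frac{\partial u_k}{\partial x_i} \right) x_k + \frac{1}{2} \left( \frac{\partial u_i}{\partial x_k} - \frac{\partial u_k}{\partial x_i} \right) x_k \\
&= e_{ik} x_k + \omega_{ik} x_k
\end{aligned} \tag{13.7}$$

Here $e_{ik} = (a_{ik} + a_{ki})/2$ is the symmetric part of the tensor $a_{ik}$ and
$\omega_{ik} = (a_{ik} - a_{ki})/2$ is the antisymmetric part.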

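Let us take

$$\mathbf{r}' = \mathbf{r} + \delta\boldsymbol{\theta} \times \mathbf{r}$$

so that

$$u_i = \epsilon_{ijk} \delta\theta_j x_k \tag{13.8}$$

Therefore

$$\begin{aligned}
\omega_{ik} x_k &= \frac{1}{2} \left( \frac{\partial u_i}{\partial x_k} - \frac{\partial u_k}{\partial x_i} \right) x_k \\
&= \tfrac{1}{2} \left[ \epsilon_{ijk} \delta\theta_j x_k - \epsilon_{kji} \delta\theta_j x_k \right] \\
&= \epsilon_{ijk} \delta\theta_j x_k \\
&= u_i
\end{aligned} \tag{13.9}$$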


Equation (13.9) means that the antisymmetric part of $a_{ik}$ corresponds to a pure rotation
of the body as a whole. Proceeding in a similar manner, we obtain for $e_{ik}$

$$e_{ik} x_k = 0 \tag{13.10}$$

from the definition of $u_i$ given by Eq. (13.8). Thus pure rotation corresponds to no strain
at all, which is intuitively obvious.

Thus $\omega_{ik}$ corresponds to pure rotation, and a pure rotation does not contribute to $e_{ik}$.
Since pure rotation does not bring about any deformation in the body, this part of $a_{ik}$ does
not correspond to any straining. Note further that any antisymmetric tensor of second rank
can be viewed as an axial vector, say $\boldsymbol{\omega}$, which in this case is given by

$$\boldsymbol{\omega} = \tfrac{1}{2} \nabla \times \mathbf{u} \tag{13.11}$$

Thus the symmetric part $e_{ik}$ of the tensor $a_{ik}$ represents the strain tensor in the true sense
of the term.
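
The splitting of the displacement gradient into a strain part and a rotation part can be illustrated with a short sketch. The function names `split_gradient` and `rotation_vector` and the sample numbers are hypothetical, chosen only for illustration; the gradient is stored as a nested list `a[i][k]` with indices running from 0 to 2.

```python
def split_gradient(a):
    """Split a 3x3 displacement gradient a[i][k] into its symmetric part e
    (the strain) and its antisymmetric part w (the rotation)."""
    e = [[0.5 * (a[i][k] + a[k][i]) for k in range(3)] for i in range(3)]
    w = [[0.5 * (a[i][k] - a[k][i]) for k in range(3)] for i in range(3)]
    return e, w

def rotation_vector(w):
    """Axial vector of the antisymmetric part, from w_ik = eps_ijk dtheta_j:
    dtheta_1 = w_32, dtheta_2 = w_13, dtheta_3 = w_21 (1-based indices)."""
    return (w[2][1], w[0][2], w[1][0])

# Illustrative gradient: a small rotation of 0.01 rad about the z-axis
# superposed on small stretches along x and y (made-up numbers).
a = [[0.002, -0.01, 0.0],
     [0.01,  0.001, 0.0],
     [0.0,   0.0,   0.0]]
e, w = split_gradient(a)
dtheta = rotation_vector(w)
```

In this example the off-diagonal entries of `a` cancel in the symmetric part, so `e` keeps only the two stretches, while `dtheta` recovers the rotation about the z-axis.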


13.1.1 Properties of $e_{ik}$

(i) It is a real and symmetric tensor of second rank. Therefore, it can have at most six
independent nonzero components. So out of 9 independent components of $a_{ik}$, 3 independent
components are required for the antisymmetric part representing rotation, and the remaining
six for purely elastic strain.

(ii) This tensor can be diagonalised by a principal axes transformation. All it means is 




that one can choose a rectangular triad at the origin in the body, so that only the diagonal
components $e_{ii}$ exist and $e_{ij} = 0$ for $i \neq j$.

(iii) Any arbitrary rotational transformation (which is an orthogonal transformation)
leaves the trace of the $e_{ik}$ matrix

$$\Delta = e_{11} + e_{22} + e_{33}$$

invariant. Note that

$$\Delta = e_{11} + e_{22} + e_{33} = \nabla \cdot \mathbf{u} \tag{13.12}$$


13.1.2 Dilation 


The rotationally invariant quantity defined by Eq. (13.12) is called dilation. The meaning
of this term will be apparent from the following considerations.

Let a volume element $dV$ change to $dV'$ under elastic deformation. Let us assume that
the tensor $a_{ik}$ can be diagonalised, and choose the principal axes to be the coordinate axes.
Let $x'_i$ be the displaced coordinate along the ith principal axis. Then,

$$dx'_i = dx_i (1 + a_{ii}) \qquad (i \text{ not summed over})$$

Therefore, to first order in the strains,

$$\begin{aligned}
dV' &= dx'_1 \, dx'_2 \, dx'_3 \\
&= dx_1 \, dx_2 \, dx_3 (1 + a_{11} + a_{22} + a_{33}) \\
&= dV (1 + e_{11} + e_{22} + e_{33}) \\
&= dV (1 + \Delta)
\end{aligned} \tag{13.13}$$

Now we know that by definition the volume strain or the bulk strain of the body is given by

$$\frac{dV' - dV}{dV} = \Delta = \nabla \cdot \mathbf{u} \tag{13.14}$$

where we have used Eq. (13.13). Thus $\Delta$ is simply the volume strain of the body.

Again, it is to be noted that since the antisymmetric part of $a_{ik}$ is given by a curl of
some vector, it cannot contribute to the divergence of $\mathbf{u}$, as the divergence of the curl of any
vector is zero. This result is consistent with the fact that if a body simply rotates through
an angle, its volume does not change.

Therefore, when $\Delta = 0$ the volume of the body is not affected by the process of
straining.
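
A quick numerical check of this first-order statement is sketched below. It relies on the standard fact that the exact ratio dV'/dV equals the determinant of the matrix with elements (Kronecker delta + a_ik), and compares that ratio with 1 plus the trace of the gradient. The function names and the sample gradients are hypothetical, chosen only for illustration.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def dilation(a):
    """Trace of the displacement gradient, i.e. the divergence of u."""
    return a[0][0] + a[1][1] + a[2][2]

def volume_ratio(a):
    """Exact dV'/dV for the homogeneous map x'_i = x_i + a_ik x_k."""
    f = [[a[i][k] + (1.0 if i == k else 0.0) for k in range(3)]
         for i in range(3)]
    return det3(f)

# A small general gradient and a small pure rotation (made-up numbers).
a_general = [[1e-3, 2e-4, 0.0],
             [-1e-4, -5e-4, 3e-4],
             [0.0, 1e-4, 2e-3]]
a_rotation = [[0.0, -0.01, 0.0],
              [0.01, 0.0, 0.0],
              [0.0, 0.0, 0.0]]
```

For `a_general` the exact ratio differs from `1 + dilation(a_general)` only by terms of second order in the strains, and for `a_rotation` the dilation vanishes identically, so the volume is unchanged to first order.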

Another way of looking at the meaning of $\Delta$ is the following. By a well known corollary
of the divergence theorem we get

$$\Delta = \operatorname{div} \mathbf{u} = \lim_{V \to 0} \frac{1}{V} \oint_S \mathbf{u} \cdot d\mathbf{S}$$

where S is the boundary surface of the body. However, since $\mathbf{u}$ is the displacement vector




field, the surface integral in the above equation is the change in volume $\delta V$ enclosed by the
surface S due to deformation. Therefore,

$$\Delta = \operatorname{div} \mathbf{u} = \lim_{V \to 0} \frac{\delta V}{V} \tag{13.15}$$

Assuming that this limit exists and (for the homogeneous strain) is the same for the whole
body, we see that $\Delta$ is, by definition, the bulk strain of the whole body.

13.1.3 Shearing Strain 

Let us assume that for a given solid and a given vector field $u_i$ defined on it, the strain
tensor $a_{ik}$ is symmetric, that is, $\omega_{ik}$ in Eq. (13.7) is zero. For a given solid, we say such a
vector field $\mathbf{u}$ represents a purely strain producing displacement. We can then write,

$$\begin{aligned}
u_i &= e_{ik} x_k \\
&= \left( e_{ik} - \tfrac{1}{3} \Delta \delta_{ik} \right) x_k + \tfrac{1}{3} \Delta \delta_{ik} x_k \\
&= b_{ik} x_k + \tfrac{1}{3} \Delta \delta_{ik} x_k
\end{aligned} \tag{13.16}$$

Here $\Delta$ is as defined by Eq. (13.12) and $\delta_{ik}$ is the Kronecker $\delta$ symbol. Equation (13.16)
defines a new tensor $b_{ik}$. It is easy to see that the tensor $b_{ik}$ is traceless:

$$b_{ii} = b_{11} + b_{22} + b_{33} = e_{ii} - \tfrac{1}{3} \Delta \delta_{ii} = \Delta - \tfrac{1}{3} \Delta \cdot 3 = 0 \tag{13.17}$$

Since $b_{ik}$ is traceless the first term in Eq. (13.16) does not have any dilation, that is, it
does not give rise to any change in volume. We know that any nonzero strain that does not
induce any change in volume is going to change the shape. Such a strain is called a shearing
strain or a shear for short. Thus $b_{ik}$ represents the shearing strain.

The second term in Eq. (13.16), namely $(\delta_{ik} \Delta/3)$, is isotropic in nature, with only
nonvanishing diagonal components, each equal to $\Delta/3$, so that its trace amounts to $\Delta$, the
dilation. Thus the tensor $(\delta_{ik} \Delta/3)$ corresponds to the entire bulk strain.

Thus any strain producing displacement can be decomposed into two parts; one related 
to the shearing strain and the other related to the isotropic bulk strain. 
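
The decomposition into a traceless shear part and an isotropic bulk part can be sketched numerically as follows. The function name `split_strain` and the sample strain values are hypothetical, used only for illustration.

```python
def split_strain(e):
    """Split a symmetric strain tensor e[i][k] into a traceless shear part b
    and an isotropic part (dilation/3) times the unit tensor."""
    dilation = e[0][0] + e[1][1] + e[2][2]
    iso = [[dilation / 3.0 if i == k else 0.0 for k in range(3)]
           for i in range(3)]
    b = [[e[i][k] - iso[i][k] for k in range(3)] for i in range(3)]
    return b, iso, dilation

# Illustrative symmetric strain tensor (made-up numbers).
e = [[0.003,  0.001,  0.0],
     [0.001, -0.001,  0.0005],
     [0.0,    0.0005, 0.004]]
b, iso, dilation = split_strain(e)
trace_b = b[0][0] + b[1][1] + b[2][2]   # vanishes up to rounding
```

The off-diagonal components of `e` pass unchanged into the shear part `b`; only the diagonal is shifted by one third of the dilation.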

However, in general, any arbitrary infinitesimal displacement leading to a homogeneous 
strain given by Eq. (13.5) can have four components: 

(i) translation of the whole body (number of DOF = 3),

(ii) rotation represented by the antisymmetric part of $a_{ik}$ (number of DOF = 3),

(iii) an isotropic change $\tfrac{1}{3} \delta_{ik} \Delta x_k$ corresponding to a change in volume without any
change in shape (number of DOF = 3) and

(iv) a shear given by $b_{ik} x_k$ corresponding to a change in shape without any change in
volume (number of DOF = 3).

Note that the first two motions also apply to a rigid body. Thus the extra degrees of 
freedom possessed by a homogeneously strained elastic body show up through motions given 
in (iii) and (iv) above. That a change in volume corresponds to 3 DOF is obvious. Since 
the strain is homogeneous the displacements $u_i$ at any two points in the body are not




independent. In fact, once the shearing displacement $u_i$ at the origin is fixed, those at any
other point in the body are also fixed, through the first term of Eq. (13.16) giving rise to 3
DOF for the shear. Thus the total number of DOF for a homogeneously strained body is
12.

There may be a case of straining called torsion which falls in the category of shear, and 
which may not produce any visible change in shape, if the body is under torsion about its 
symmetry axis. This is in fact a case in which straining occurs internally, without showing 
any change in the size or the external shape of the body. 

In the case of pure strain, that is, when $\boldsymbol{\omega} = 0$, Eq. (13.11) gives $\nabla \times \mathbf{u} = 0$.
Therefore, $\mathbf{u}$ can be conveniently derived from a scalar potential function called strain
potential $\phi(x_1, x_2, x_3)$ through the relation

$$\mathbf{u} = -\nabla \phi \tag{13.18}$$

Equation (13.18) means that the displacement $\mathbf{u}$ is always perpendicular to the equipotential
surface for the strain potential $\phi$. Further, for pure shear $\Delta = \nabla \cdot \mathbf{u} = 0$ giving

$$\nabla^2 \phi = 0 \tag{13.19}$$

which means that, for pure shear, $\phi$ is a harmonic function.


13.1.4 Strain Ellipsoid 


Let us draw a unit sphere about the origin (taken at an arbitrary point in the body) and see
what it looks like after a small homogeneous elastic deformation has taken place throughout
the body. For simplicity we choose the axes to be the principal axes of the strain tensor
$e_{ik}$. Let the principal components of $e_{ik}$ be $e_1$, $e_2$ and $e_3$ respectively. The displaced
coordinate $x'_i$ along the ith principal axis, for any point on the unit sphere, is then given
by

$$x'_i - x_i = u_i = e_i x_i$$

or

$$x'_i = x_i (1 + e_i) \qquad (i \text{ is not summed})$$

This gives, for the radius of the sphere,

$$1 = x_1^2 + x_2^2 + x_3^2 = \frac{x_1'^2}{(1 + e_1)^2} + \frac{x_2'^2}{(1 + e_2)^2} + \frac{x_3'^2}{(1 + e_3)^2} \tag{13.20}$$

Hence the deformed unit sphere is an ellipsoid. This ellipsoid is called the strain ellipsoid
(see Fig. 13.3). The volume of the strain ellipsoid is $(1 + \Delta)$ times that of the unit sphere.
Hence a sphere of radius $1 + \tfrac{1}{3} \Delta$ will have approximately the same volume as that of the
strain ellipsoid.
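
The geometry of the strain ellipsoid can be checked with a few lines of code. The sketch below works in the principal axes, maps a point of the unit sphere to its displaced position, and compares the exact volume ratio with the first-order estimates quoted above. The function names and the principal strains are hypothetical, chosen only for illustration.

```python
def deform(x, e):
    """Displaced coordinates x'_i = x_i (1 + e_i) in the principal axes."""
    return tuple(x[i] * (1.0 + e[i]) for i in range(3))

def ellipsoid_lhs(xp, e):
    """Sum of (x'_i / (1 + e_i))^2; equals 1 for points of the strain ellipsoid."""
    return sum((xp[i] / (1.0 + e[i])) ** 2 for i in range(3))

def volume_ratio(e):
    """Exact ratio of the ellipsoid volume to that of the unit sphere."""
    return (1.0 + e[0]) * (1.0 + e[1]) * (1.0 + e[2])

principal = (0.003, -0.001, 0.002)     # made-up principal strains
point = (0.6, 0.0, 0.8)                # a point on the unit sphere
image = deform(point, principal)       # the same material point after straining
ratio = volume_ratio(principal)        # close to 1 + (e_1 + e_2 + e_3)
equal_volume_radius = ratio ** (1.0 / 3.0)   # close to 1 + (e_1 + e_2 + e_3)/3
```

The differences between the exact values and the first-order estimates are of second order in the principal strains.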


If $e_1 \neq e_2 \neq e_3$, the directions of $\mathbf{r}'$, $\mathbf{r}$ and $\mathbf{u}$ are all different (see Fig. 13.3) except
when $\mathbf{r}$ is along any one of the principal axes. Hence, in general there is shear. But the
geometrical structure of a general shear is not a simple one. The circles of intersection
between the strain ellipsoid and the sphere of radius $R = 1 + (\Delta/3)$ form the bases of
two cones about which a general shear occurs (see Fig. 13.3).




Fig. 13.3 A section of the strain ellipsoid in the x-z plane. The original
circle (actually sphere) is deformed to an ellipse (ellipsoid) with a
change in area (volume), keeping however, certain directions (on
the surface of a cone) undeformed

It is customary to express a general shear as a combination of three simple shears about
any three mutually perpendicular (triad) axes, with respect to which the $e_{ij}$'s are available
with nonzero off-diagonal elements. Any off-diagonal term $e_{ij}$ corresponds to a pure shear
about the shearing axis $\hat{\mathbf{k}} = \hat{\mathbf{i}} \times \hat{\mathbf{j}}$ with the amount of shear equal to $2e_{ij}$, which, by
definition, is the angle of shear. A symmetric pair of terms ($e_{ij} = e_{ji}$, $i \neq j$) implies no
stretch along the $\hat{\mathbf{k}} = \hat{\mathbf{i}} \times \hat{\mathbf{j}}$ direction. In this case a stretch of amount $e_{ij}$ along the
positive bisector of the $i$-$j$ axes is matched by a contraction of the same amount along the
negative bisector of the $i$-$j$ axes. The combination of these three simple shears (that is,
off-diagonal elements of the $e_{ij}$ tensor) is somewhat difficult to imagine, but Fig. 13.4 may
help visualise such a case.

In Fig. 13.4, □OPRQ → □OP'R'Q' after shearing in the plane of the diagram due to
the terms $e_{ij}$ and $e_{ji}$.

$$\text{Angle of shear} = \angle POQ - \angle P'OQ' = e_{ij} + e_{ji} = 2 e_{ij}$$

The change in area due to shear is given by (see Fig. 13.4)

$$\square OP'R'Q' - \square OPRQ = x_i x_j - x_i x_j (1 - e_{ij}^2) = x_i x_j e_{ij}^2 \tag{13.21}$$

This change is of second order in $e_{ij}$, which implies that the shear does not change the




The stress tensor gives us the full information on the nature of the restoring forces that 
operate at any given point inside the body specifying both the normal and the shearing 
stresses acting on all the sides of an infinitesimal cubic volume centred about the given 
point. We use the Cartesian axes for specifying the directions and the surfaces of the 
elementary cube. 

The stress tensor is a tensor of second rank, so that it requires two indices to specify
its elements. The first index stands for the component of the force that it represents and
the second index specifies the surface on which the above force component acts. The index
for a surface is given by the index for the outward drawn normal to the surface. Thus the
ijth component, $\sigma_{ij}$, of the stress tensor $\sigma$ corresponds to the ith component of the force
that acts on a unit area of a surface having an outward normal parallel to the jth axis.
Obviously $\sigma_{xx}$, $\sigma_{yy}$, $\sigma_{zz}$ are the normal stresses while $\sigma_{xy}$, $\sigma_{yx}$, ..., $\sigma_{zx}$, $\sigma_{xz}$ are the
shearing stresses (see Fig. 13.5). Some books, however, follow the opposite convention for
indexing ij, namely, the first index i for the surface and the second index j for the force
component. To specify all the elements of the tensor $\sigma_{ij}$, note that for i, j running over
the x, y, z axes, we have


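$\sigma_{ij}$ = Force in the ith direction acting on unit surface area perpendicular to the j axis.
By definition

$$\sigma_{ij} = \lim_{\Delta S_j \to 0} \frac{\Delta f_i}{\Delta S_j} = \lim_{\Delta S_j \to 0} \frac{\text{ith component of the force acting on the surface area } \Delta S_j}{\Delta S_j}$$

Therefore

$$\Delta f_i = \sigma_{ij} \Delta S_j \quad \text{where} \quad \Delta S_j = \epsilon_{jkl} \Delta x_k \Delta x_l \tag{13.22}$$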


Since both $\Delta f_i$ and $\Delta S_j$ are components of vector quantities which are not parallel in
general, by the quotient theorem, $\sigma_{ij}$ must be a tensor of the second rank. However, unlike
the strain tensor $e_{ij}$, the stress tensor $\sigma_{ij}$ need not always be a symmetric tensor. Now,
being a vector by nature, any arbitrary surface (area) element $d\mathbf{S}$ can be decomposed into
three rectangular components such as

$$d\mathbf{S} = dS_1 \hat{\mathbf{i}} + dS_2 \hat{\mathbf{j}} + dS_3 \hat{\mathbf{k}}$$

The net force $d\mathbf{F}$ on the surface area element $d\mathbf{S}$ due to stresses $\sigma_{ij}$ is given by the
components

$$dF_i = \sigma_{ij} dS_j \tag{13.23}$$

Therefore, the total surface force, acting on the volume V enclosed by the total surface
area S is given by

$$F_i = \oint_S \sigma_{ij} dS_j$$




experienced by the volume V is

$$\int_V \rho F_i \, dV$$

Here $F_i$ stands for the ith component of the body force and $\rho$ stands for the mass density.
Hence the ith component of the total force on the enclosed volume V is

$$\int_V \left( \frac{\partial \sigma_{ij}}{\partial x_j} + \rho F_i \right) dV$$

which must vanish if this enclosed portion is to be in translational equilibrium. As V
is arbitrary, the condition for translational equilibrium at any point of any continuously
deformed elastic body is given by

$$\frac{\partial \sigma_{ij}}{\partial x_j} + \rho F_i = 0 \tag{13.26}$$

where $\mathbf{F}$ is the external body force per unit mass of the body. In the absence of such an
external body force, the condition for translational equilibrium reduces to

$$\frac{\partial \sigma_{ij}}{\partial x_j} = 0 \tag{13.27}$$

The conditions (13.26) or (13.27) are a set of three scalar equations, each containing 3
terms due to the implied summation over j.

13.2.2 Conditions for Rotational Equilibrium of Elastic Bodies 

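Let us take the total moment $\mathbf{T}$ of the forces about the origin, that act on a chosen domain
of the elastic body enclosing a volume V with a boundary surface S. We have,

$$T_i = \int_V \rho (\mathbf{r} \times \mathbf{F})_i \, dV + \oint_S \epsilon_{ijk} x_j \sigma_{kl} \, dS_l$$

The second integral can be transformed using Gauss' divergence theorem, so that

$$\begin{aligned}
T_i &= \int_V \epsilon_{ijk} x_j F_k \rho \, dV + \int_V \frac{\partial}{\partial x_l} \left( \epsilon_{ijk} x_j \sigma_{kl} \right) dV \\
&= \int_V \epsilon_{ijk} x_j F_k \rho \, dV + \int_V \epsilon_{ijk} x_j \frac{\partial \sigma_{kl}}{\partial x_l} \, dV + \int_V \epsilon_{ijk} \delta_{jl} \sigma_{kl} \, dV
\end{aligned} \tag{13.28}$$

Now let us assume that the body is in translational equilibrium or, in other words, there
is no net force on the body. By Eq. (13.26), the first two of the above integrals then cancel
each other. Now such a body cannot experience a net torque, which requires that $T_i = 0$.
Hence the integrand $\epsilon_{ijk} \sigma_{kj}$ summed over all j and k must vanish. This means

$$\sigma_{jk} = \sigma_{kj} \tag{13.29}$$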

In other words, the stress tensor must be symmetric in order that the conditions of 




translational and rotational equilibrium be simultaneously satisfied. These conditions are 
fairly basic conditions since the body must remain in a rotational equilibrium state in the 
absence of any net force acting on the body. Thus if we derive a stress tensor which is 
not symmetric to start with, we must symmetrise it before using it for studying a physical 
situation involving a stress tensor. Another way to look at the above result is that, if 
£ <Tkj, then the torque produced by the aj k component will not be balanced by 
that produced by the <r ki component. An equal but opposite torque is produced by <r k j 
provided Eq. (13.29) is satisfied. 

Since $\sigma_{jk}$ is a symmetric tensor of second rank it can have at most six independent 
components. 

Again, just like the strain tensor $e_{kj}$, $\sigma_{kj}$ should also have the following properties. 

1. It can be diagonalised by a principal axes transformation, reducing only to the normal 
stresses (that is, all shearing stresses vanish for the planes perpendicular to the principal 
axes). So at every point inside an elastic body, one can orient the infinitesimal cube centred 
at the point in such a way that the cube does not experience any shearing stress. The axes 
of this cube are then the principal axes of the stress tensor at the given point. 

2. The stress tensor also forms an ellipsoid similar to the strain ellipsoid. The stress 
ellipsoid represented by the equation 

$$\psi = \sigma_{ij}\, x_i x_j = 1 \qquad (13.30)$$

is called Cauchy’s stress quadric. The axes of the ellipsoid correspond to the direction of 
principal tractions and obviously there is no tangential stress in any of the principal planes. 


13.3 Strain Energy 

A strained body acquires some energy due to the work done on the body during the process 
of deformation. We imagine the process of this deformation to be of the virtual type, that 
is, we only allow the displacement but no instantaneous velocity of the displacement (the 
latter can happen only if finite real time is allowed to elapse). However, if we account for 
this velocity, it will contribute to the energy of the system in the form of kinetic energy, and 
as soon as the body reaches an equilibrium configuration under the action of deformation 
forces, the kinetic energy part disappears. As our main interest is to find the energy of 
strain, we drop the kinetic energy associated with the actual process of straining and wait 
until it reaches the equilibrium state. This is what we mean here when we assume the 
displacement to be virtual. 

Consider an elastic body occupying a volume V, enclosed by a surface S before the 
deformation takes place. The virtual work done on this elastic body due to deformation, 
resulting in a virtual displacement field $\delta u_i$, is given by 

$$\delta W = \int_V \rho F_i\, \delta u_i \, dV + \oint_S \sigma_{ik}\, dS_k\, \delta u_i \qquad (13.31)$$


and for straining at constant entropy and volume, the total differential of the Gibbs function 
G'(T',p') becomes 


$$dG' = -S'\, dT' + V'\, dp' = -S'\, dT' - e_{ik}\, d\sigma_{ik} \qquad (13.38)$$


From Eqs (13.36) and (13.37) we get 

$$\sigma_{ik} = \left( \frac{\partial U'}{\partial e_{ik}} \right)_{S'} = \left( \frac{\partial F'}{\partial e_{ik}} \right)_{T'} \qquad (13.39)$$

and from Eq. (13.38), 

$$e_{ik} = -\left( \frac{\partial G'}{\partial \sigma_{ik}} \right)_{T'} \qquad (13.40)$$

These are the basic relations obeyed by the stress and strain tensors through the 
thermodynamic potentials. If a thermodynamic potential, such as the free energy for any deformed 
elastic body, is known as an explicit function of the strain tensor $e_{ik}$, then the stress tensor 
for such a body can be found by using Eq. (13.39). 

We expect that $F'$ should be an explicit function of $T'$ and $e_{ik}$, say 

$$F' = F'_0(T') + C_{ik}(T')\, e_{ik} + \frac{1}{2} C_{iklm}(T')\, e_{ik} e_{lm} + \frac{1}{3!} C_{iklmpq}(T')\, e_{ik} e_{lm} e_{pq} + \cdots$$

when expanded in a Taylor series in $e_{ik}$. 


13.4 POSSIBLE FORMS OF FREE ENERGY AND STRESS TENSOR 
FOR ISOTROPIC SOLIDS 

By definition, the physical properties of any isotropic solid should not depend on any specific 
orientation of the body in space. More specifically, under any arbitrary rotation the physical 
properties of an isotropic body, say the total free energy content of the body (which is a 
scalar), must remain invariant. However, by Eq. (13.37), free energy depends on the 
tensorial quantities, such as the stress and strain tensors, the components of which strictly 
depend on the choice of orientation of coordinate axes in space. Thus it is necessary that 
we express the free energy as functions of scalars only; these scalars may be formed out of 
the $e_{ik}$ tensor. 

We know that of all the scalars formed out of a second rank tensor, its trace and magnitude 
are the only two scalars that have no terms in the cubic or higher order products of the 
individual elements of the tensor. Therefore, these two quantities, given by 

$$e_{ii} = e_{11} + e_{22} + e_{33}$$

and 

$$(e_{ik})^2 = e_{ik} e_{ik} = e_{11}^2 + e_{22}^2 + e_{33}^2 + 2e_{12}^2 + 2e_{13}^2 + 2e_{23}^2 \qquad (13.41)$$

do not change under arbitrary rotations. 
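
The invariance of these two scalars can be checked numerically. The following is a minimal sketch and not part of the original text: the strain components and rotation angles are arbitrary illustrative values, and the helper names (`rotate_tensor`, `magnitude_squared`, and so on) are our own. It transforms a symmetric strain tensor as $e' = R\, e\, R^T$ and compares the trace and $e_{ik}e_{ik}$ before and after the rotation.

```python
import math


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def rotation_about_z(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def rotation_about_x(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]


def rotate_tensor(e, r):
    """Components of a second rank tensor in the rotated frame: e' = R e R^T."""
    return matmul(matmul(r, e), transpose(r))


def trace(e):
    return sum(e[i][i] for i in range(3))


def magnitude_squared(e):
    """The scalar e_ik e_ik, summed over both indices."""
    return sum(e[i][k] ** 2 for i in range(3) for k in range(3))


# Illustrative symmetric strain tensor (small, dimensionless components).
strain = [[1.0e-3, 2.0e-4, -1.0e-4],
          [2.0e-4, -5.0e-4, 3.0e-4],
          [-1.0e-4, 3.0e-4, 2.0e-3]]

# An arbitrary rotation built from two elementary rotations.
rotation = matmul(rotation_about_z(0.7), rotation_about_x(-1.2))
rotated = rotate_tensor(strain, rotation)

print("trace:      ", trace(strain), trace(rotated))
print("e_ik e_ik:  ", magnitude_squared(strain), magnitude_squared(rotated))
print("e_11 before and after:", strain[0][0], rotated[0][0])
```

The individual components, such as $e_{11}$, change under the rotation, while the two scalars agree to rounding error.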

What would be the explicit dependence of $F'$ on $e_{ik}$? Since in equilibrium (in the absence 
of any external stress) $e_{ik} = 0$ and $\sigma_{ik} = 0$, from the definition of $\sigma_{ik}$ given by Eq. 


terms. Since $e_{ij}$ is a tensor of the second rank, $C_{ijkl}$ must be a tensor of the fourth rank, 
and is called the most general stiffness constant. The stress tensor $\sigma_{ij}$ corresponding to 
this free energy is given by 

$$\sigma_{ij} = \left( \frac{\partial F'}{\partial e_{ij}} \right)_{T'} = C_{ijkl}\, e_{kl} \qquad (13.50)$$

Thus, for small deformations, the stress tensor depends linearly on the strain tensor $e_{kl}$ 
since $C_{ijkl}$ are constants. This is called the generalised Hooke’s law of elasticity. Robert 
Hooke gave this law in its simplest form in 1675. 

In order to arrive at this law we had to neglect the cubic and higher order terms in $e_{ij}$ 
in our expression for the free energy. So we can turn around to say that if Hooke’s law is 
to be accepted as an empirical law of nature, the cubic and higher order terms in $e_{ij}$ in the 
expression for $F'$ should not dominate the quadratic terms. In fact, there are cases in which, 
even for small $e_{ij}$’s, the coefficients C’s are such that the cubic terms become comparable 
to the quadratic ones. 

As it stands $C_{ijkl}$ has $3^4 = 81$ components all of which need not be independent. 
The maximum number of independent components in the $C_{ijkl}$ tensor depends crucially on the 
symmetry properties of $\sigma_{ij}$ and $e_{kl}$ which in turn depend on the geometrical symmetries 
of the solid under consideration. 

We shall follow Voigt’s analysis of the symmetry properties of $C_{ijkl}$. To start with, Voigt 
in 1887 introduced a new system of indexing. Noting that both $\sigma_{ij}$ and $e_{ij}$ are symmetric, 
we have only six independent components for each of the tensors so that one can treat them 
as six dimensional vectors. According to Voigt’s notation $e_{ij}$ ($i,j = 1,2,3$) is replaced by 
$S_i$ ($i = 1, 2, \ldots, 6$) and $\sigma_{ij}$ by $T_i$ ($i = 1, 2, \ldots, 6$) with the correspondence 

$$11 \to 1, \quad 22 \to 2, \quad 33 \to 3, \quad 23 \to 4, \quad 31 \to 5, \quad 12 \to 6$$

We can therefore express the free energy in Voigt’s notation as 

$$F' = F'_0 + \frac{1}{2} C_{ij}\, S_i S_j \qquad (13.51)$$

for small deformations $S_i$. $C_{ij}$ are the elastic stiffness constants which now number 36 in 
all instead of 81 according to the previous notation. The generalised Hooke’s law (13.50) 
now takes the form 

$$T_i = \frac{\partial F'}{\partial S_i} = C_{ij}\, S_j, \qquad i,j = 1, 2, \ldots, 6 \qquad (13.52)$$

Since the free energy is a scalar, one could also write it as 

$$F'_0 + \frac{1}{2} C_{ji}\, S_j S_i$$

as both i and j are summed; but it would now give $T_i = \partial F'/\partial S_i = C_{ji} S_j$, implying that $C_{ij}$ 
should be a six-dimensional symmetric tensor of rank 2. 

Therefore, the total number of independent components in $C_{ij}$ is $6^2 - {}^6C_2 = 36 - 15 = 21$. 
If a solid is devoid of any symmetry as is the case with the triclinic system, the maximum 
number of independent stiffness constants would be 21. 




Again, since the free energy $F'$ has to be a minimum around $F'_0$ for a stable equilibrium 
against any arbitrary small deformation, the matrix $C_{ij}$ must be positive definite, and 
therefore, nonsingular and invertible. Thus there exists an inverse of Hooke’s law 

$$S_i = C'_{ij}\, T_j \qquad (13.53)$$

such that $C'_{ij}$ is the matrix inverse of $C_{ij}$. The coefficients $C'_{ij}$ are thus called the compliance 
constants. 

For a monoclinic system there would be, in total, eight symmetry restrictions to be 
imposed. This leads to 13 independent stiffness constants. Similarly, there will be only 
9 independent stiffness constants for orthorhombic, 6 for trigonal and hexagonal, 5 for 
tetragonal and 3 for cubic crystals. For example, a cubic crystal would have the following 
nonzero stiffness constants: 

$$C = \begin{pmatrix}
C_{11} & C_{12} & C_{12} & 0 & 0 & 0 \\
C_{12} & C_{11} & C_{12} & 0 & 0 & 0 \\
C_{12} & C_{12} & C_{11} & 0 & 0 & 0 \\
0 & 0 & 0 & C_{44} & 0 & 0 \\
0 & 0 & 0 & 0 & C_{44} & 0 \\
0 & 0 & 0 & 0 & 0 & C_{44}
\end{pmatrix}$$

Therefore, for cubic crystals, 

$$F' = F'_0 + \frac{1}{2} C_{11} (S_1^2 + S_2^2 + S_3^2) + C_{12} (S_1 S_2 + S_2 S_3 + S_3 S_1) + \frac{1}{2} C_{44} (S_4^2 + S_5^2 + S_6^2) \qquad (13.54)$$

The three independent stiffness constants are $C_{11}$, $C_{12}$ and $C_{44}$. 

For isotropic solids, there is one more symmetry with respect to the cubic case, namely the 
choice of the axes. In the case of a cubic crystal the choice of the rectangular axes cannot 
be arbitrary, but for an isotropic solid this also becomes arbitrary. Hence in the expression 
for the free energy an extra condition appears in the form of 

$$C_{11} - C_{12} = C_{44}$$

One can now easily identify $C_{12}$ with $\lambda$, $C_{44}$ with $2\mu$ and $C_{11}$ with $\lambda + 2\mu$ for isotropic 
solids. 
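
As an illustrative check of this identification (a sketch under stated assumptions, not the authors' own procedure), the six-dimensional form $T_i = C_{ij} S_j$ can be compared with the tensor form $\sigma_{ik} = \lambda \Delta \delta_{ik} + 2\mu e_{ik}$ of Eq. (13.43). We assume here that the shear entries $S_4, S_5, S_6$ are the tensor components $e_{23}, e_{31}, e_{12}$ themselves, with no extra factor of 2, which is what the identification $C_{44} = 2\mu$ requires. The numerical values of $\lambda$, $\mu$ and the strain are arbitrary, and all function names are our own.

```python
# Voigt index pairs in the order 11, 22, 33, 23, 31, 12 (zero-based here).
VOIGT_PAIRS = [(0, 0), (1, 1), (2, 2), (1, 2), (2, 0), (0, 1)]


def isotropic_stiffness(lam, mu):
    """6x6 stiffness matrix with C12 = lam, C44 = 2*mu, C11 = lam + 2*mu."""
    c11, c12, c44 = lam + 2.0 * mu, lam, 2.0 * mu
    c = [[0.0] * 6 for _ in range(6)]
    for i in range(3):
        for j in range(3):
            c[i][j] = c11 if i == j else c12
    for i in range(3, 6):
        c[i][i] = c44
    return c


def to_voigt(tensor):
    """Six-component vector of a symmetric 3x3 tensor (S_4 = e_23, no factor 2)."""
    return [tensor[i][j] for (i, j) in VOIGT_PAIRS]


def hooke_voigt(c, s):
    """Generalised Hooke's law in Voigt form: T_i = C_ij S_j."""
    return [sum(c[i][j] * s[j] for j in range(6)) for i in range(6)]


def hooke_tensor(lam, mu, e):
    """Isotropic Hooke's law: sigma_ik = lam * Delta * delta_ik + 2 * mu * e_ik."""
    dilation = e[0][0] + e[1][1] + e[2][2]
    return [[lam * dilation * (1.0 if i == k else 0.0) + 2.0 * mu * e[i][k]
             for k in range(3)] for i in range(3)]


LAM, MU = 2.0, 1.5  # illustrative Lame constants in arbitrary units
strain = [[1.0e-3, 2.0e-4, -1.0e-4],
          [2.0e-4, -5.0e-4, 3.0e-4],
          [-1.0e-4, 3.0e-4, 2.0e-3]]

stiffness = isotropic_stiffness(LAM, MU)
stress_voigt = hooke_voigt(stiffness, to_voigt(strain))
stress_tensor = to_voigt(hooke_tensor(LAM, MU, strain))

print(stress_voigt)
print(stress_tensor)
```

Both routes give the same six stress components, and the matrix satisfies $C_{11} - C_{12} = C_{44}$ by construction.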


13.7 ELASTIC PROPERTIES OF ISOTROPIC SOLIDS 

The two independent elastic constants for isotropic solids are Lamé’s constants, $\lambda$ and $\mu$. 
Since $\mu$ is the rigidity modulus, in some sense, it should also define the rigidity. If $\mu$ 
is everywhere infinite so that $e_{ij}$ vanish for any finite $\sigma_{ij}$, the substance is an ideal rigid 
body. If $\mu$ is finite but nonzero then the substance is a perfect solid. Finally, if $\mu = 0$ the 
substance is a perfect fluid, that is, there cannot be any shearing stress developed in the 
motion of this fluid. Only the nonzero normal stress components will exist and correspond 
to the hydrostatic pressure p, so that for ideal fluids, from Eq. (13.43) with $\mu = 0$, and 

$\mu > 0$ we get for Poisson’s ratio, from Eq. (13.58), 

$$-1 < \sigma' < \frac{1}{2} \qquad (13.60)$$

However, Eq. (13.57), coupled with the conditions $\lambda > 0$ and $\mu > 0$ gives 

$$0 < \sigma' < \frac{1}{2} \qquad (13.61)$$

Observationally, no substance has been found with a negative value of $\sigma'$, confirming the 
general validity of the condition (13.61) in preference to (13.60). For a rubber-like object 
$\sigma' \to 1/2$; for earth-like solids $\lambda \approx \mu$, giving $\sigma' = 1/4$. 
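
The two ranges can be illustrated with a short numerical sketch. It relies on the standard isotropic relations $\sigma' = \lambda / 2(\lambda + \mu)$ and $\sigma' = (3K - 2\mu)/2(3K + \mu)$, with $K = \lambda + 2\mu/3$; we assume, without being able to confirm it from the text at hand, that these correspond to Eqs (13.57) and (13.58). The function names and sample values are our own.

```python
def poisson_from_lame(lam, mu):
    """Poisson's ratio from Lame's constants: lam / (2 (lam + mu))."""
    return lam / (2.0 * (lam + mu))


def poisson_from_bulk(bulk, mu):
    """Poisson's ratio from bulk modulus K and rigidity modulus mu."""
    return (3.0 * bulk - 2.0 * mu) / (2.0 * (3.0 * bulk + mu))


# Earth-like solid: lam equal to mu.
print(poisson_from_lame(1.0, 1.0))

# Rubber-like solid: rigidity much smaller than lam.
print(poisson_from_lame(1.0, 1.0e-6))

# Limits of the wider range, reached as K -> 0 or mu -> 0 with both positive.
print(poisson_from_bulk(1.0e-12, 1.0))
print(poisson_from_bulk(1.0, 1.0e-12))

# Both expressions agree when K = lam + 2 mu / 3.
lam, mu = 2.0, 1.5
print(poisson_from_lame(lam, mu), poisson_from_bulk(lam + 2.0 * mu / 3.0, mu))
```

With $K > 0$ and $\mu > 0$ alone the ratio can approach $-1$, whereas requiring $\lambda > 0$ as well keeps it positive.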

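13.7.3 Interrelation Between $\sigma_{ik}$ and $e_{ik}$ 

Equation (13.43), written in terms of $(K, \mu)$ and $(Y, \sigma')$ respectively, looks like 

$$\sigma_{ik} = \left( K - \frac{2}{3}\mu \right) \Delta\, \delta_{ik} + 2\mu\, e_{ik} \qquad (13.62)$$

or 

$$\sigma_{ik} = \frac{Y \sigma'}{(1 + \sigma')(1 - 2\sigma')}\, \Delta\, \delta_{ik} + \frac{Y}{1 + \sigma'}\, e_{ik} \qquad (13.63)$$

Inverting Eq. (13.43) we get 

$$e_{ik} = \frac{1}{2\mu} \left[ \sigma_{ik} - \lambda \Delta\, \delta_{ik} \right]$$

To express $\Delta$ in terms of $\sigma_{ik}$ we define the trace of the stress tensor 

$$\Sigma = \sigma_{ii} = \sigma_{11} + \sigma_{22} + \sigma_{33} = \lambda \Delta\, \delta_{ii} + 2\mu\, e_{ii} = \Delta (3\lambda + 2\mu) \qquad (13.64)$$

Using Eq. (13.64) we get 

$$e_{ik} = \frac{1}{2\mu} \left[ \sigma_{ik} - \frac{\lambda \Sigma}{3\lambda + 2\mu}\, \delta_{ik} \right] \qquad (13.65)$$

This expresses $e_{ik}$ in terms of the $\sigma_{ik}$ if the latter components are known a priori. 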
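
A quick round trip confirms that Eq. (13.65) really inverts Eq. (13.43). The sketch below is illustrative only: the moduli are hypothetical round numbers (in pascals) and the function names are our own. It computes the stress from a given strain and then recovers the strain from that stress.

```python
def delta(i, k):
    """Kronecker delta."""
    return 1.0 if i == k else 0.0


def stress_from_strain(lam, mu, e):
    """Eq. (13.43): sigma_ik = lam * Delta * delta_ik + 2 * mu * e_ik."""
    dilation = sum(e[i][i] for i in range(3))
    return [[lam * dilation * delta(i, k) + 2.0 * mu * e[i][k]
             for k in range(3)] for i in range(3)]


def strain_from_stress(lam, mu, sigma):
    """Eq. (13.65): e_ik = [sigma_ik - lam * Sigma * delta_ik / (3 lam + 2 mu)] / (2 mu)."""
    trace = sum(sigma[i][i] for i in range(3))
    return [[(sigma[i][k] - lam * trace * delta(i, k) / (3.0 * lam + 2.0 * mu))
             / (2.0 * mu) for k in range(3)] for i in range(3)]


LAM, MU = 5.0e10, 3.0e10  # hypothetical Lame constants in Pa
strain = [[1.0e-3, 2.0e-4, -1.0e-4],
          [2.0e-4, -5.0e-4, 3.0e-4],
          [-1.0e-4, 3.0e-4, 2.0e-3]]

stress = stress_from_strain(LAM, MU, strain)
recovered = strain_from_stress(LAM, MU, stress)

max_error = max(abs(recovered[i][k] - strain[i][k])
                for i in range(3) for k in range(3))
print("largest round-trip error in strain:", max_error)
```

The recovered strain matches the original to rounding error, and the trace of the stress equals $\Delta (3\lambda + 2\mu)$ as in Eq. (13.64).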


13.7.4 Conditions for Translational Equilibrium Applied to Isotropic Solids 


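The condition for translational equilibrium of an isotropic solid under the action of an 
external body force g per unit mass is given by Eq. (13.26), 

$$\rho g_i + \frac{\partial \sigma_{ik}}{\partial x_k} = 0$$

Using Eq. (13.43) along with the definitions of $e_{ik}$ and $\Delta$, we can write, 

$$\frac{\partial \sigma_{ik}}{\partial x_k} = \lambda\, \delta_{ik} \frac{\partial \Delta}{\partial x_k} + \mu \frac{\partial}{\partial x_k} \left( \frac{\partial u_i}{\partial x_k} + \frac{\partial u_k}{\partial x_i} \right)$$

$$= \lambda \frac{\partial \Delta}{\partial x_i} + \mu \frac{\partial^2 u_i}{\partial x_k^2} + \mu \frac{\partial}{\partial x_i} \left( \frac{\partial u_k}{\partial x_k} \right)$$

$$= (\lambda + \mu) \frac{\partial \Delta}{\partial x_i} + \mu \nabla^2 u_i$$

where $\nabla^2 = \partial^2 / \partial x_k^2$ is the Laplacian operator. 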

Therefore the condition for translational equilibrium is 

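$$\rho \mathbf{g} + (\lambda + \mu) \nabla \Delta + \mu \nabla^2 \mathbf{u} = 0 \qquad (13.66)$$

Note that $\nabla^2 \mathbf{u}$ is a vector quantity with components $\nabla^2 u_i$ ($i = 1,2,3$). Again, using a 
well known vector identity for ‘curl of curl’ we get 

$$\nabla \times (\nabla \times \mathbf{u}) = \nabla (\nabla \cdot \mathbf{u}) - \nabla^2 \mathbf{u} = \nabla \Delta - \nabla^2 \mathbf{u}$$

or 

$$\nabla^2 \mathbf{u} = \nabla \Delta - \nabla \times (\nabla \times \mathbf{u})$$

Substituting in Eq. (13.66) gives 

$$\rho \mathbf{g} + (\lambda + 2\mu) \nabla \Delta - \mu \nabla \times (\nabla \times \mathbf{u}) = 0 \qquad (13.67)$$

which is another form of the translational equilibrium condition (Eq. (13.66)). Yet another 
form can be obtained by taking the divergence of Eq. (13.67) which gives 

$$\rho \nabla \cdot \mathbf{g} + (\lambda + 2\mu) \nabla^2 \Delta = 0 \qquad (13.68)$$

For a gravitational field $\nabla \cdot \mathbf{g}$ is given by Poisson’s equation, that is, $\nabla \cdot \mathbf{g} + 4\pi G \rho = 0$, 
so that for any isotropic solid resting in a gravitational field we must have 

$$\nabla^2 \Delta = \frac{4\pi G \rho^2}{\lambda + 2\mu}$$

Usually $G\rho^2 \ll \lambda + 2\mu$, giving $\nabla^2 \Delta \approx 0$. We can further operate a Laplacian $\nabla^2$ on 
both sides of Eq. (13.66) to get 

$$\nabla^4 \mathbf{u} = 0 \qquad (13.69)$$

that is, a biharmonic equation is satisfied by the displacement vector field $u_i$. Equation 
(13.68) is also valid for $\mathbf{g}$ = constant throughout the body. 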


13.8 PROPAGATION OF ELASTIC WAVES IN ISOTROPIC ELASTIC MEDIA 

In this case the body is no longer in translational equilibrium but each point of the solid 
responds with an acceleration $\partial^2 \mathbf{u} / \partial t^2$, where $\mathbf{u}$ is the displacement at an arbitrary point 
$(x, y, z)$. 

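The equations of motion are 

$$\rho \frac{\partial^2 u_i}{\partial t^2} = \rho g_i + \frac{\partial \sigma_{ik}}{\partial x_k} \qquad (13.70)$$

or, in vector notation and using Eq. (13.66), 

$$\rho \frac{\partial^2 \mathbf{u}}{\partial t^2} = \rho \mathbf{g} + (\lambda + \mu) \nabla \Delta + \mu \nabla^2 \mathbf{u} \qquad (13.71)$$

$$= \rho \mathbf{g} + (\lambda + 2\mu) \nabla \Delta - \mu \nabla \times (\nabla \times \mathbf{u}) \qquad (13.72)$$

Now taking the divergence of Eq. (13.72), we get 

$$\rho \frac{\partial^2 \Delta}{\partial t^2} = \rho (\nabla \cdot \mathbf{g}) + (\lambda + 2\mu) \nabla^2 \Delta \qquad (13.73)$$

and taking the curl of Eq. (13.71) gives 

$$\rho \frac{\partial^2 \boldsymbol{\omega}}{\partial t^2} = \frac{1}{2} \rho (\nabla \times \mathbf{g}) + \mu \nabla^2 \boldsymbol{\omega} \qquad (13.74)$$

where we have used Eq. (13.11) for the definition of $\boldsymbol{\omega}$. 

Equations (13.73) and (13.74) represent the wave equations for the propagation of a $\Delta$-wave 
and an $\omega$-wave respectively. The first one is a wave of dilation and compression and 
the second one is a wave of rotational displacement. It is to be noted that if $\mathbf{g}$ corresponds 
to the gravitational field, $\nabla \times \mathbf{g} = 0$ and $\nabla \cdot \mathbf{g} = -4\pi G \rho$, so that Eq. (13.74) 
becomes independent of the existence of any external field derivable from any potential, 
such as gravity, 

$$\rho \frac{\partial^2 \boldsymbol{\omega}}{\partial t^2} = \mu \nabla^2 \boldsymbol{\omega} \qquad (13.75)$$

and in Eq. (13.73) one may neglect the div $\mathbf{g}$ term if $G\rho^2 \ll \lambda + 2\mu$; then, 

$$\rho \frac{\partial^2 \Delta}{\partial t^2} = (\lambda + 2\mu) \nabla^2 \Delta \qquad (13.76)$$

and the general Eq. (13.71) takes the form 

$$\rho \frac{\partial^2 \mathbf{u}}{\partial t^2} = (\lambda + \mu) \nabla \Delta + \mu \nabla^2 \mathbf{u} \qquad (13.77)$$

Obviously, from Eq. (13.76), the speed of propagation of the compressional wave in an 
isotropic solid is 

$$C_l = \sqrt{\frac{\lambda + 2\mu}{\rho}} = \sqrt{\frac{K + 4\mu/3}{\rho}} \qquad (13.78)$$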




When the condition (13.84) is satisfied, Eq. (13.82) becomes 



(13.86) 


which can be satisfied only if the propagational wave vector $\mathbf{k}$ is parallel to $\mathbf{u}$, the 
displacement vector. This means that the displacement takes place in the same direction as the 
plane wave propagates. Such a wave is called a longitudinal wave and its speed of 
propagation is $C_l = \sqrt{(\lambda + 2\mu)/\rho}$ which is the same as the speed of propagation of rarefaction 
and compression (see Eq. (13.78)). 

For the case (b) we need $\mathbf{k} \cdot (d^2 \mathbf{f} / d\theta^2) = 0$ which means that $\mathbf{f}$ (= $\mathbf{u}$) is perpendicular 
to $\mathbf{k}$, if the form of $\mathbf{f}$ is that of a plane sinusoidal wave. So the displacement $\mathbf{u}$ is transverse 
to the direction of propagation ($\mathbf{k}$). These are the so called transverse waves. Now from Eq. 
(13.82) with condition (b) above, we get, 

$$\rho\, \omega'^2 = \mu k^2 \qquad (13.87)$$

so that the speed of propagation of the transverse waves is 

$$C_t = \frac{\omega'}{k} = \sqrt{\frac{\mu}{\rho}}$$

It is also possible to eliminate all the vector components of $d^2 \mathbf{f} / d\theta^2$ from the vector 
Eq. (13.82) and a general dispersion relation $\omega'(\mathbf{k})$ can be obtained in the form of a secular 
equation. Such a dispersion relation turns out to be a sixth degree polynomial in $\omega'$ with 
the solutions given by 

$$\frac{\omega'}{k} = \pm C_l, \quad \pm C_t \quad \text{and} \quad \pm C_t$$

implying that there can be two independent transverse modes and one longitudinal mode of 
elastic waves, each being capable of propagating in both the forward and backward directions 
(for each mode), in any isotropic solid. These two transverse modes mimic those of light 
waves propagating in vacuum in the sense that both cases require two degrees of freedom 
for the description of their polarizations. They are normally referred to as SH (secondary 
horizontal) and SV (secondary vertical) polarization states for the transverse mode of elastic 
wave propagations in isotropic solid media. 


13.8.2 Seismic Waves 


The message of an earthquake is transmitted through the solid body of the earth in the 
form of propagating elastic waves. One can approximate the earth as an isotropic solid with 
$\lambda \approx \mu$ and $\sigma' \approx 0.25$ so that $C_l : C_t \approx \sqrt{3} : 1$. In the language of seismology 
the longitudinal or compressional waves are called P waves (push wave, or primary wave or 
pressure wave) and the torsional waves are called S waves (shake wave, secondary wave, or 
shear wave). Obviously P waves arrive earlier than S waves. The time difference of their 
arrival at a seismic station carries the information of its distance from the centre of the 
earthquake. 
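
The use of the arrival time difference can be made concrete with a small sketch. It is a deliberately crude model and not a seismological procedure: we assume a uniform medium with straight ray paths, so that a source at distance $d$ gives a lag $d/C_t - d/C_l$, and the elastic constants and density below are hypothetical round numbers chosen only so that $\lambda = \mu$.

```python
import math


def longitudinal_speed(lam, mu, rho):
    """Speed of P waves: sqrt((lam + 2 mu) / rho)."""
    return math.sqrt((lam + 2.0 * mu) / rho)


def transverse_speed(mu, rho):
    """Speed of S waves: sqrt(mu / rho)."""
    return math.sqrt(mu / rho)


def epicentral_distance(lag, c_l, c_t):
    """Distance d solving d / c_t - d / c_l = lag (uniform medium, straight path)."""
    return lag * c_l * c_t / (c_l - c_t)


LAM = MU = 3.0e10   # hypothetical Lame constants in Pa, with lam = mu
RHO = 2.7e3         # hypothetical density in kg per cubic metre

c_l = longitudinal_speed(LAM, MU, RHO)
c_t = transverse_speed(MU, RHO)
lag = 60.0          # S wave arrives one minute after the P wave (illustrative)
distance = epicentral_distance(lag, c_l, c_t)

print("C_l / C_t =", c_l / c_t)
print("C_l, C_t in km/s:", c_l / 1000.0, c_t / 1000.0)
print("distance in km:", distance / 1000.0)
```

For $\lambda = \mu$ the speed ratio comes out as $\sqrt{3}$ regardless of the particular values chosen. In the real earth the speeds vary with depth and the rays are curved, so travel-time tables replace this single formula.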

Sound waves, like light waves, undergo regular reflections and refractions across the 
boundary between two layers of the earth having different density structures. The relative 
intensities of the seismic waves that are detected at a large number of stations distributed 
all around the world, can give the most useful information on the density structure of the 
interior of the earth. We also know that S waves cannot propagate through any perfect 
fluid. Hence the intricate pattern of the various reflected and refracted P and S waves from 
different layers of the earth’s interior, can also suggest whether the whole of the interior is 
in the solid state or not. 

For example, it is observationally found that no seismic centre receives any strong P wave 
signal from any earthquake centre if the epicentre of the earthquake lies between 105° and 
142° (of the geometric arc) from the receiving centre. This zone is known as the shadow 
zone of the seismic P wave and results from the reflection of P and S waves from a surface 
of major discontinuity in density and structure lying at a depth of 2900 km from earth’s 
surface. This boundary is called the core mantle boundary across which the speed of P waves 
drops abruptly from 13.6 km/sec to 8.1 km/sec and that of the S wave from 7.3 km/sec to 
practically zero. It means that the earth’s core is in a fluid state. However, there are certain 
other complicated features of the P and S wave reflections, that suggest the existence of an 
inner central core (radius = 1250 km) in a rather solid state. 


13.9 SUMMARY 

When compared to rigid bodies, homogeneously and linearly strained solid bodies have twice 
as many degrees of freedom. Out of these 12 degrees of freedom, three are for translation, 
three for rotation, three for dilation and the rest three for shearing. Dilation corresponds to 
the trace of the strain tensor, and the shearing is contributed by the off diagonal terms. In 
the same way as the concept of inertial ellipsoid was introduced in chapter 12 for motions of 
rigid bodies, the idea of strain ellipsoid is useful for the description of homogeneous elastic 
strains in elastic bodies. Stress is viewed as a tensor of the second rank. If the rotational 
equilibrium is a desirable consequence of translational equilibrium of a strained solid body, 
the stress tensor has to be symmetric. Again, since stresses are supposed to produce strains, 
the two must be related. For small deformations, this relation is universally linear in nature 
and gives rise to what is called the generalised Hooke’s law of elasticity. But the directions 
of strains are not, in general, parallel to the directions of applied stresses, requiring that 
the constants of proportionality in Hooke’s law must behave as tensors of the fourth rank. 
Voigt’s ingenious scheme of notation is used to clearly demonstrate that the most general 
solids, with the least symmetry in their lattice structure, will have at most 21 independent elastic 
constants. The solids of the cubic class have three and the isotropic solids have only two 
independent elastic constants. 

The propagation of elastic waves in an isotropic solid medium can take place in all three 
perpendicular directions; the longitudinal one is a propagation of the pressure wave and the 
two degenerate transverse modes correspond to the rotational or shear waves. The speed of 
propagation of the first one, called P wave, is considerably higher than those of the shear 
waves called the S waves. This fact is used for inferring the distances to the epicentres of 
the earthquakes from the seismic stations. 





14 

Fluid Dynamics 


14.0 INTRODUCTION 

In this chapter we deal with motion of matter in a continuous form that exists in the fluid 
phase, where any relative motion between various parts of the system is permissible. 
Obviously the number of degrees of freedom becomes virtually infinite, and discrete mechanics 
cannot handle the situation. An entirely new technique is required, which was primarily 
developed by Daniel Bernoulli and Leonhard Euler. 

Daniel Bernoulli (1700 - 1782), son of Jean Bernoulli, was born in the Netherlands. 
Having got his doctorate in medicine, he became a professor of mathematics at St. Petersburg 
Academy in 1725. He immediately invited Euler to join the same institute, and both of them 
had a good time there. It was during his stay in St. Petersburg that Daniel finished his 
book Hydrodynamica, sive de viribus et motibus fluidorum commentarii in thirteen chapters, 
which was finally published in 1738. Having had enough of Euler and Russia, he returned 
to Basel in Switzerland to become a professor of anatomy and botany and of natural 
philosophy. However, his father, Jean Bernoulli, did not like his son’s book on hydrodynamics, 
particularly the ad hoc nature of the equation of motion (the so called Bernoulli’s equations 
of motion), and began to criticise the book openly, in spite of the fact that Bernoulli’s 
equation did explain a lot of natural phenomena. 

So D’Alembert got interested in the study of the motion of fluids from the mechanical 
point of view, following the criticism of Jean Bernoulli, and published a book in 1744 entitled 
Traité de l’équilibre et du mouvement des fluides. D’Alembert also studied the vibrations of 
strings and obtained the second order partial differential equation for propagating waves in 
1747, together with the most general solution. (All these are duly named after D’Alembert.) 

Real progress took place only when Euler became interested in fluid dynamics, and 
decided to resolve the crises raised by the works of Bernoulli and D’Alembert. In 1755, Euler 
presented a number of papers to the Berlin Academy on the theory and practice of 
hydrostatics. There he derived his famous equation of motion for perfect fluids, and as a special 
case derived the Bernoulli equation in a rigorous manner. Even today, people feel 
uncomfortable about the Bernoulli equation which defies common sense in many respects. We 
shall try to justify this with the most recent interpretations. 

Apart from Bernoulli and Euler, significant contributions have come from Lagrange, 
Cauchy, Poiseuille, Jacobi, Reynolds, Stokes, Kelvin, Helmholtz, and innumerable other 
workers. Inclusion of viscosity, turbulence and chaotic behaviour has made modern fluid 
dynamics flourish in all its diversity, and it is at present one of the most rapidly developing 
branches of classical physics. Its immediate application to plasma physics and 
magnetohydrodynamics brings it to the focal point of all such studies, and therefore, one should be 
properly introduced to the subject, which is the aim of this chapter. 


14.1 A FEW BASIC DEFINITIONS 

Basic Fluid Dynamical Variables: We assume the system to be a continuous medium and 
at each point of such a medium there exist scalar point functions, namely the pressure 
$p(x,y,z,t)$ and the density $\rho(x,y,z,t)$. If these are also functions of time, the medium 
must be in a dynamic state. The specification of the state of motion of any constituent 
particle at any point $(x, y, z)$ is made by defining a vector point function (a vector field) 
called the velocity vector field $\mathbf{q}(x,y,z,t)$. The coordinates of any point are always referred 
to with respect to a fixed frame of reference. These five quantities $p(\mathbf{r},t)$, $\rho(\mathbf{r},t)$ and $q_i(\mathbf{r},t)$, 
$i = 1,2,3$, are regarded as the five basic dynamical variables of the fluid system in motion. 

Fluid: A fluid is a continuous medium which has the property that, when it is in (dynamical) 
equilibrium, the shearing stress must vanish at every point and the pressure function must 
completely specify the stress tensor. A fluid is said to be in equilibrium when the quantities 
$p$, $\rho$ and $\mathbf{q}$ do not change with time for an observer moving with a given element of the fluid 
(see below). 

Liquid: A fluid is said to be a liquid if it is possible to confine it in such a way that it 
exhibits a free boundary surface while it is in equilibrium. 

Gas: A fluid is said to be a gas, if any attempt to confine it in a given bounded region defined 
by rigid boundaries makes it expand to the extent of completely filling the container. 

Compressibility: When an increase in the stress acting on a fluid results in a decrease in 
volume or a proportionate increase in the density, the fluid is said to be compressible. If 
no change in volume occurs subject to any finite change in the value of the stress, the 
fluid is said to be incompressible. To a first approximation most liquids can be regarded as 
incompressible. 

Hydrodynamics: It is the study of motion of incompressible fluids. In fact any prefix hydro 
implies the assumption of incompressibility. This definition of hydro is applicable even to 
magnetohydrodynamics. 

Hydrostatics: It is the study of the properties of incompressible fluids in equilibrium. 

Fluid Dynamics: This deals with the motion of both compressible and incompressible fluids. 

Perfect Fluid: We have seen that a fluid in equilibrium cannot develop any tangential stress 
in it. Some fluids can still satisfy this condition even if they are not in a state of equilibrium. 
Such fluids are called perfect or ideal fluids. Those fluids which develop shearing stresses 
while having a differential motion are called imperfect or non-ideal fluids. There are further 
classifications of imperfect fluids which we shall introduce later. For the present we shall 
assume that the fluid is perfect, that is, the relation, 

$$\sigma_{ij} = -p\, \delta_{ij}$$

is valid for all possible states of motion of the fluid. 


14.2 THE CENTRAL PROBLEM OF FLUID DYNAMICS 

When the fluid is in motion, its dynamical state at any instant can be completely specified 
by the knowledge of the five quantities $p$, $\rho$, $q_1$, $q_2$, $q_3$. Each of these is a function of $\mathbf{r}$ and 
t. The central problem of fluid dynamics is to determine the explicit dependence of these 
five dynamical variables on r and t. Thus five non-trivial equations relating these variables 
are needed in order to solve for them. Specifications of an equation of state, an equation of 
continuity and one equation of motion in vector form usually satisfy these basic requirements. 


14.3 EQUATION OF STATE 

An equation of state, by definition, is an explicit relation between the pressure and the 
density of any given fluid, that is, $p$ as a function of $\rho$. But more often such a relation 
also involves other parameters, such as temperature, entropy, etc. This invariably means an 
increase in the number of variables involved and hence one has to look for extra equations. 
An ideal gas, for example, has an equation of state $p = \rho R T' / \mu$, where $\mu$ is the mean 
molecular weight of the gas, $T'$ the temperature and $R$ the gas constant. If we want to keep 
$p$ as a function of $\rho$ alone, we must consider either an isothermal (that is, $T'$ = constant) 
case or an adiabatic (that is, isentropic) case, for which $p \propto \rho$ and $p \propto \rho^\gamma$ are respectively 
valid, with $\gamma = c_p / c_v$, the ratio of the two specific heats. For simplicity one can parametrise 
the relationship in the form of $p \propto \rho^\gamma$, where $\gamma$ can be regarded as a constant but adjustable 
(floating) index. A physicist prefers to call an equation of state ‘hard’ if $\gamma > 2$, and ‘soft’ if 
an effective $\gamma < 2$. However, when $\gamma = 1$, it is the value of the constant of proportionality 
that defines the hardness of the state. For an incompressible fluid the equation of state is 
simply $\rho$ = constant, where the constant is independent of $p$. A fluid for which pressure $p$ 
depends only on the density is called a barotropic fluid. 


14.4 TYPES OF TIME RATES OF CHANGE OF QUANTITIES 

Before we move on to derive the equation of continuity, we need to introduce three kinds of 
time derivatives that usually appear in the description of the motion of any fluid. 

(i) The local rate of change of any quantity is defined to be the rate of change of the 
quantity measured at a given point in the fluid that is fixed with respect to a fixed frame. 
If P is a fixed point with coordinates $(x, y, z)$ referred to the fixed frame then the local rate 

that is, when it prefers to swim. If $\mathbf{v}$ is the velocity of the swimming fish with respect to 
the shore (or any other fixed frame), then 

$$\frac{d\rho}{dt} = \frac{\partial \rho}{\partial t} + (\mathbf{v} \cdot \nabla) \rho \qquad (14.5)$$

Again a few more definitions: 

Steady State: A quantity is said to be in a steady state if its local time derivative is zero, 
that is, $\partial / \partial t = 0$. In other words, at any fixed point the quantity does not change with 
time. 

Equilibrium or Stationary State: A quantity is said to be in equilibrium or stationary 
state, if its comoving time derivative is zero, that is, $D/Dt = 0$. That is, if one moves with 
the element of fluid, the quantity under consideration does not seem to change with time. 

When the entire fluid motion is in the steady state, none of the quantities $p$, $\rho$ and $\mathbf{q}$ 
seems to change locally, although they may have different values at different points. That 
is, they do not have explicit time dependence in their functional form. On the other hand, 
when the entire fluid is in stationary state (that is, for an equilibrium flow), none of the 
quantities $p$, $\rho$ and $\mathbf{q}$ should change with respect to any comoving observer associated with 
any specific fluid element. 
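
The difference between the two states can be seen in a one-dimensional toy example. The sketch below is illustrative and uses assumed quantities throughout: a uniform flow speed `FLOW_SPEED`, two made-up density profiles, and central finite differences for the derivatives. It evaluates the local rate $\partial/\partial t$ and the comoving rate $D/Dt = \partial/\partial t + q\, \partial/\partial x$ for a profile that is fixed in space and for one that is carried along with the flow.

```python
import math

FLOW_SPEED = 2.0  # uniform fluid speed q along x (illustrative value)


def steady_density(x, t):
    """A profile fixed in space: no explicit time dependence."""
    return 1.2 * math.exp(-x / 10.0)


def advected_density(x, t):
    """A profile carried along with the flow: depends only on x - q t."""
    return 1.2 * math.exp(-((x - FLOW_SPEED * t) / 10.0) ** 2)


def local_rate(f, x, t, h=1.0e-5):
    """Central-difference estimate of the local derivative at a fixed point."""
    return (f(x, t + h) - f(x, t - h)) / (2.0 * h)


def gradient(f, x, t, h=1.0e-5):
    """Central-difference estimate of the spatial derivative."""
    return (f(x + h, t) - f(x - h, t)) / (2.0 * h)


def comoving_rate(f, x, t, q=FLOW_SPEED):
    """Comoving derivative D/Dt = local rate + q * gradient."""
    return local_rate(f, x, t) + q * gradient(f, x, t)


x0, t0 = 3.0, 1.0
print("steady:   local =", local_rate(steady_density, x0, t0),
      " comoving =", comoving_rate(steady_density, x0, t0))
print("advected: local =", local_rate(advected_density, x0, t0),
      " comoving =", comoving_rate(advected_density, x0, t0))
```

The first profile is steady but not stationary, since an observer drifting with the fluid sees the density change. The second is stationary but not steady, since the density at a fixed point changes while the comoving observer sees none.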


14.5 EQUATION OF CONTINUITY 


Let a region of space having a total volume V and a boundary surface S be fixed with respect 
to time. Let a quantity $\psi = \psi(\mathbf{r}, t)$ be defined at all points of the fluid and represent a density 
of some physical quantity (for example, mass density, momentum density, energy density, etc.). 
The net local rate of change of this quantity integrated over volume V is, by definition: 

$$\int_V \frac{\partial \psi}{\partial t} \, dV$$

This must equal the total rate of its influx through the boundary surface S which consists 
of two components, one due to the motion of the quantity along with the motion of the fluid 
through the boundary surface S, that is, 

$$-\oint_S (\psi \mathbf{q}) \cdot d\mathbf{S}$$

and a second component due to a generating flux $\mathbf{J}_\psi$ of the quantity at every point of the 
fluid, giving an influx of 

$$-\oint_S \mathbf{J}_\psi \cdot d\mathbf{S}$$

A total generation of the quantity inside the volume V at a rate $g_\psi$ is also possible which 
amounts to an increase in the quantity inside V given by 

$$\int_V g_\psi \, dV$$


Second, if the motion is stationary, that is, $D\rho/Dt = 0$, then 

$$\nabla \cdot \mathbf{q} = 0 \qquad (14.14)$$

which is valid for any fluid, be it compressible or incompressible, ideal or nonideal. 


14.6 APPLICATION TO LIOUVILLE’S THEOREM 


Consider the phase space of a conservative system. Under the natural motion of the phase 
space, the velocity t> of any point (gi,g2,--,g n ;Pi>P2»-->Pn) in the phase space is given 
by the components ft,®,••*,</„ and p'i,P2, • • • ,p„ where n is the number of DOF of the 
system. Therefore, the divergence of v taken over the 2n dimensions of the phase space is: 



= V (?2i + ?Ei\ 
dpj 

sr \l > (— \ _ JL ( ?1L > 

UpJ dvAdqiJ i 

- 0 (14.15) 


where $H(q_1, q_2, \ldots, q_n; p_1, p_2, \ldots, p_n; t)$ is the Hamiltonian of the system. If the density of
the ‘phase fluid’ at any instant is $\rho$, then, by the equation of continuity,

$$\frac{D\rho}{Dt} = -\rho\, \nabla \cdot \mathbf{v} = 0 \tag{14.16}$$

Hence $\rho$ is stationary, that is, the comoving density of the phase fluid does not change with
time. So a region of the phase space containing the particles evolves as a whole but the
density of the phase fluid remains unchanged. This is the essence of Liouville’s theorem
(1838).
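
The vanishing divergence of the phase-space velocity can be checked numerically. The sketch below is only an illustration under stated assumptions: the pendulum Hamiltonian $H = p^2/2 + (1 - \cos q)$, the step sizes and the sample points are choices made here, not taken from the text. It builds $(\dot{q}, \dot{p})$ from Hamilton's equations by central differences and then differentiates once more to estimate the divergence.

```python
import math


def hamiltonian(q, p):
    """Pendulum Hamiltonian H = p^2/2 + (1 - cos q); an illustrative choice."""
    return 0.5 * p * p + (1.0 - math.cos(q))


def phase_velocity(q, p, h=1e-5):
    """Return (q_dot, p_dot) = (dH/dp, -dH/dq) using central differences."""
    dH_dp = (hamiltonian(q, p + h) - hamiltonian(q, p - h)) / (2 * h)
    dH_dq = (hamiltonian(q + h, p) - hamiltonian(q - h, p)) / (2 * h)
    return dH_dp, -dH_dq


def phase_divergence(q, p, h=1e-3):
    """Estimate d(q_dot)/dq + d(p_dot)/dp at the phase point (q, p)."""
    dqdot_dq = (phase_velocity(q + h, p)[0] - phase_velocity(q - h, p)[0]) / (2 * h)
    dpdot_dp = (phase_velocity(q, p + h)[1] - phase_velocity(q, p - h)[1]) / (2 * h)
    return dqdot_dq + dpdot_dp


# Arbitrary sample points in the (q, p) plane.
sample_points = [(0.3, 0.1), (1.2, -0.7), (2.5, 1.9)]
divergences = [phase_divergence(q, p) for q, p in sample_points]
print(divergences)
```

Up to finite-difference and round-off error, each printed value should be indistinguishable from zero, which is the content of Eq. (14.15) for this particular Hamiltonian.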


14.7 EQUATIONS OF MOTION 


We apply Newton’s second law of motion to any control volume $V$ bounded by the surface
$S$ comoving with the fluid. This gives

$$\int_V \rho\, \frac{Dq_i}{Dt}\, dV = \text{total force on the boundary surface } S + \text{the body force inside } V = \oint_S \sigma_{ik}\, dS_k + \int_V \rho\, g_i\, dV \tag{14.17}$$


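where $\sigma_{ik}$ is the stress tensor and $\mathbf{g}$ is the externally applied body force per unit mass (see
Eqs (13.24) and (13.26)). Since $V$ is arbitrary we must have

$$\rho\, \frac{Dq_i}{Dt} = \frac{\partial \sigma_{ik}}{\partial x_k} + \rho\, g_i \tag{14.18}$$

Now since $\sigma_{ik} = -p\, \delta_{ik}$ (for a perfect fluid)

$$\frac{\partial \sigma_{ik}}{\partial x_k} = -\frac{\partial p}{\partial x_k}\, \delta_{ik} = -\frac{\partial p}{\partial x_i}$$

giving

$$\rho\, \frac{D\mathbf{q}}{Dt} = -\nabla p + \rho\, \mathbf{g} \tag{14.19}$$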

The vector equation Eq. (14.19) is called Euler’s equation of fluid motion. It was first 
derived by Euler in 1755. Note that when the fluid is in equilibrium, that is, Dq/Dt = 0, 
we get 

$$\nabla p = \rho\, \mathbf{g} \tag{14.20}$$

Thus the gradient of pressure has the same direction as that of the externally applied body
force at any point. Further, when $\mathbf{g} = 0$ we see that $\nabla p = 0$, so that the pressure is constant
in space and time. We shall deal with the equilibrium situation later.

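Writing $D\mathbf{q}/Dt$ explicitly, Eq. (14.19) becomes:

$$\rho\, \frac{\partial \mathbf{q}}{\partial t} + \rho\, (\mathbf{q} \cdot \nabla)\mathbf{q} = -\nabla p + \rho\, \mathbf{g} \tag{14.21}$$

Using

$$\nabla(\mathbf{q} \cdot \mathbf{q}) = 2\, (\mathbf{q} \cdot \nabla)\mathbf{q} + 2\, \mathbf{q} \times (\nabla \times \mathbf{q})$$

Eq. (14.21) becomes

$$\frac{\partial \mathbf{q}}{\partial t} + \frac{1}{\rho}\nabla p + \frac{1}{2}\nabla(q^2) - \mathbf{q} \times (\nabla \times \mathbf{q}) - \mathbf{g} = 0 \tag{14.22}$$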


This is another form of Euler’s equation of motion for a perfect fluid. 


14.8 PRESSURE POTENTIAL 


Let us define the quantity

$$P = \int_{p_0}^{p} \frac{dp}{\rho} \tag{14.23}$$

to be the pressure potential of the fluid at any given point $\mathbf{r}$ and time $t$. If the equation of
state is known and is in the form of $\rho$ as a function of $p$ only (that is, for barotropic fluids),
one can determine the pressure potential $P$. From this definition of $P$ it follows that

$$dP = \frac{dp}{\rho}$$

or

$$\nabla P = \frac{1}{\rho}\nabla p \tag{14.24}$$

For an incompressible fluid $\rho$ = constant, therefore,

$$P = \frac{p - p_0}{\rho} \tag{14.25}$$

One would now like to know whether P signifies some known physical quantity or is just 
a mathematical object. 

From thermodynamics we know that the enthalpy function $H'$ satisfies the following
differential relation

$$dH' = T'\, dS' + V'\, dp = T'\, dS' + \frac{dp}{\rho}$$

since $V' = 1/\rho$. This gives the relation

$$(dH')_{S' = \text{constant}} = \frac{dp}{\rho} = dP \tag{14.26}$$

For any adiabatic fluid flow, that is, a flow without loss of heat due to thermal conduction, 
viscosity, etc. (which is guaranteed by the motion of a perfect fluid), the entropy remains 
constant. Thus we may say that the motion of perfect fluid corresponds to an isentropic 
flow. For such motions of the fluid P = H', that is, the pressure potential is nothing else 
but the enthalpy of the system. 

However, if the motion is isothermal, consider the Gibbs potential function satisfying

$$dG' = -S'\, dT' + V'\, dp = -S'\, dT' + \frac{dp}{\rho}$$

so that, for constant temperature,

$$(dG')_{T' = \text{constant}} = \frac{dp}{\rho} = dP \tag{14.27}$$

Thus the pressure potential may be identified with the Gibbs potential G' if the fluid flow 
takes place at a constant temperature. Such a fluid flow may even have viscous losses, or in 
other words, the fluid may even be imperfect in nature. 


14.9 EXTERNAL FORCE FIELD 

If the external force field is conservative in nature, $\mathbf{g}$ may be derived from a potential
function, say $\Omega$. That is

$$\mathbf{g} = -\nabla \Omega \tag{14.28}$$

For example, the force of gravitation and the centrifugal force of rotation can be derived 
from suitable potential functions. Hence for a wide variety of situations, Eq. (14.28) would 
be quite useful. Using Eqs (14.24) and (14.28), Euler’s equation (Eq. (14.22)) can be 




Now, if $g$ is assumed to be constant,

$$p = p_0\, e^{-\mu g z / RT'} \tag{14.33}$$

$$\rho = \rho_0\, e^{-\mu g z / RT'} \tag{14.34}$$

where $(\mu g / RT')^{-1}$ is the scale height of the atmosphere, which is 8.8 km for $T' = 300$ K,
$\mu = 28.8$ and $g = 9.81$ m s$^{-2}$. Such an exponential (isothermal) atmosphere must extend up
to infinity and the pressure and density must drop by a factor of $e = 2.7182$ every 8.8 km
above the ground. One can of course find more realistic formulae, using $p \propto \rho^{\gamma}$ (adiabatic
variation) and $g = GM/(R_\oplus + z)^2$ instead of a constant.
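
The quoted scale height can be reproduced with a few lines of arithmetic. This is a minimal sketch, assuming the universal gas constant $R = 8.314$ J mol$^{-1}$ K$^{-1}$ and reading $\mu = 28.8$ as a molar mass in g/mol; the function names are illustrative only.

```python
import math

R_GAS = 8.314        # universal gas constant in J/(mol K); assumed value
TEMPERATURE = 300.0  # K, as quoted in the text
MOLAR_MASS = 28.8e-3 # kg/mol, reading mu = 28.8 as g/mol (assumption)
GRAVITY = 9.81       # m/s^2, as quoted in the text


def scale_height(temperature=TEMPERATURE, molar_mass=MOLAR_MASS, g=GRAVITY):
    """Scale height R T / (mu g) of an isothermal atmosphere, in metres."""
    return R_GAS * temperature / (molar_mass * g)


def pressure_ratio(z, temperature=TEMPERATURE, molar_mass=MOLAR_MASS, g=GRAVITY):
    """p(z)/p0 = exp(-z / scale height) for constant g and temperature."""
    return math.exp(-z / scale_height(temperature, molar_mass, g))


height = scale_height()
print(f"scale height = {height / 1000:.2f} km")
print(f"p/p0 one scale height up = {pressure_ratio(height):.4f}")
```

With these inputs the scale height comes out at roughly 8.8 km, and the pressure ratio one scale height above the ground is $1/e$ by construction.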


14.11 BERNOULLI’S THEOREM 

14.11.1 A Few More Definitions 
(i) Vorticity Vector: This is defined as

$$\boldsymbol{\omega} = \tfrac{1}{2}\, \nabla \times \mathbf{q} \tag{14.35}$$

where $\boldsymbol{\omega}$ is a vector point function (vector field) representing the rotation of the fluid.


(ii) Scalar Velocity Potential: For irrotational motion of the fluid $\boldsymbol{\omega} = 0 = \nabla \times \mathbf{q}$, which
means that $\mathbf{q}$ can be expressed as the gradient of some scalar potential function $\phi(\mathbf{r}, t)$ called
the scalar velocity potential

$$\mathbf{q} = -\nabla \phi \tag{14.36}$$

For an incompressible fluid we have seen that $\nabla \cdot \mathbf{q} = 0$. Hence, for an incompressible
fluid having an irrotational motion, the scalar velocity potential must satisfy

$$\nabla^2 \phi = 0$$

or, more explicitly

$$\frac{\partial^2 \phi}{\partial x^2} + \frac{\partial^2 \phi}{\partial y^2} + \frac{\partial^2 \phi}{\partial z^2} = 0 \tag{14.37}$$

Hence $\phi$ is a harmonic function for any perfect incompressible fluid that has an irrotational
motion.


(iii) Velocity Vector Potential: For the steady motion of an incompressible fluid flow
$\nabla \cdot \mathbf{q} = 0$. This means that $\mathbf{q}$ can be expressed as the curl of some vector point function, say
$\mathbf{A}(\mathbf{r}, t)$ so that

$$\mathbf{q} = \nabla \times \mathbf{A} \tag{14.38}$$

where $\mathbf{A} = \mathbf{A}(\mathbf{r}, t)$ is called the velocity vector potential. Such fluid flow can have vorticity





Fig. 14.1 (a) Stream tube and (b) vortex tube 


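Proof: Euler’s equation of motion, Eq. (14.29), gives

$$\frac{\partial \mathbf{q}}{\partial t} - \mathbf{q} \times (\nabla \times \mathbf{q}) = -\nabla \psi$$

where $\psi$ is defined by Eq. (14.42). Since the flow is steady, $\partial \mathbf{q}/\partial t = 0$, and hence

$$2\, \mathbf{q} \times \boldsymbol{\omega} = \nabla \psi \tag{14.43}$$

Taking the scalar product of the above with $\hat{\mathbf{i}}_\omega = \boldsymbol{\omega}/|\boldsymbol{\omega}|$, we get

$$\hat{\mathbf{i}}_\omega \cdot \nabla \psi = 0 \tag{14.44}$$

Now along the vortex line, $d\mathbf{r} = |d\mathbf{r}|\, \hat{\mathbf{i}}_\omega$. Let the corresponding change in $\psi$ be $d\psi$. Then by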




increase in the flow velocity in the constricted part. The equation of continuity requires 
that the total flow rate Q remain the same everywhere, that is, 

$$Q = q_1 A_1 = q_2 A_2$$



Fig. 14.3 Working principle of venturimeter 


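Therefore the pressure difference is

$$p_1 - p_2 = \tfrac{1}{2}\left(q_2^2 - q_1^2\right)\rho$$

If a manometer reads the pressure difference in terms of the difference in height $h$ as shown
in Fig. 14.3, then

$$p_1 - p_2 = \rho g h$$

giving

$$2gh = Q^2\left(\frac{1}{A_2^2} - \frac{1}{A_1^2}\right)$$

or

$$Q = A_1 A_2 \sqrt{\frac{2gh}{A_1^2 - A_2^2}} \tag{14.53}$$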

Therefore the flow rate can be directly checked and measured by noting the manometer 
readings. 
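
As a quick illustration of how a manometer reading translates into a flow rate, the sketch below evaluates $Q = A_1 A_2 \sqrt{2gh/(A_1^2 - A_2^2)}$, which follows from $Q = q_1 A_1 = q_2 A_2$ together with $p_1 - p_2 = \frac{1}{2}\rho(q_2^2 - q_1^2) = \rho g h$. The cross-sections and the reading used here are made-up sample values, and the function name is hypothetical.

```python
import math


def venturi_flow_rate(h, area_wide, area_throat, g=9.81):
    """Volume flow rate from the manometer height difference h (SI units).

    Uses Q = A1 * A2 * sqrt(2 g h / (A1^2 - A2^2)) with A1 > A2.
    """
    if area_throat >= area_wide:
        raise ValueError("the throat must be narrower than the wide section")
    return area_wide * area_throat * math.sqrt(
        2.0 * g * h / (area_wide ** 2 - area_throat ** 2)
    )


# Sample (invented) geometry and reading: 20 cm^2 pipe, 10 cm^2 throat, h = 5 cm.
A1, A2, h = 2.0e-3, 1.0e-3, 0.05
Q = venturi_flow_rate(h, A1, A2)
q1, q2 = Q / A1, Q / A2
print(f"Q = {Q * 1000:.3f} litres/s, q1 = {q1:.3f} m/s, q2 = {q2:.3f} m/s")
```

Substituting the resulting speeds back into $\frac{1}{2}(q_2^2 - q_1^2)$ should return $gh$, which is a convenient consistency check on the algebra.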


(ii) Bunsen’s Jet Exhaust Pump: A strong jet of air is passed through a sealed tube
connected to a chamber to be evacuated (see Fig. 14.4). The high speed of air at the 
narrow end of the muzzle creates a low pressure region, which sucks in the gas inside the 
chamber. A vacuum cleaner also works on the same principle. 


(iii) One-dimensional flow of gas through a nozzle: Let the $x$-axis be the axis of the nozzle
whose cross-section decreases from left to right, ending finally in the narrow opening of
the nozzle through which the fluid comes out with high speed (see Fig. 14.5). Let $p_0$, $\rho_0$, $q_0$
and $A_0$ be respectively the pressure, density, speed and the cross-sectional area of the fluid
at the mouth of the nozzle (that is, at $x = x_0$ in Fig. 14.5), and let $p$, $\rho$, $q$ and $A$ denote
the same quantities at some arbitrary value of $x$. By the equation of continuity we require
that the total flow rate $Q$ across any surface through which the entire flow is taking place
must remain constant. This means,

$$A \rho q = \text{const.}$$

where $q$ is the $x$-component of the velocity $\mathbf{q}$. Equivalently,

$$\ln A + \ln \rho + \ln q = \text{const.}$$

or differentiating,

$$\frac{1}{A}\frac{dA}{dx} + \frac{1}{\rho}\frac{d\rho}{dx} + \frac{1}{q}\frac{dq}{dx} = 0 \tag{14.54}$$


Assuming the effect of the gravitational potential to be negligible, Bernoulli’s theorem
gives

$$\frac{1}{2}q^2 + \int_{p_0}^{p} \frac{dp}{\rho(p)} = \text{const.}$$

Noting that $p$ is in general a function of $x$ and that $p_0$ is a constant, we get, by Leibniz’s
rule for differentiation of an integral,

$$\frac{d}{dx}\int_{p_0}^{p} \frac{dp}{\rho(p)} = \frac{1}{\rho(p)}\frac{dp}{dx}$$





Fig. 14.5 Flow through a constriction


Therefore, differentiating Bernoulli’s equation with respect to $x$ we get

$$q\frac{dq}{dx} + \frac{1}{\rho}\frac{dp}{d\rho}\frac{d\rho}{dx} = 0$$

or

$$q\frac{dq}{dx} + \frac{c^2}{\rho}\frac{d\rho}{dx} = 0 \tag{14.55}$$

where $c = \sqrt{dp/d\rho}$ is the speed of sound in the medium in question. Using Eq. (14.55),
Eq. (14.54) can be written as

$$\frac{1}{A}\frac{dA}{dx} = -\frac{1}{q}\left(1 - \frac{q^2}{c^2}\right)\frac{dq}{dx} \tag{14.56}$$


The ratio of the speed of fluid and the speed of sound is called the Mach number, denoted 
by M = q/c. Obviously, 

(a) M < 1 means q < c, that is, the flow is subsonic, 

(b) M = 1 means q = c, that is, the flow is sonic , 

(c) M > 1 means q > c, that is, the flow is supersonic. 

Thus, from Eq. (14.56)

$$\frac{1}{q}\left(M^2 - 1\right)\frac{dq}{dx} = \frac{1}{A}\frac{dA}{dx} \tag{14.57}$$

or

$$\frac{dq}{dA} = \frac{q}{A\left(M^2 - 1\right)} \tag{14.58}$$

Now we deal separately with three possible cases.


(a) Subsonic flow: For this case $M < 1$, which by Eq. (14.58) implies $dq/dA < 0$. Thus
the velocity of the fluid decreases as the cross-sectional area increases. Hence, as the fluid
approaches the mouth of the nozzle while having subsonic velocities all the time, the velocity 
of the flow increases. 

(b) Sonic flow: Here $M = 1$. Using Eq. (14.57) we get $dA/dx = 0$ for $M = 1$. It
means that when the fluid passes through the sonic point, the cross-sectional area of the
flow is stationary, that is, the sonic point can occur only where the cross-section has an extremum.

(c) Supersonic flow : M > 1, for which dq/dA > 0 by Eq. (14.58). In this case the 
flow velocity increases with the cross-sectional area, which is quite contrary to its subsonic 
behaviour. 
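
The sign change of $dq/dA = q/[A(M^2 - 1)]$ across the sonic point is easy to tabulate. The following sketch uses invented sample values for $q$ and $A$ and hypothetical function names; it only illustrates the three cases listed above.

```python
def velocity_area_slope(q, area, mach):
    """Return dq/dA = q / (A (M^2 - 1)); the expression is singular at M = 1."""
    if mach == 1.0:
        raise ZeroDivisionError("dq/dA is singular at the sonic point M = 1")
    return q / (area * (mach ** 2 - 1.0))


def classify(mach):
    """Name the flow regime for a given Mach number."""
    if mach < 1.0:
        return "subsonic"
    if mach > 1.0:
        return "supersonic"
    return "sonic"


# Sample values: q = 100 m/s through A = 0.01 m^2 at two Mach numbers.
for mach in (0.5, 2.0):
    slope = velocity_area_slope(100.0, 0.01, mach)
    trend = "speeds up" if slope < 0 else "slows down"
    print(f"M = {mach}: {classify(mach)}, dq/dA = {slope:.1f}, "
          f"so the flow {trend} when the duct narrows")
```

A negative slope means that a converging duct accelerates the flow (the subsonic case), while a positive slope means that a diverging duct is needed for further acceleration (the supersonic case).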



Fig. 14.6 A subsonic flow can become a supersonic one while passing through a
narrow enough constriction 


The above analysis suggests a way to accelerate the flow speed past the sonic point (see
Fig. 14.6). A nozzle is constructed, which opens out on both sides, and the dimension of its 
neck is such that for some initial velocity field of the gas the flow reaches the sonic point at 
the most constricted zone. In that case the flow of the fluid can easily be accelerated to large 
supersonic values of the Mach number M. Since the flow becomes supersonic the density 
structure no longer remains continuous and a surface of discontinuity of density (called a 
shock front) develops. 


14.13 GRAVITY WAVES AND RIPPLES 
14.13.1 Gravity Waves 

Suppose a train of waves is propagating on the surface of a pond, lake or sea. The whole 
column of the water right from the bottom to the free surface, has to take part in it, and 
such waves are called body waves. Since water is incompressible, the wave pattern that 




Let us consider the body waves of very short wavelengths (compared to the depth h 
of the fluid). The pressure at the trough can be taken to be approximately equal to the 
pressure at the crest, which again is the same as the atmospheric pressure at that level. 
So there is no change in pressure over the surface of the wave. But the gravity head has 
changed by 2a between the crest and the trough. Hence by Bernoulli’s theorem we must 
have 

$$\frac{1}{2}q_1^2 + \frac{p_1}{\rho} + \Omega_1 = \frac{1}{2}q_2^2 + \frac{p_2}{\rho} + \Omega_2 \tag{14.61}$$

As $p_1 = p_2$, $q_1 = (2\pi a c/\lambda) - c$, $q_2 = -(2\pi a c/\lambda) - c$ and $\Omega_1 - \Omega_2 = 2ga$, the above equation
reduces to

$$2ga = \frac{4\pi a c^2}{\lambda}$$

or

$$c = \sqrt{\frac{g\lambda}{2\pi}} \tag{14.62}$$

Thus the speed of the gravity waves (that is, waves for which $\lambda \ll h$) is independent
of amplitude but depends on wavelength. Indeed the longer the wavelength, the higher the
speed. Thus very long wavelength deep sea waves will arrive at the shore in the least time.
So, if there is a volcanic eruption under the sea, the water waves of longer wavelength will
carry the message faster. The same is true for sea storms, such as hurricanes or tsunamis,
where the sea propagates the message in the form of sending huge deep water waves out to
the shore.




14.13.2 Ripples 


Ripples are deep water waves of extremely small wavelengths for which the surface tension 
effects are not negligible. Due to surface tension, the pressure just below the surface of the 
fluid is different from that on the free surface, which is the atmospheric pressure $p_0$. The
pressure difference due to surface tension $T$ across any surface of a fluid is given by

$$\Delta p = T\left(\frac{1}{R_1} + \frac{1}{R_2}\right) \tag{14.63}$$

where $R_1$ and $R_2$ are the two radii of curvature of the surface in two perpendicular sections,
both normal to the free surface.

In the above case, in one section (in the y-z plane) the curvature is zero but in the other
section (in the x-z plane) the curvature is given by

$$\frac{1}{R} = \frac{d^2y/dx^2}{\left[1 + (dy/dx)^2\right]^{3/2}}$$

Since the curvature is maximum at the trough as well as at the crest, where $dy/dx = 0$,
using Eq. (14.60) we get

$$\frac{1}{R} = \frac{4\pi^2 a}{\lambda^2} \tag{14.64}$$




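Therefore, at the crest, the pressure just inside the surface is

$$p_0 + \left|\frac{T}{R}\right| = p_0 + \frac{4\pi^2 a T}{\lambda^2} = p_1$$

Similarly at the trough, the pressure just inside the surface is

$$p_0 - \left|\frac{T}{R}\right| = p_0 - \frac{4\pi^2 a T}{\lambda^2} = p_2$$

Now for this case, Bernoulli’s equation takes the following form

$$2ag + \frac{8\pi^2 T a}{\rho \lambda^2} = \frac{4\pi a c^2}{\lambda} \tag{14.65}$$

which gives

$$c = \sqrt{\frac{g\lambda}{2\pi} + \frac{2\pi T}{\lambda \rho}} \tag{14.66}$$

For small enough wavelengths the above reduces to

$$c = \sqrt{\frac{2\pi T}{\lambda \rho}} \tag{14.67}$$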


which is controlled by the surface tension only. Such small wavelength waves are called
ripples. Since the speed of the wave $c$ according to Eq. (14.66) increases both for large and
small $\lambda$, its value will assume a minimum for some $\lambda = \lambda_{\text{critical}}$ (see Fig. 14.8).



Fig. 14.8 The minimum speed of propagation of surface waves for 
the critical wavelength of propagation 


One can show that

$$\lambda_{\text{critical}} = 2\pi\sqrt{\frac{T}{\rho g}} \tag{14.68}$$




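For water $T = 7.0 \times 10^{-2}$ N/m and $\rho = 10^3$ kg/m$^3$, therefore

$$\lambda_{\text{critical}} = 0.0172 \text{ m} \quad \text{and} \quad c_{\min} = 0.231 \text{ m/s} \tag{14.69}$$

For $\lambda \gg \lambda_{\text{critical}}$, the waves are gravity waves; and for $\lambda \ll \lambda_{\text{critical}}$, the waves are capillary
waves or ripples.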
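
The minimum of the wave speed can be located directly from $c^2 = g\lambda/2\pi + 2\pi T/(\lambda\rho)$. Setting $dc^2/d\lambda = 0$ gives $\lambda_{\text{critical}} = 2\pi\sqrt{T/\rho g}$ and hence $c_{\min} = (4gT/\rho)^{1/4}$. The sketch below evaluates both, assuming $g = 9.81$ m/s$^2$ together with the values of $T$ and $\rho$ quoted for water; the function names are illustrative.

```python
import math

SURFACE_TENSION = 7.0e-2  # N/m, value quoted in the text for water
DENSITY = 1.0e3           # kg/m^3
GRAVITY = 9.81            # m/s^2 (assumed)


def wave_speed(lam, T=SURFACE_TENSION, rho=DENSITY, g=GRAVITY):
    """Deep-water speed including surface tension: sqrt(g lam/2pi + 2pi T/(lam rho))."""
    return math.sqrt(g * lam / (2 * math.pi) + 2 * math.pi * T / (lam * rho))


def critical_wavelength(T=SURFACE_TENSION, rho=DENSITY, g=GRAVITY):
    """Wavelength of minimum speed: 2 pi sqrt(T / (rho g))."""
    return 2 * math.pi * math.sqrt(T / (rho * g))


lam_c = critical_wavelength()
c_min = wave_speed(lam_c)
print(f"critical wavelength = {lam_c:.4f} m, minimum speed = {c_min:.3f} m/s")
```

With exactly these inputs the numbers land near 0.017 m and 0.23 m/s, close to the figures in Eq. (14.69); the small residual difference presumably reflects slightly different input values or rounding in the quoted result.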

The group velocity of these waves is given by

$$v_g = \frac{d\omega}{dk} = \frac{1}{2}c \ \text{ for gravity waves}, \qquad v_g = \frac{3}{2}c \ \text{ for ripples}$$

Thus on the water surface, the wave groups raised by a boat or a swarm of fish will travel
at half of the speed of the individual crests if $\lambda > 0.02$ m. Similarly, the wave groups will
travel faster than the individual crests for $\lambda < 0.015$ m.
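
The two ratios of group to phase velocity can be confirmed by differentiating the dispersion relations numerically. With $k = 2\pi/\lambda$ and $\omega = ck$, the gravity-wave speed $c = \sqrt{g\lambda/2\pi}$ corresponds to $\omega = \sqrt{gk}$ and the ripple speed $c = \sqrt{2\pi T/\lambda\rho}$ to $\omega = \sqrt{Tk^3/\rho}$. The sketch below, with illustrative names and sample wavenumbers, takes a central difference of $\omega(k)$.

```python
import math


def omega_gravity(k, g=9.81):
    """Deep-water gravity waves: omega = sqrt(g k)."""
    return math.sqrt(g * k)


def omega_ripple(k, T=7.0e-2, rho=1.0e3):
    """Capillary waves (ripples): omega = sqrt(T k^3 / rho)."""
    return math.sqrt(T * k ** 3 / rho)


def group_to_phase_ratio(omega, k, rel_step=1e-6):
    """Return v_g / c with v_g = d(omega)/dk estimated by a central difference."""
    dk = k * rel_step
    v_group = (omega(k + dk) - omega(k - dk)) / (2 * dk)
    v_phase = omega(k) / k
    return v_group / v_phase


ratio_gravity = group_to_phase_ratio(omega_gravity, 2 * math.pi / 1.0)    # lambda = 1 m
ratio_ripple = group_to_phase_ratio(omega_ripple, 2 * math.pi / 0.002)    # lambda = 2 mm
print(ratio_gravity, ratio_ripple)
```

The printed ratios should be very close to 1/2 and 3/2 respectively, independent of the wavenumbers chosen, because both dispersion relations are pure power laws in $k$.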

Suppose a stream is flowing with a speed c > 0.23 m/s and a stone is sticking out of the 
water surface. The capillary waves formed at the stone will be found to travel up the stream 
and the gravity waves will propagate down the stream. But if the velocity of the stream is 
less than 0.23 m/s, no such waves are formed. 

14.13.3 Shallow Water Waves 

This case applies where the wavelength of the waves $\lambda$ is very large compared to the depth
of the fluid $h$. The fluid velocity $q$ is then taken to be uniform over any vertical cross-section
of the fluid, but $q$ varies with $x$, the direction of propagation of the waves (see Fig. 14.9a).
Let $\eta$ be the elevation of the wave (see Fig. 14.9b). If Bernoulli’s theorem is to be applied
the observer must move with a speed $c$ which is that of wave propagation. In this case,
Bernoulli’s theorem gives

$$\frac{1}{2}(q - c)^2 + g(h + \eta) = \frac{1}{2}c^2 + gh$$

and the equation of continuity is

$$(q - c)(h + \eta) = -ch$$



Fig. 14.9 Breakers or shallow water waves; a case when the 
wavelength of propagation > depth of water 




The above two equations give (neglecting $(\eta/h)^2$ and $(q/c)^2$)

$$-qc + g\eta = 0, \quad \text{and} \quad qh - c\eta = 0$$

Solving for $c$, we get

$$c = \sqrt{gh} \tag{14.70}$$

The wave velocity is now independent of both the wave amplitude and wavelength. The 
speed depends only on the depth h of the water. 

If the depth of the water is 10 m, the wave speed is about 10 m/s. This speed decreases, 
as the wave gradually approaches the shore. One can easily notice how the speed of wave 
propagation diminishes as the breakers approach the shore line. 


14.13.4 The Most General Case 


The above three special cases can be derived from the most general result for the speed of
the waves given by

$$c^2 = \left(\frac{g\lambda}{2\pi} + \frac{2\pi T}{\rho\lambda}\right)\tanh\left(\frac{2\pi h}{\lambda}\right) \tag{14.71}$$

where $h$ is the depth of the water, $T$ is the surface tension and $\lambda$ the wavelength. This is
left as an exercise.
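
Without doing the exercise analytically, one can at least check numerically that $c^2 = (g\lambda/2\pi + 2\pi T/\rho\lambda)\tanh(2\pi h/\lambda)$ reproduces the three limiting speeds $\sqrt{g\lambda/2\pi}$, $\sqrt{2\pi T/\lambda\rho}$ and $\sqrt{gh}$. The sketch below does this for sample wavelengths and depths chosen here for illustration; the function name and the water parameters are assumptions.

```python
import math


def wave_speed_general(lam, depth, T=7.0e-2, rho=1.0e3, g=9.81):
    """Wave speed from c^2 = (g lam/2pi + 2pi T/(rho lam)) tanh(2pi h/lam)."""
    k = 2 * math.pi / lam
    return math.sqrt((g / k + T * k / rho) * math.tanh(k * depth))


# Deep-water gravity waves: lam = 10 m on a 1 km deep sea.
c_deep = wave_speed_general(10.0, 1000.0)
c_deep_limit = math.sqrt(9.81 * 10.0 / (2 * math.pi))

# Ripples: lam = 2 mm on 1 m of water.
c_ripple = wave_speed_general(0.002, 1.0)
c_ripple_limit = math.sqrt(2 * math.pi * 7.0e-2 / (0.002 * 1.0e3))

# Shallow-water waves: lam = 1 km on 10 m of water.
c_shallow = wave_speed_general(1000.0, 10.0)
c_shallow_limit = math.sqrt(9.81 * 10.0)

for name, value, limit in (("deep", c_deep, c_deep_limit),
                           ("ripple", c_ripple, c_ripple_limit),
                           ("shallow", c_shallow, c_shallow_limit)):
    print(f"{name}: general = {value:.4f} m/s, limiting form = {limit:.4f} m/s")
```

The agreement is not exact because each limiting form drops a small term, but for these sample values the general and limiting speeds should differ by about one per cent or less.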


14.13.5 Streaming and Shooting Flows 

When a river is flowing steadily in a straight, uniform, horizontal channel of constant width, 
the flows can have interesting properties. Let the depth of the water flow be h, and its 
velocity q. Let us also assume, for convenience, that the width of the flow is unity. Therefore, 
the discharge rate Q, which is defined to be the volume of liquid flowing per unit width per 
unit time, is given by 


$$Q = qh \tag{14.72}$$

Bernoulli’s theorem says that

$$gh + \frac{1}{2}q^2 = \text{const.} = gH \ \text{(say)} \tag{14.73}$$

Eliminating $q$ from Eqs (14.72) and (14.73) we get,

$$gh + \frac{Q^2}{2h^2} = gH \tag{14.74}$$

Let us define two dimensionless quantities by

$$F = \frac{q}{\sqrt{gh}} = \frac{Q}{\sqrt{gh^3}}, \qquad k = \frac{8gH^3}{Q^2} \tag{14.75}$$

In terms of $F$ and $k$, Eq. (14.74) can be expressed as

$$(2 + F^2)^3 = kF^2 \tag{14.76}$$





Fig. 14.10 Graphical solution of two simultaneous equations y = kx and
y = (2 + x) 3 . No solution exists for k < 27, but two discrete 
solutions do for k > 27 


This is a cubic equation in $F^2$ and can be solved graphically. Putting $F^2 = x$, the
solutions to Eq. (14.76) are given by the intersection of the curves $y = kx$ and $y = (2 + x)^3$,
shown in Fig. 14.10. Note that if $k < 27$ these two curves never intersect for $x > 0$, and therefore
there is no solution. If $k = 27$, the curves touch at $x = 1$ and $y = 27$, which corresponds
to the unique solution

$$Q_{\max} = \sqrt{\frac{8gH^3}{27}} \tag{14.77}$$

which in turn implies $q = \sqrt{gh}$. If $k > 27$, there are two points of intersection for $x > 0$,
both of which correspond to a value of $Q < Q_{\max}$. One of them is for $F^2 < 1$, implying
$q < \sqrt{gh}$, and the other for $F^2 > 1$, implying $q > \sqrt{gh}$. Thus these two solutions classify
the stream flows into two distinct types: the former with velocity less than $\sqrt{gh}$ is called
the streaming flow, and the latter with velocity greater than $\sqrt{gh}$ is called the shooting flow
(see Fig. 14.10).

This effect is in fact seen in shallow water streams where suddenly the shooting solution 
jumps to the streaming solution. This happens, for example, near a dam or barrier, where 
there is a sudden jump in the height of the water column. Loosely speaking, this can be 
thought of as a fluid mechanical analogue of the quantum jump from one possible eigenstate 
(of the stream velocity) to the other. 
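
The graphical construction can be mimicked numerically. The sketch below finds the positive roots $x = F^2$ of $(2 + x)^3 = kx$ by bisection and converts them into a depth and a speed, assuming the relations $k = 8gH^3/Q^2$, $h = 2H/(2 + F^2)$ and $q = F\sqrt{gh}$ that follow from $Q = qh$ and $gh + q^2/2 = gH$. Function names and the sample values of $H$ and $Q$ are illustrative only.

```python
import math


def _bisect(func, lo, hi, tol=1e-12):
    """Locate a sign change of func between lo and hi by bisection."""
    f_lo = func(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def froude_roots(k):
    """Positive roots x = F^2 of (2 + x)^3 = k x (none for k < 27)."""
    if k < 27.0:
        return []
    if k == 27.0:
        return [1.0]
    cubic = lambda x: (2.0 + x) ** 3 - k * x
    upper = 1.0                      # cubic(1) = 27 - k < 0 here
    while cubic(upper) <= 0.0:       # march outwards until the cubic is positive again
        upper *= 2.0
    return [_bisect(cubic, 0.0, 1.0), _bisect(cubic, 1.0, upper)]


def flow_states(H, Q, g=9.81):
    """Return [(depth, speed), ...] for total head H and discharge Q per unit width."""
    k = 8.0 * g * H ** 3 / Q ** 2
    states = []
    for x in froude_roots(k):
        depth = 2.0 * H / (2.0 + x)
        states.append((depth, math.sqrt(x * g * depth)))
    return states


# Sample values: H = 1 m, Q = 1.5 m^2/s (below Q_max of about 1.70 m^2/s).
for depth, speed in flow_states(1.0, 1.5):
    kind = "streaming" if speed < math.sqrt(9.81 * depth) else "shooting"
    print(f"h = {depth:.3f} m, q = {speed:.3f} m/s ({kind})")
```

For $k = 32$ the cubic factorises as $(x - 2)(x^2 + 8x - 4) = 0$, so the two positive roots are $x = 2$ and $x = \sqrt{20} - 4$, which gives a convenient check on the root finder.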


14.14 TWO-DIMENSIONAL STEADY IRROTATIONAL FLOW OF 
INCOMPRESSIBLE FLUIDS 

Since the flow is irrotational we must have $\mathbf{q} = -\nabla\phi$, where $\phi$ is the scalar velocity potential,
and since the flow is steady and the fluid is incompressible, $\nabla \cdot \mathbf{q} = 0$. Combining these two
conditions we have $\nabla^2\phi = 0$, that is, $\phi$ satisfies Laplace’s equation in two dimensions

$$\frac{\partial^2 \phi}{\partial x^2} + \frac{\partial^2 \phi}{\partial y^2} = 0 \tag{14.78}$$

Similarly, taking divergence of Eq. (14.47), $\nabla^2\psi = 0$. Therefore $\phi$ and $\psi$ are harmonic
functions.


A planar flow means that the flow extends in two dimensions, say $x$ and $y$, but the
thickness in the $z$-direction remains constant. Since the description is two-dimensional,
a representation of the velocity potential in the complex plane is possible. A large and
important class of functions of the complex variable $z$ ($= x + iy$) exists, such that both their
real and imaginary parts satisfy Eq. (14.78). The properties of such functions, called
analytic functions, can be profitably used to obtain the solution $\phi(x,y)$ of Eq. (14.78). This
$\phi(x,y)$ then determines the flow pattern by the relation $\mathbf{q} = -\nabla\phi$.


14.14.1 Condition for Analyticity of a Complex Function 


Let $f(z)$ be a complex valued function of the complex variable $z = x + iy$ with $f(z) =
\phi(x,y) + i\psi(x,y)$, where $\phi(x,y)$ and $\psi(x,y)$ are some real functions of $x$ and $y$. $f(z)$ is said
to be differentiable at $z$ if the limit

$$\lim_{z' \to z} \frac{f(z') - f(z)}{z' - z} \tag{14.79}$$

exists. However, this limit will, in general, depend on the manner in which $z'$ approaches $z$.
In order that $f(z)$ be differentiable at $z$ this limit should not be dependent on the direction
of approach. A function $f(z)$ is said to be analytic at a point $z$ if it is differentiable at $z$
and also at every point in some neighborhood of $z$ in the complex plane.

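If we evaluate the above limit along the direction of the real and imaginary axes, we
obtain respectively

$$\lim_{x' \to x} \frac{f(z') - f(z)}{(x' + iy) - (x + iy)} = \frac{\partial f}{\partial x} = \frac{\partial \phi}{\partial x} + i\frac{\partial \psi}{\partial x}$$

$$\lim_{y' \to y} \frac{f(z') - f(z)}{(x + iy') - (x + iy)} = \frac{1}{i}\frac{\partial f}{\partial y} = \frac{1}{i}\left(\frac{\partial \phi}{\partial y} + i\frac{\partial \psi}{\partial y}\right) = \frac{\partial \psi}{\partial y} - i\frac{\partial \phi}{\partial y}$$

For $f(z)$ to be analytic these two limits ought to be equal. This gives us the following
conditions

$$\frac{\partial \phi}{\partial x} = \frac{\partial \psi}{\partial y} \quad \text{and} \quad \frac{\partial \psi}{\partial x} = -\frac{\partial \phi}{\partial y} \tag{14.80}$$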


These are called the Cauchy-Riemann conditions for the analyticity of f(z). Note that in 
the above we have only shown that these conditions are necessary. However they can also 
be shown to be sufficient. 


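From Eq. (14.80) it is obvious that the real and imaginary parts of an analytic function
$f(z)$, namely $\phi(x,y)$ and $\psi(x,y)$, satisfy

$$\frac{\partial^2 \phi}{\partial x^2} + \frac{\partial^2 \phi}{\partial y^2} = 0 \quad \text{and} \quad \frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} = 0 \tag{14.81}$$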


Next consider the integral of $f(z)$ along a curve $C$ in the complex plane

$$\int_C f(z)\, dz = \int_C (\phi + i\psi)(dx + i\, dy) = \int_C (\phi\, dx - \psi\, dy) + i\int_C (\psi\, dx + \phi\, dy)$$

This integral will be independent of the chosen path $C$ and will be a function only of the
endpoint coordinates, if and only if the quantities in the parentheses are perfect differentials.
This is so if and only if the Cauchy-Riemann conditions Eq. (14.80) are satisfied. Thus the
functions $\phi$ and $\psi$ are conservative and irrotational. Note that the above analysis also
implies that $\oint_C f(z)\, dz = 0$ if the function $f(z)$ is analytic in the region bounded by the
closed curve $C$.

Now it is easy to see that at all points

$$(\nabla\phi) \cdot (\nabla\psi) = \frac{\partial \phi}{\partial x}\frac{\partial \psi}{\partial x} + \frac{\partial \phi}{\partial y}\frac{\partial \psi}{\partial y} = 0 \tag{14.82}$$

by virtue of the Cauchy-Riemann conditions. Thus the equipotential surfaces for $\phi$ and $\psi$ at
each point are perpendicular to each other. If $\phi(x,y)$ is taken to be the velocity potential,
then the velocity $\mathbf{q} = -\nabla\phi$ must be along the line of constant $\psi$. Bernoulli’s theorem
says that the stream function is constant along all streamlines. The streamlines are along
the direction of instantaneous velocity or along $-\nabla\phi$. So $\psi$ can be treated as the stream
function for the problem.

On the other hand, since all analytic functions have to satisfy the Cauchy-Riemann 
conditions, the real part <p and the imaginary part ip of any analytic function must represent 
the velocity potential and the stream function pair for some steady, planar, irrotational flow 
of incompressible fluids. 
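
These properties are simple to test numerically for any candidate $f(z)$. The sketch below is an illustration with assumed names, step size and sample point: it estimates the partial derivatives of $\phi = \mathrm{Re}\, f$ and $\psi = \mathrm{Im}\, f$ by central differences, then reports the residual of the Cauchy-Riemann conditions and the scalar product $(\nabla\phi) \cdot (\nabla\psi)$.

```python
def partials(f, x, y, h=1e-5):
    """Central-difference partials (phi_x, phi_y, psi_x, psi_y) of f = phi + i psi."""
    f_x = (f(complex(x + h, y)) - f(complex(x - h, y))) / (2 * h)
    f_y = (f(complex(x, y + h)) - f(complex(x, y - h))) / (2 * h)
    return f_x.real, f_y.real, f_x.imag, f_y.imag


def check_point(f, x, y):
    """Return (Cauchy-Riemann residual, grad(phi) . grad(psi)) at the point (x, y)."""
    phi_x, phi_y, psi_x, psi_y = partials(f, x, y)
    cr_residual = max(abs(phi_x - psi_y), abs(psi_x + phi_y))
    orthogonality = phi_x * psi_x + phi_y * psi_y
    return cr_residual, orthogonality


# An analytic function, f(z) = z^2, at an arbitrary sample point.
cr_analytic, dot_analytic = check_point(lambda z: z * z, 0.7, -0.4)

# A non-analytic function, f(z) = conjugate(z), for contrast.
cr_conjugate, _ = check_point(lambda z: z.conjugate(), 0.7, -0.4)

print(cr_analytic, dot_analytic, cr_conjugate)
```

For $f(z) = z^2$ both numbers vanish to within round-off, so the curves of constant $\phi$ and constant $\psi$ cross at right angles. For the complex conjugate the Cauchy-Riemann residual is of order unity, so that function cannot serve as a complex potential.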

Examples 

Let us now pick up some analytic functions and see what type of flow patterns they 
represent. 


(i) $f(z) = z^2 = (x^2 - y^2) + i\, 2xy$. Thus

$$\phi(x,y) = x^2 - y^2 \quad \text{and} \quad \psi(x,y) = 2xy$$

The flow pattern is depicted in Fig. 14.11.

This is the flow pattern expected around a rectangular corner. (Combine a half $y$-axis and
a half $x$-axis to form the corner.)




This gives

$$\frac{2\phi^2}{\ldots} = 2r\cos^2(\theta/2) = r(1 + \cos\theta) = r + x$$

and

$$\frac{2\psi^2}{\ldots} = 2r\sin^2(\theta/2) = r(1 - \cos\theta) = r - x$$

Hence, $\phi$ = constant and $\psi$ = constant are the confocal and coaxial parabolas respectively
(see Fig. 14.13). This corresponds to a flow turning around the edge of a semi-infinite plane
sheet.



Fig. 14.13 Two-dimensional flow around a semi-infinite straight 
line 


(iv) $f(z) = -M/(2\pi z)$, $M$ being a real constant. This gives

$$\phi = -\frac{M\cos\theta}{2\pi r}$$

and

$$\psi = \frac{M\sin\theta}{2\pi r}$$

The resulting flow pattern is shown in Fig. 14.14.

This flow represents a doublet source with a source and sink sitting at the origin. The
streamlines are like those of dipole field lines. The source strength $M$ is like the dipole
moment of the source flow.


(v) $f(z) = q_0 z$. This gives a uniform stream with stream velocity $q_0$ in the direction of
the negative $x$-axis.

14.14.2 Flow around Bodies of Simple Shape 

The flow around any axisymmetric body can be reproduced by embedding a distribution of
sources and sinks in a uniform translational flow along the axis of the body. For a single





Fig. 14.14 The pattern of flow around a 2-D doublet source consisting of a 
source and a sink of equal strength, at an infinitesimal separation 

spherical source of strength $M$ placed in a uniform stream of velocity $q_0$, the stream lines
are shown in Fig. 14.15. The strength $M$ of a source is defined so as to represent the total
mass emitted per second by $4\pi\rho M$, $\rho$ being the density of the incompressible fluid.



Fig. 14.15 Streamlines around an axially symmetric obstacle, constructed
by a method similar to the method of images
used in electrostatics

Since there is no flow across the stream lines, the flow just outside the body must reproduce
the shape of the body. The position of the dividing streamline defines the shape of
the body in question. For this type of a flow an approximate choice of the stream function
would be

$$\psi(r,\theta) = -\frac{1}{2}q_0 r^2 \sin^2\theta + M\cos\theta \tag{14.83}$$




and similarly for the velocity potential,

$$\phi(r,\theta) = -q_0 r\cos\theta + \frac{M}{r} \tag{14.84}$$

Note that the velocity potential $\phi$ in Eq. (14.84) can be obtained from the
stream function (Eq. (14.83)) using the Cauchy-Riemann conditions.
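
For the single source, the dividing streamline can be written in closed form. On the upstream axis $\theta = \pi$ the stream function $\psi = -\frac{1}{2}q_0 r^2\sin^2\theta + M\cos\theta$ takes the value $-M$, and setting $\psi = -M$ elsewhere gives $r^2 = 2M/[q_0(1 - \cos\theta)]$. The sketch below evaluates this profile; the values of $q_0$ and $M$ and the function names are arbitrary illustrative choices.

```python
import math


def stream_function(r, theta, q0, M):
    """psi(r, theta) = -(1/2) q0 r^2 sin^2(theta) + M cos(theta)."""
    return -0.5 * q0 * r * r * math.sin(theta) ** 2 + M * math.cos(theta)


def dividing_radius(theta, q0, M):
    """Radius of the dividing streamline psi = -M, valid for 0 < theta <= pi."""
    return math.sqrt(2.0 * M / (q0 * (1.0 - math.cos(theta))))


q0, M = 2.0, 0.5  # sample stream speed and source strength (arbitrary units)

# Nose of the body: on the upstream axis the radius is sqrt(M / q0).
nose = dividing_radius(math.pi, q0, M)

# Far downstream the body tends to a cylinder of radius 2 sqrt(M / q0).
theta_small = 1.0e-3
far_radius = dividing_radius(theta_small, q0, M) * math.sin(theta_small)

print(f"nose at r = {nose:.3f}, asymptotic body radius = {far_radius:.3f}")
```

The nose position $\sqrt{M/q_0}$ is also where the radial velocity $-\partial\phi/\partial r = q_0\cos\theta + M/r^2$ vanishes on the upstream axis, so the dividing streamline starts at the stagnation point, as one would expect.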

14.14.3 Flows Around More Complex Axially Symmetric Shapes 

One can put a number of sources and sinks along the symmetry axis and follow the method 
similar to that of the method of images in electrostatics to produce the desired position of 
the dividing streamline which corresponds to the profile of the body. 

For a number of sources and sinks put along the symmetry axis,

$$\psi(r,\theta) = -\frac{1}{2}q_0 r^2 \sin^2\theta + \sum_{i=0}^{n-1} M_i\cos\theta_i \tag{14.85}$$

and

$$\phi(r,\theta) = -q_0 r\cos\theta + \sum_{i=0}^{n-1} \frac{M_i}{r_i} \tag{14.86}$$

where $M_i$ is the strength of the $i$th source and $\theta_i$ is the angle that the radius vector $\mathbf{r}_i$
makes with the symmetry axis.

At the position of the dividing stream line (that is, where $\theta = \pi$), the value of the
stream function is

$$\psi_0 = -\sum_{i=0}^{n-1} M_i \tag{14.87}$$

Now putting this value of $\psi_0$ in Eq. (14.85), one can obtain the equation of the dividing
streamline $r(\theta)$ as

$$-\sum_{i=0}^{n-1} M_i = -\frac{1}{2}q_0 r^2 \sin^2\theta + \sum_{i=0}^{n-1} M_i\cos\theta_i \tag{14.88}$$

because at all points of the dividing streamline, $\psi$ must have the value given by Eq. (14.87).

Applying numerical procedures one can make guesses about $M_i$ and $\theta_i$ in such a way
that Eq. (14.88) graphically resembles the shape of the given object (obstacle). In general
fewer than about 20 guesses or iterations are required to generate a curve that matches
the desired one with precision typically within 5 per cent. Once the correct distribution of
$M_i$, $r_i$ and $\theta_i$ is guessed, the expression for $\phi(r,\theta)$ yields the velocity distribution at any
given point and $\psi(r,\theta)$ gives the equations for the streamlines.

14.14.4 Aerodynamic Lifting Force on Aerofoils 

The shape of an aerofoil is shown in Fig. 14.16. When it moves through air, the surrounding 
air appears to stream past the foil, with a pattern of streamlines also shown in Fig. 14.16. 





Fig. 14.16 Flow around an aerofoil and the angle of 
attack 


The velocity of air streaming along the aerofoil is greater on the upper side than on the 
lower side. Small distances between the streamlines on the upper side indicate high velocity 
and vice versa. Hence the pressure is lower on the upper side than on its lower side. Thus 
there acts a resulting upward force on the wing, called the lifting force. The distribution of 
forces that act on the surface of the aerofoil is shown in Fig. 14.17, with the direction and
relative magnitudes represented by the arrowed lines. 



Fig. 14.17 Pressure distribution around an aerofoil 
moving through the air. The net upward 
force on the aerofoil provides the lift 


Obviously, this distribution of pressure force is created due to the particular shape of the 
aerofoil which forces the streamlines to follow its specific curved surface. The low pressure 
at the upper side is actually a consequence of the nature of the curved surface. An element 
of air following this curved path must experience centrifugal forces. So with the streamlines 
having a radius of curvature R we have the general formula for the pressure gradient 
developed due to streaming (given by Euler in 1754), 


$$\frac{dp}{dR} = \frac{\rho q^2}{R} \tag{14.89}$$


where q is the streaming velocity and p the density of the fluid. 

If the longitudinal axis of the aerofoil is inclined upward by an angle $\alpha$ with the horizontal,
called the angle of attack, the lifting force is found to increase with $\alpha$ up to about $\alpha = 25°$.
The blades of ceiling fans, table fans and pumping sets are set to have an angle of attack
close to the above figure in order to produce maximum pressure difference across the blades.




14.15 KELVIN’S AND HELMHOLTZ’S THEOREMS 


We start with the definition of a quantity called circulation around a closed curve. Let $\Gamma$
denote any closed curve drawn in the fluid at time $t$. Then

$$C = \oint_\Gamma \mathbf{q} \cdot d\mathbf{l} = \oint_\Gamma q_l\, dl \tag{14.90}$$

is called the circulation around the closed curve $\Gamma$ at time $t$, $l$ being the parameter chosen
to describe the closed path $\Gamma$.

14.15.1 A Theorem on null circulation 

If the motion of a fluid in a simply connected region is irrotational at any instant, the 
circulation around any closed curve in that region at that instant is zero. 

Proof: By Stokes’ theorem, 

$$C = \oint_\Gamma \mathbf{q} \cdot d\mathbf{l} = \int_S (\nabla \times \mathbf{q}) \cdot d\mathbf{S} = 0$$

The converse of this theorem is also true. In general, the strength of a vortex tube is equal 
to the circulation around any circuit surrounding the tube. 


14.15.2 Kelvin’s Theorem on Circulation 


The circulation around any closed curve comoving with the fluid does not change with time 
if the external field of force (if any) is conservative and p is a function of p alone (that is, 
the fluid is barotropic). 


Proof: Let r denote the vector function giving the position vectors of all the points on the 
curve at any given time. Since the curve moves so as to contain the same fluid particles at 
all times, dl remains invariant, but not dr which can change with time because we only 
require 

$$|d\mathbf{r}| = |D\mathbf{r}| = dl$$


Therefore, 





time variation of the magnetic induction related to the space variation of the electric field, 
in the theory of electromagnetism. 

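Some other properties of the vorticity motion emerge from Helmholtz’s theorem. Taking
a dot product of both sides of Eq. (14.92) with $\boldsymbol{\omega}/\rho$, we get

$$\frac{1}{2}\frac{D}{Dt}\left(\frac{\omega^2}{\rho^2}\right) = \frac{\boldsymbol{\omega}}{\rho} \cdot \left\{\left(\frac{\boldsymbol{\omega}}{\rho} \cdot \nabla\right)\mathbf{q}\right\}$$

Let us define

$$\frac{\boldsymbol{\omega}}{\rho} = \left|\frac{\boldsymbol{\omega}}{\rho}\right|\hat{\mathbf{n}}$$

$\hat{\mathbf{n}}$ being the unit vector along $\boldsymbol{\omega}$. Then,

$$\frac{D}{Dt}\left\{\ln\left(\frac{\omega^2}{\rho^2}\right)\right\} = 2\, (\hat{\mathbf{n}} \cdot \nabla)\mathbf{q} \cdot \hat{\mathbf{n}} \tag{14.93}$$

If $\mathbf{q}$ has to have continuous first partial derivatives, we must have

$$\left|(\hat{\mathbf{n}} \cdot \nabla)\mathbf{q} \cdot \hat{\mathbf{n}}\right| < Q_0 \tag{14.94}$$

where $Q_0$ is a positive constant. Hence

$$-Q_0 < (\hat{\mathbf{n}} \cdot \nabla)\mathbf{q} \cdot \hat{\mathbf{n}} < Q_0 \tag{14.95}$$

Now if at time $t_0$, the vorticity is $\omega_0$ and the density is $\rho_0$, then from Eq. (14.93)

$$\ln\left(\frac{\omega^2}{\rho^2}\right) - \ln\left(\frac{\omega_0^2}{\rho_0^2}\right) < \int_{t_0}^{t} 2Q_0\, dt' = 2Q_0(t - t_0) \tag{14.96}$$

Hence

$$\frac{\omega^2}{\rho^2} < \frac{\omega_0^2}{\rho_0^2}\exp\{2Q_0(t - t_0)\} \tag{14.97}$$
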

This result can be interpreted as 

(i) If the vorticity of a particle is ever zero at any instant it must always be zero. Thus a 
motion with non-zero vorticity cannot be generated in a barotropic ideal fluid by conservative 
body forces if the motion is initially irrotational. Thus, 

(ii) the motion with vorticity cannot be generated in a barotropic ideal fluid initially at 
rest. 

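Now taking the lower bound $(\hat{\mathbf{n}} \cdot \nabla)\mathbf{q} \cdot \hat{\mathbf{n}} > -Q_0$, we get

$$\frac{\omega^2}{\rho^2} > \frac{\omega_0^2}{\rho_0^2}\exp\{-2Q_0(t - t_0)\} \tag{14.98}$$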

Hence, we have, as further interpretations,

(iii) a particle which initially has nonzero vorticity can never at any instant lose all of its 
vorticity, 

(iv) the vorticity vector of a particle can never reverse its direction, and 




(v) the rate of change of vorticity in two dimensions for motions of incompressible fluids
is given by Eq. (14.92) which, for two dimensions, reduces to

$$\frac{D\boldsymbol{\omega}}{Dt} = (\boldsymbol{\omega}\cdot\nabla)\mathbf{q} = \omega_3\frac{\partial\mathbf{q}}{\partial x_3}$$

since $\boldsymbol{\omega}$ does not have any x, y components for planar motion. But since $\partial\mathbf{q}/\partial x_3 = 0$
by definition of a planar flow, we must have for planar flows

$$\frac{D\boldsymbol{\omega}}{Dt} = 0$$

Thus for a planar (two dimensional) motion of an incompressible fluid the vorticity of any
particle remains constant.


An Application 

A body of incompressible liquid rotates with constant angular velocity about the vertical 
axis under constant gravity. We want to find the free surface of its revolution and the 
vorticity at any point in the liquid. 

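Let $\boldsymbol{\Omega}$ be the constant angular velocity of rotation about the z-axis so that $\boldsymbol{\Omega} = \Omega\hat{k}$ (see
Fig. 14.18).

Fig. 14.18 Liquid in a rotating bucket assumes a paraboloidal surface

Now the velocity $\mathbf{q}$ of a particle at $\mathbf{r}$ at time $t$ is

$$\mathbf{q}(\mathbf{r},t) = \boldsymbol{\Omega}\times\mathbf{r} \qquad\text{where}\qquad \mathbf{r} = x\hat{\imath} + y\hat{\jmath} + z\hat{k}$$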




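Therefore,

$$\begin{aligned}
\frac{D\mathbf{q}}{Dt} &= \boldsymbol{\Omega}\times\frac{D\mathbf{r}}{Dt} = \boldsymbol{\Omega}\times\mathbf{q} = \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}) \\
&= (\boldsymbol{\Omega}\cdot\mathbf{r})\boldsymbol{\Omega} - \Omega^2\mathbf{r} = -\Omega^2(x\hat{\imath} + y\hat{\jmath}) \\
&= -\tfrac{1}{2}\Omega^2\nabla(x^2 + y^2)
\end{aligned} \tag{14.99}$$

In order to construct Euler’s equation of motion we note that the external body force is the
gravitational force,

$$\mathbf{F} = -g\hat{k} = -\nabla(gz) \tag{14.100}$$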

and the pressure potential is given by Eq. (14.25) for incompressible fluids. Therefore, using 
Eqs (14.99), (14.100) and (14.25*), Euler’s equation reduces to 

$$-\tfrac{1}{2}\Omega^2\nabla(x^2 + y^2) = -\nabla(gz) - \nabla\left(\frac{p}{\rho}\right)$$

or

$$\Omega^2(x^2 + y^2) - 2gz - \frac{2p}{\rho} = \text{const.} \tag{14.101}$$

Since the surface of rotation under consideration is a free surface for which $p = p_0$ (a constant), its
equation would be given by

$$z = \frac{\Omega^2}{2g}(x^2 + y^2) + \text{const.} \tag{14.102}$$

This is an equation for a paraboloid of revolution about the z-axis, which represents the
free surface of a rotating mass of incompressible liquid with a constant angular velocity.
The vorticity of the liquid at any point and at any instant is

$$\boldsymbol{\omega} = \nabla\times\mathbf{q} = \nabla\times(\boldsymbol{\Omega}\times\mathbf{r}) = 2\boldsymbol{\Omega} \tag{14.103}$$

Thus the vorticity at each point of the fluid is twice the angular velocity of rotation
of the liquid mass. This gives a direct physical interpretation of vorticity, as twice the local angular
velocity of rotation of a body of fluid mass.
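
The rigid-rotation example can be checked numerically. The following is a minimal sketch, not part of the original text: the function names, the finite-difference step `h` and the default value of `g` are illustrative assumptions. It evaluates the curl of $\boldsymbol{\Omega}\times\mathbf{r}$ by central differences (which comes out as $2\Omega$ along the z-axis) and the free-surface height of Eq. (14.102).

```python
def rigid_rotation_velocity(omega, x, y):
    """In-plane velocity (q_x, q_y) of q = Omega x r for Omega = omega * k."""
    return (-omega * y, omega * x)


def vorticity_z(omega, x, y, h=1e-5):
    """z-component of curl q, estimated with central finite differences.

    The step h is an arbitrary illustrative choice.
    """
    dqy_dx = (rigid_rotation_velocity(omega, x + h, y)[1]
              - rigid_rotation_velocity(omega, x - h, y)[1]) / (2 * h)
    dqx_dy = (rigid_rotation_velocity(omega, x, y + h)[0]
              - rigid_rotation_velocity(omega, x, y - h)[0]) / (2 * h)
    return dqy_dx - dqx_dy


def free_surface_height(omega, s, g=9.81, z0=0.0):
    """Height z = omega^2 s^2 / (2 g) + z0 at distance s from the axis.

    z0 plays the role of the constant in Eq. (14.102); g = 9.81 m/s^2 is assumed.
    """
    return omega**2 * s**2 / (2.0 * g) + z0
```

Because the velocity field is linear in x and y, the finite-difference estimate of the curl agrees with $2\Omega$ up to rounding error at every point.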


14.16 REPRESENTATION OF VORTICES BY COMPLEX FUNCTIONS 

A representative two-dimensional vortex is a cylindrical vortex. By definition, a cylindrical
vortex of radius $a$ implies a constant vorticity $\omega$ throughout $r < a$, while the vorticity vanishes
outside. The strength of such a vortex is defined to be $k = -\omega a^2/2$. The fluid inside the
vortex moves as if it were a solid cylinder rotating with an angular velocity $\omega/2$. Outside the
tube the motion is irrotational.

The complex function that represents such a vortex motion is given by

$$f(z) = i k \ln(z) \tag{14.104}$$


Fig. 14.19 Natural drifting of a pair of vortices having equal but opposite strengths,
shown in (a). When such a pair of vortices is put in a stream having
a velocity equal but opposite to that of natural drifting of the
pair, the pair of vortices remains stationary, shown in (b).


where $k$ is the strength of a vortex. However, the function $f(z)$ defined in Eq. (14.104)
actually represents a circular vortex that has shrunk to the origin in such a way that its
strength $k$ has remained constant. Such a concentrated vortex is called a vortex filament
or point vortex.

Two equal and opposite vortices, placed on the y-axis at distances $\pm a$ from the x-axis,
can be represented by a complex function

$$f(z) = i k \ln\left(\frac{z - ia}{z + ia}\right) \tag{14.105}$$

The distance between the two vortices remains constant, but both the vortices move to the
right with the same horizontal velocity equal to what is called their induced velocity given
by $k/2a$. Here $k$ is the strength of each vortex (see Fig. 14.19(a)).

These two vortices may remain stationary, if they are placed in a uniform stream having 




$\Delta = 0$. Both $\xi$ and $\eta$ are positive constants.

The incompressible liquids that follow Eq. (14.111) (with $\Delta = 0$, of course) are called
Newtonian liquids. They simply obey $\sigma'_{ij} = 2\eta b_{ij} = 2\eta e_{ij}$, as $\Delta = 0$. Here $\eta$ is called
the coefficient of dynamic viscosity.


14.17.1 Navier-Stokes’ Equation for Imperfect Fluids 


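The general equation of motion for any fluid is given by

$$\rho\frac{Dq_i}{Dt} = \rho g_i + \frac{\partial\sigma_{ij}}{\partial x_j} \tag{14.112}$$

Now

$$\begin{aligned}
\frac{\partial\sigma_{ik}}{\partial x_k} &= -\frac{\partial p}{\partial x_i} + \left(\xi - \tfrac{2}{3}\eta\right)\frac{\partial}{\partial x_i}\left(\frac{\partial q_k}{\partial x_k}\right) + \eta\frac{\partial}{\partial x_k}\left(\frac{\partial q_i}{\partial x_k} + \frac{\partial q_k}{\partial x_i}\right) \\
&= -\frac{\partial p}{\partial x_i} + \left(\xi - \tfrac{2}{3}\eta\right)\frac{\partial}{\partial x_i}\left(\frac{\partial q_k}{\partial x_k}\right) + \eta\frac{\partial^2 q_k}{\partial x_k\,\partial x_i} + \eta\frac{\partial^2 q_i}{\partial x_k^2} \\
&= -\frac{\partial p}{\partial x_i} + \left(\xi + \tfrac{1}{3}\eta\right)\frac{\partial^2 q_k}{\partial x_i\,\partial x_k} + \eta\nabla^2 q_i \\
&= -\frac{\partial p}{\partial x_i} + \left(\xi + \tfrac{1}{3}\eta\right)\frac{\partial}{\partial x_i}(\nabla\cdot\mathbf{q}) + \eta\nabla^2 q_i
\end{aligned}$$

Therefore, the most general equations of motion for viscous fluids are given by

$$\rho\left(\frac{\partial q_i}{\partial t} + q_k\frac{\partial q_i}{\partial x_k}\right) = -\frac{\partial p}{\partial x_i} + \left(\xi + \tfrac{1}{3}\eta\right)\frac{\partial}{\partial x_i}(\nabla\cdot\mathbf{q}) + \eta\nabla^2 q_i - \rho\frac{\partial\Omega}{\partial x_i} \tag{14.113}$$

where $\Omega$ is the conservative body force potential given by $\mathbf{g} = -\nabla\Omega$. Written in vector
notation, Eq. (14.113) becomes

$$\frac{\partial\mathbf{q}}{\partial t} + (\mathbf{q}\cdot\nabla)\mathbf{q} = -\frac{1}{\rho}\nabla p - \nabla\Omega + \frac{1}{\rho}\left(\xi + \tfrac{1}{3}\eta\right)\nabla(\nabla\cdot\mathbf{q}) + \frac{\eta}{\rho}\nabla^2\mathbf{q}$$

or using the identity appearing just after Eq. (14.21),

$$\frac{\partial\mathbf{q}}{\partial t} - \mathbf{q}\times(\nabla\times\mathbf{q}) = -\nabla\left(\Omega + \tfrac{1}{2}q^2\right) + \frac{1}{\rho}\left(\xi + \tfrac{1}{3}\eta\right)\nabla(\nabla\cdot\mathbf{q}) + \frac{\eta}{\rho}\nabla^2\mathbf{q} - \frac{1}{\rho}\nabla p \tag{14.114}$$

For incompressible fluids

$$\frac{\partial\mathbf{q}}{\partial t} + (\mathbf{q}\cdot\nabla)\mathbf{q} = -\frac{1}{\rho}\nabla p - \nabla\Omega + \frac{\eta}{\rho}\nabla^2\mathbf{q} \tag{14.115}$$

Eq. (14.115) is called Navier-Stokes’ equation. The coefficient $\eta/\rho = \nu$ is sometimes
called the kinematic viscosity of the fluid as $\nu$ does not contain any mass factor in its
physical dimension.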


14.17.2 Flow in a Pipe: Poiseuille’s Formula 

An incompressible viscous fluid flows steadily in an axially symmetric pipe of radius $R$, in
the x direction.

Fig. 14.20 Flow of a viscous fluid in a tube


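From Fig. 14.20, it is clear that $\mathbf{q} = (q, 0, 0)$. Since the flow is steady we must have
$\partial\mathbf{q}/\partial t = 0$. Further we have

$$(\mathbf{q}\cdot\nabla)\mathbf{q} = \left(q\frac{\partial q}{\partial x}, 0, 0\right)$$

The very fact that the velocity has only an x-component means that we have neglected
the effect of the gravitational force (which is in the negative z-direction) on the flow. We
can do this if $q$ is large enough. This amounts to putting $\nabla\Omega = 0$. Since the fluid is
incompressible, $\nabla\cdot\mathbf{q} = 0$, which in the present case becomes $\partial q/\partial x = 0$, implying that $q$
is only a function of y and z, that is, $q = q(y,z)$. Thus Navier-Stokes’ equations for the
flow as depicted in Fig. 14.20 become

$$0 = -\frac{\partial p}{\partial x} + \eta\left(\frac{\partial^2 q}{\partial y^2} + \frac{\partial^2 q}{\partial z^2}\right), \qquad 0 = \frac{\partial p}{\partial y} \qquad\text{and}\qquad 0 = \frac{\partial p}{\partial z} \tag{14.116}$$

The last two of Eqs (14.116) imply that the pressure $p$ depends only on x, that is,
$p = p(x)$. Now the first of Eqs (14.116) gives

$$\frac{\partial p}{\partial x} = \eta\left(\frac{\partial^2 q}{\partial y^2} + \frac{\partial^2 q}{\partial z^2}\right)$$

Since the LHS is a function of x only and the RHS a function of y and z only, they can be set
equal to a constant independent of x, y and z:

$$\frac{\partial p}{\partial x} = \text{const.} = -a \quad\text{(say)}$$

Here $-a$ is nothing but the constant pressure gradient along the x-axis, giving

$$\frac{\partial^2 q}{\partial y^2} + \frac{\partial^2 q}{\partial z^2} = -\frac{a}{\eta}$$

In terms of polar coordinates the above equation becomes

$$\frac{1}{r}\frac{d}{dr}\left(r\frac{dq}{dr}\right) = -\frac{a}{\eta} \tag{14.117}$$

or

$$r\frac{dq}{dr} = -\frac{ar^2}{2\eta} + \beta$$

so that

$$q = -\frac{ar^2}{4\eta} + \beta\ln r + \gamma$$

where $\beta$ and $\gamma$ are constants of integration.

Now for $r = 0$, $q$ must be finite. Therefore, $\beta = 0$. For $r = R$, $q = 0$, which
means $\gamma = aR^2/4\eta$. Thus

$$q = \frac{a}{4\eta}(R^2 - r^2) \tag{14.118}$$

Now the flow rate is

$$Q = \int_0^R \rho\,q\,2\pi r\,dr = \frac{\pi\rho a R^4}{8\eta}$$

where $a$ is the horizontal pressure gradient,

$$a = \frac{\Delta p}{l} = \frac{\text{pressure difference between any two transverse vertical sections}}{\text{separation between the above two sections}}$$

Therefore we finally have for the flow rate

$$Q = \frac{\pi\rho R^4\Delta p}{8\eta l} \tag{14.119}$$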


Equation (14.119) is called Poiseuille’s formula, derived by Jean Léonard Marie Poiseuille in 1840
(also independently by Hagen perhaps two years earlier).
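
The parabolic velocity profile and the flow-rate formula can be cross-checked numerically. The sketch below is illustrative only: the function names, the midpoint-rule integration and the number of slices `n` are assumptions, not taken from the text. It integrates $\rho\,q\,2\pi r$ over the pipe cross-section and compares the result with $\pi\rho R^4\Delta p/8\eta l$.

```python
import math


def poiseuille_velocity(r, R, a, eta):
    """Axial speed q(r) = a (R^2 - r^2) / (4 eta), with a = Delta p / l."""
    return a * (R**2 - r**2) / (4.0 * eta)


def mass_flow_rate_numeric(R, a, eta, rho, n=10000):
    """Midpoint-rule estimate of the integral of rho * q * 2 pi r dr from 0 to R."""
    dr = R / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        total += rho * poiseuille_velocity(r, R, a, eta) * 2.0 * math.pi * r * dr
    return total


def mass_flow_rate_formula(R, dp, length, eta, rho):
    """Closed form Q = pi rho R^4 Delta p / (8 eta l)."""
    return math.pi * rho * R**4 * dp / (8.0 * eta * length)
```

The strong $R^4$ dependence is visible directly: halving the radius at fixed pressure gradient reduces `mass_flow_rate_formula` by a factor of 16.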


14.17.3 Reynolds’ Number 

Osborne Reynolds (1883) introduced a dimensionless number to characterise the state of
the viscous flow, defined by

$$R_e = \frac{\rho v l}{\eta} \tag{14.120}$$

where $R_e$ is Reynolds’ number, $\rho$ is the density of the fluid, $v$ is the velocity of the
fluid, $l$ is the characteristic length of the flow, and $\eta$ is the dynamical viscosity. For a
given Reynolds’ number the motion is expected to be dynamically similar, irrespective of
the wide range of variations in $\rho$, $v$, $l$ and $\eta$. In order to have dynamically similar states
for a given Reynolds’ number, a high velocity flow must be considered in a smaller spatial 
extent, whereas a low velocity flow can continue in a dynamically similar state for a larger 
spatial extent. Similarly, more viscous fluids can support a dynamically similar state for 
either higher velocities or over larger dimensions. 

Now what is it that changes with the flow velocity or increasing Reynolds’ number? 
Reynolds characterised the flow in two qualitative states. At lower Reynolds’ number the 
flow is streamlined or what is called laminar. Beyond a critical value of Reynolds’ number 
the flow becomes turbulent. Eddies develop in the flow, with nonvanishing circulation at all 




points, or in other words, dissipative vortices can develop in a random fashion. For example, 
for Poiseuille’s type of viscous flow in a pipe the critical Reynolds’ number is about 2800. 
The transition from laminar to turbulent flow takes place at different critical Reynolds’ 
number for different types of geometry of the flow. 
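
A short numerical illustration of dynamical similarity follows. It is a sketch under stated assumptions: the function names are hypothetical, and the water-like values used in the usage note are only examples.

```python
import math


def reynolds_number(rho, v, length, eta):
    """Reynolds' number R_e = rho * v * l / eta."""
    return rho * v * length / eta


def kinematic_viscosity(eta, rho):
    """Kinematic viscosity nu = eta / rho."""
    return eta / rho


def dynamically_similar(re_1, re_2, rel_tol=1e-9):
    """Two flows of the same geometry are taken as similar if their R_e agree."""
    return math.isclose(re_1, re_2, rel_tol=rel_tol)
```

For instance, with $\rho = 1000$, $\eta = 10^{-3}$ (SI units), a flow with $v = 0.1$ over $l = 0.02$ and a flow with $v = 0.2$ over $l = 0.01$ have the same Reynolds' number, which is the trade-off between velocity and spatial extent described above.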

14.17.4 Drag Coefficient for the Motion of a Sphere Moving in a Fluid 

A smooth sphere of cross-sectional area $A$ is moving with speed $v$ through a fluid having
density $\rho_f$. The quadratic law of drag force is

$$F_D = \tfrac{1}{2}C_D\rho_f A v^2 \tag{14.121}$$

where $C_D$ is called the drag coefficient which, for a sphere, is a function of Reynolds’ number

$$R_e = \frac{2r\rho_f v}{\eta_f}$$

$r$ being the radius of the sphere and $\eta_f$ the viscosity of the fluid. For laminar flow, $R_e < 1$,

$$C_D = \frac{24}{R_e}$$

in which case

$$F_D = 6\pi\eta_f r v \tag{14.122}$$

This is called Stokes’ law of viscous drag force and can be used to determine $\eta_f$ by measuring
$v$ at various values of $r$. Remember that Stokes’ law is valid only if $R_e < 1$. For a rigorous
proof of Stokes’ law, see Routh’s book, for example.

The transition from laminar to turbulent flow can occur in the range of Reynolds’ number
$1 < R_e < 10^3$, where an approximate numerical fit to the observational results is given by

$$\log C_D = \log(0.37) + 0.21\,\left|\log R_e - \log(4400)\right|^{1.68}$$

For turbulent flows, $10^3 < R_e < 2\times10^5$, and $C_D \approx 0.5$.

The above values of $C_D$ correspond to the motion of a sphere in a fluid.
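
The three regimes can be collected into one piecewise function. This is a sketch with several assumptions that are not stated in the text: the logarithms in the fit are taken to base 10, the bracket in the fit is used in absolute value (so that the fractional power is real), $C_D = 24/R_e$ is used below $R_e = 1$ because it reproduces Stokes' law, and all function names are hypothetical.

```python
import math


def sphere_reynolds_number(r, rho_f, v, eta_f):
    """R_e = 2 r rho_f v / eta_f for a sphere of radius r."""
    return 2.0 * r * rho_f * v / eta_f


def drag_coefficient_sphere(re):
    """Piecewise drag coefficient of a smooth sphere (assumptions in the lead-in)."""
    if re <= 0.0:
        raise ValueError("Reynolds' number must be positive")
    if re < 1.0:
        return 24.0 / re  # laminar regime, equivalent to Stokes' law
    if re < 1.0e3:
        x = abs(math.log10(re) - math.log10(4400.0))
        return 10.0 ** (math.log10(0.37) + 0.21 * x ** 1.68)
    if re <= 2.0e5:
        return 0.5  # turbulent regime quoted in the text
    raise ValueError("no value quoted in the text above R_e = 2e5")


def drag_force(r, rho_f, v, eta_f):
    """Quadratic drag law F_D = (1/2) C_D rho_f A v^2 with A = pi r^2."""
    c_d = drag_coefficient_sphere(sphere_reynolds_number(r, rho_f, v, eta_f))
    return 0.5 * c_d * rho_f * math.pi * r**2 * v**2
```

With these assumptions the fit gives values of roughly 26 at $R_e = 1$ and a little under 0.5 at $R_e = 10^3$, so it joins the laminar and turbulent regimes without large jumps.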

A quadratic drag law is quite valid for the motion of vehicles that have to overcome the
aerodynamic drag forces. Typically for a

heavy truck, $C_D = 1.00$,
bus, $C_D = 0.65$,
good car, $C_D = 0.30$,
motorcycle, $C_D = 0.90$, and
light aircraft, $C_D = 0.12$.

At very large Reynolds’ numbers, $R_e \gg 10^3$, the flow is characterised by the presence of
an extremely irregular variation of velocity with time at each point of the flow. The velocity 
at each point is seen to fluctuate around some mean value, but the amplitude of fluctuation 
is not small as compared to the magnitude of the mean velocity itself. Turbulent eddies are 
formed which also change in size and finally disappear, and this process continues leading 





Fig. 14.21 Diagram for problem 14.1 

a small orifice. Find the velocity of the fluid relative to the tube as a function of the
remaining column length h (see Fig. 14.22).

Fig. 14.22 Diagram for problem 14.2


14.3 A long closed vertical cylinder of constant volume is completely filled with an incompressible
fluid of density $\rho$ except for a very small bubble of an insoluble ideal gas
that is trapped beneath an inverted cup at a distance $h$ below the top of the liquid
(as it may happen in an oil well pipe). The absolute pressure in the liquid at the top
of the cylinder is $P_0$ and at the site of the bubble $P_h = P_0 + \rho g h$. The cup is now
overturned and the bubble is allowed to escape. What happens to the pressure at
various levels, for example, at the top, at the bottom of the cup, and at the bottom
of the well?

14.4 Show that 

(i) in the plane two-dimensional motion of an incompressible fluid, the vorticity of 
any particle remains constant, 




and is given by $g(\mathbf{r}, t) = 0$, show that the boundary condition is now given by

$$\frac{\partial g}{\partial t} + \mathbf{q}\cdot\nabla g = 0$$

14.11 If a body of liquid is revolving about a vertical axis with an angular velocity $\boldsymbol{\omega} =
f(r)\hat{k}$, then show that the motion can be irrotational if $\mathbf{q} = r f(r)\hat{\theta}$, and $f(r) \propto r^{-2}$.

14.12 Water from a running faucet is found to be tapering downwards until it breaks up 
into fast-moving droplets. How does the diameter of the continuous part of the flow 
change with distance from the mouth of the faucet? 

14.13 When water is running out of a tap with sufficient speed, it is seen to form a nice 
circular halo of streaming water on the flat surface of the basin, due to a sudden 
changeover from shooting to streaming flow. Can you find the radius of transition? 
It might well be a research problem. 

14.14 Find the equilibrium distribution of the acceleration due to gravity, and density of a 
perfect fluid near the surface of a star (assume a parallel atmosphere, that is, neglect 
the effect of the finite radius of curvature of the surface of the star). 

14.15 A horizontally oriented circular plate is rotated with a very high angular speed. 
Another identical plate is almost on top of it with a small parallel gap between the 
two. The top disc will corotate with the bottom one due to frictional coupling. Now 
the centrifugal force acting on the gas molecules reduces the pressure of air inside, 
and the plates are attracted to each other. What is the magnitude of the force that 
will be required to keep the plates separated with a fixed gap? 

14.16 The normal functioning of a heart requires that all heart muscles receive the supply
of fresh blood in adequate amount. During a heart block, one of the coronary arteries
develops an obstruction. As a remedy, a surgeon removes a piece of vein from
some other part of the patient’s body and grafts it across the coronary obstruction.
This grafted coronary bypass restores the normal flow of blood to the heart muscle.
Assume that the flow rate after the obstruction develops drops from $Q$ to $Q_0$. Using
Poiseuille’s equation for viscous flow in a pipe, find the required length and diameter
of the grafted vein.





APPENDIX A1 


Coordinate Frames 


All natural phenomena occur in space and in the course of time. In order to characterise a 
point in space where an event occurs, every point in space has to be labeled. It is assumed 
that free space or vacuum is homogeneous and isotropic, that is, all points in free space 
and all directions in it are equivalent. Points in space are labeled by means of a coordinate 
system. Any point in space can be chosen to be the origin of the coordinate frame. Thus 
each coordinate frame must have an origin. The origin of any physical coordinate frame is 
always associated with a material particle. We shall now describe a few types of coordinate 
systems that are useful for descriptions of natural phenomena. 

In the three dimensional Euclidean space where we live, any point can be uniquely specified
by an ordered set of three real numbers called, by definition, the coordinates of the
point. A coordinate frame is designed in such a manner that this assignment of the
coordinates to all points in the 3-D Euclidean space is unambiguously performed. However,
there can be coordinate frames that are restricted to a limited region of space. Now, at
any given point, it is possible to find a set of three lines (curved or straight) along
each of which only one of these three coordinates changes while the other two
remain constant. Such a set of three lines, known as coordinate lines, exists for every
point in the space over which the coordinate frame is defined. If these coordinate lines at all
points are found to be mutually perpendicular, the coordinate frame under consideration is
called an orthogonal coordinate frame, and the coordinates are called orthogonal
coordinates. Furthermore, if any or all coordinate lines are found to be not straight lines
(in the Euclidean sense), the coordinates are called curvilinear coordinates.


A1.1 ORTHOGONAL COORDINATE FRAMES

A1.1.1 Rectangular Cartesian Coordinate Frame

It is the simplest and most natural coordinate system. It is constructed by tracing from
the origin O three mutually perpendicular straight lines, denoted by OX, OY and OZ and
called x-, y- and z-axes respectively (see Fig. A.1). Choosing some convenient units, these
axes are graduated, so that any point on these axes has a measure of length expressed as
a number in the above unit, representing the distance from the origin, of the point along
the axis concerned. If the point P is not on any one of these axes, then the rectangular
projections of the line OP are drawn on these axes giving, say, OL = x, OM = y and ON
= z, respectively. The ordered set of these three numbers (x, y, z) is by definition called
the rectangular Cartesian coordinates of the point P with respect to the given Cartesian
coordinate frame OXYZ. The coordinate lines drawn through any point P are always parallel
to the respective coordinate axes, which are by definition mutually perpendicular. Hence,
this is an orthogonal coordinate frame.

Fig. A.1 Rectangular 3-D Cartesian coordinate frame


Sometimes one may like to define a direction in space for point P. In that case, a knowledge
of distance OP = r is redundant. Instead one defines ratios $n_1 = OL/OP = x/r$,
$n_2 = OM/OP = y/r$ and $n_3 = ON/OP = z/r$. The set $(n_1, n_2, n_3)$ is called the direction
cosines of the direction OP. Obviously they satisfy $n_1^2 + n_2^2 + n_3^2 = 1$, implying that only
two of them are independent, which is perfectly justified as a direction requires only two
independent coordinates to be specified.
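
As a quick illustration of the direction cosines, the following minimal sketch (the function name is a hypothetical choice) computes $(n_1, n_2, n_3)$ for a point and lets one confirm that their squares sum to unity.

```python
import math


def direction_cosines(x, y, z):
    """Return (n1, n2, n3) = (x/r, y/r, z/r) for the direction OP."""
    r = math.sqrt(x * x + y * y + z * z)
    if r == 0.0:
        raise ValueError("the origin has no direction")
    return (x / r, y / r, z / r)
```

For the point (1, 2, 2), for example, $r = 3$ and the direction cosines are (1/3, 2/3, 2/3).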

Since vectors are defined to be directed magnitudes, $\overrightarrow{OP} = \mathbf{r}$ will be the most natural
notation for position vector of the point P with reference to the origin. So a unit vector in
a given direction is the vector having unit magnitude. Let us use hats for representing the
unit vectors, for example $\hat{\mathbf{r}}$ for the unit vector along $\mathbf{r}$. We use the unit vectors $\hat{\imath}$, $\hat{\jmath}$ and
$\hat{k}$ for the x-, y- and z-axes respectively. They constitute an orthogonal basis set of vectors.
We know from the rules of addition of vectors that several vectors can be added simply
by either constructing parallelepipeds with sides and directions given by the vectors or by
following from tip to tip of the vectors in succession (the latter operation is allowed by the
very definition of vectors; being simply the directed magnitudes, they can be transported
parallel to themselves to anywhere in space), thus giving

$$\mathbf{r} = x\hat{\imath} + y\hat{\jmath} + z\hat{k} \tag{A1.1}$$

and

$$\hat{\mathbf{r}} = n_1\hat{\imath} + n_2\hat{\jmath} + n_3\hat{k}$$

The rectangular Cartesian frames that have axes pointing towards fixed directions in 
space with the origin not accelerated with respect to most distant stars and galaxies are the 
nearest approximations to inertial frames. 

Very often, one needs to construct a number of rectangular Cartesian frames with their
origins separated and axes rotated by fixed amounts. The coordinate relations between the
sets of coordinates, $(x_1, x_2, x_3)$ and $(x'_1, x'_2, x'_3)$, in two such rectangular Cartesian frames,
say S and S' respectively, are given by

$$x'_i = \sum_{j=1}^{3} a_{ij}x_j + b_i \qquad i = 1,2,3 \tag{A1.2}$$

where $\mathbf{b} = \overrightarrow{OO'} = b_1\hat{\imath} + b_2\hat{\jmath} + b_3\hat{k}$ is the separation between the two origins and
$a_{ij} = \cos\angle(x'_i, x_j)$ are the direction cosines between unit vectors along coordinate axes $x'_i$
and $x_j$. The derivation of the inverse relation, namely $x_i = a_{ji}x'_j + c_i$, is left as an
exercise.
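
The transformation (A1.2) and its inverse can be tried out numerically. The sketch below is only an illustration: the function names are hypothetical, the rotation is taken about the z-axis for definiteness, and the inverse uses the orthogonality of the matrix of direction cosines, which amounts to choosing $c_i = -a_{ji}b_j$.

```python
import math


def rotation_about_z(phi):
    """Direction cosines a_ij for axes rotated by the angle phi about the z-axis."""
    c, s = math.cos(phi), math.sin(phi)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]


def to_primed(a, b, x):
    """x'_i = a_ij x_j + b_i, as in Eq. (A1.2)."""
    return [sum(a[i][j] * x[j] for j in range(3)) + b[i] for i in range(3)]


def to_unprimed(a, b, x_primed):
    """Inverse relation x_i = a_ji (x'_j - b_j), valid because a_ij is orthogonal."""
    return [sum(a[j][i] * (x_primed[j] - b[j]) for j in range(3)) for i in range(3)]
```

Applying `to_primed` followed by `to_unprimed` with the same `a` and `b` returns the original coordinates up to rounding error.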

It should however be pointed out that rectangular Cartesian coordinate frames are just a
special class of all possible Cartesian coordinate frames. A Cartesian frame, by definition,
has all the axes as straight lines fixed in space. The angular relation between the axes
is a matter of choice. When the axes are not mutually perpendicular at the origin, such
a Cartesian frame is called an oblique Cartesian frame. We shall briefly discuss them in
section A1.2.

A1.1.2 Spherical Polar Coordinate Frame

In this frame any two of the rectangular Cartesian coordinate axes are retained intact,
and the third coordinate axis is defined directly along the direction of the point whose
coordinates are to be found out. So depending on the location of the point P, this new axis,
called the radial axis, would have to orient itself in that direction (see Fig. A.2). So the
radial axis is not a fixed axis in space, and therefore any such frame cannot be regarded as
an inertial frame. Then out of the two Cartesian coordinate axes, one (usually the z-axis)
is marked as the polar axis, and the other (usually the x-axis) as the equatorial reference
axis in the equatorial or azimuthal plane of the frame. Being Cartesian in nature, both the
polar axis and the azimuthal plane are fixed in space.

Now the spherical polar coordinates are defined as follows: The radial coordinate $r$ is the
projection of OP on to itself, the radial axis. Hence, $r = OP$. The polar coordinate $\theta$ is
defined to be the angle between the polar axis and OP, that is, $\theta = \angle ZOP$. So $\theta$ is zero


Fig. A.2 Spherical polar coordinate system


when OP is aligned with the z-axis, is $\pi/2$ when OP lies in the equatorial plane, and attains
its maximum value of $\pi$ when pointing towards the $-z$ axis. The azimuthal coordinate
$\phi$, the third, is defined to be the angle between the x-axis and the equatorial projection
of OP. This can vary between 0 and $2\pi$. So $r$ = constant defines the surface of a sphere,
$\theta$ = constant defines the surface of a cone with the apex at the origin and axis coinciding
with the polar axis, and $\phi$ = constant represents a half plane bounded on one side by the
entire polar axis. So through any point P, out of the three coordinate lines, only the radial
coordinate line is a straight line, the other two are circular arcs. Hence, this is a curvilinear
coordinate system.

Now, if the equatorial projection of OP is denoted by $OQ = \rho$, it is easy to see that
$\rho = r\sin\theta$. So the geometrical correspondence between the rectangular Cartesian and the
spherical polar coordinate systems with a common origin and z-axis as the polar axis is

$$x = r\sin\theta\cos\phi \qquad y = r\sin\theta\sin\phi \qquad\text{and}\qquad z = r\cos\theta \tag{A1.3}$$

and in vector notation

$$\mathbf{r} = r\hat{\mathbf{r}} \tag{A1.4}$$

Obviously,

$$\hat{\mathbf{r}} = \sin\theta\cos\phi\,\hat{\imath} + \sin\theta\sin\phi\,\hat{\jmath} + \cos\theta\,\hat{k} \tag{A1.5}$$




One can define two other unit vectors in the spherical polar coordinate system in the same
spirit as they are defined for the rectangular Cartesian frame. For example, $\hat{\boldsymbol{\theta}}$ should be
defined to point in the direction at P in which only the $\theta$-coordinate increases, which is in
fact given by

$$\hat{\boldsymbol{\theta}} = \hat{\mathbf{r}}\left(\theta + \tfrac{\pi}{2}, \phi\right) = \cos\theta\cos\phi\,\hat{\imath} + \cos\theta\sin\phi\,\hat{\jmath} - \sin\theta\,\hat{k} \tag{A1.6}$$

Similarly, $\hat{\boldsymbol{\phi}}$ is defined to point in the direction from P in which only the $\phi$-coordinate
increases. This direction lies in a plane parallel to the equatorial plane, hence

$$\hat{\boldsymbol{\phi}} = \hat{\mathbf{r}}\left(\theta = \tfrac{\pi}{2}, \phi + \tfrac{\pi}{2}\right) = -\sin\phi\,\hat{\imath} + \cos\phi\,\hat{\jmath} \tag{A1.7}$$

It is now easy to check that the three unit vectors $(\hat{\mathbf{r}}, \hat{\boldsymbol{\theta}}, \hat{\boldsymbol{\phi}})$ defined at the point P are locally
tangential to the coordinate lines and are also perpendicular to each other, and hence form
a complete set of locally orthogonal basis (unit) vectors. So the spherical polar coordinate
systems are orthogonal curvilinear coordinate systems.
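
The orthogonality of the spherical polar unit vectors can be confirmed numerically. The sketch below is illustrative; the helper names `dot` and `cross` and the function name are assumptions, and the formulas used are the standard Cartesian components of $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$ and $\hat{\boldsymbol{\phi}}$.

```python
import math


def spherical_unit_vectors(theta, phi):
    """Cartesian components of the unit vectors r_hat, theta_hat, phi_hat at (theta, phi)."""
    st, ct = math.sin(theta), math.cos(theta)
    sp, cp = math.sin(phi), math.cos(phi)
    r_hat = (st * cp, st * sp, ct)
    theta_hat = (ct * cp, ct * sp, -st)
    phi_hat = (-sp, cp, 0.0)
    return r_hat, theta_hat, phi_hat


def dot(u, v):
    """Scalar product of two 3-vectors."""
    return sum(ui * vi for ui, vi in zip(u, v))


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])
```

At any $(\theta, \phi)$ the three vectors have unit length, are mutually perpendicular, and form a right-handed set in the order $(\hat{\mathbf{r}}, \hat{\boldsymbol{\theta}}, \hat{\boldsymbol{\phi}})$, that is, `cross(r_hat, theta_hat)` reproduces `phi_hat`.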

A1.1.3 Cylindrical Coordinate Frame

In this frame, the x-axis and z-axis are as usual kept intact, but the third axis, called the
cylindrical radial axis, is redefined so as to point along OQ, the equatorial projection of OP.
So the cylindrical radial coordinate becomes $OQ = \rho$. The azimuthal coordinate $\phi$ of the
spherical polar coordinate system continues to be the same except for a change of its name
to cylindrical polar coordinate. The third coordinate is chosen to be the original Cartesian
z-coordinate. So in terms of the cylindrical polar coordinates $(\rho, \phi, z)$,

$$x = \rho\cos\phi \qquad y = \rho\sin\phi \qquad\text{and}\qquad z = z \tag{A1.8}$$

and in vector notation

$$\mathbf{r} = \rho\cos\phi\,\hat{\imath} + \rho\sin\phi\,\hat{\jmath} + z\hat{k} \tag{A1.9}$$

The distinction between the nomenclatures of ‘spherical polar’ and ‘cylindrical polar’ arises
from the fact that $r$ = constant represents the surface of a sphere but $\rho$ = constant
represents the surface of an infinite cylinder. They are ‘polar’ because of the symmetry
about the polar z-axis. Proceeding similarly, the orthogonal set of unit vectors at the point
P for the cylindrical polar coordinate frame is given by

$$\hat{\boldsymbol{\rho}} = \cos\phi\,\hat{\imath} + \sin\phi\,\hat{\jmath} \qquad \hat{\boldsymbol{\phi}} = -\sin\phi\,\hat{\imath} + \cos\phi\,\hat{\jmath} \qquad\text{and}\qquad \hat{k} \tag{A1.10}$$

So this is also a case of a curvilinear orthogonal coordinate system.

In fact, one can slightly redefine the instantaneous coordinate axes of the cylindrical polar
frame as the same old rectangular Cartesian frame but rotated about its z-axis by an angle
$\phi$, so that the cylindrical radial axis is redefined as the new x-axis, say x'-axis. Therefore,
the new Cartesian y-axis, say y'-axis, will also be rotated by an angle $\phi$ in the common
plane of x-y and x'-y' axes. The unit vectors $(\hat{\imath}', \hat{\jmath}')$ are just the same as $(\hat{\boldsymbol{\rho}}, \hat{\boldsymbol{\phi}})$, and hence
from Eq. (A1.10)

$$x' = x\cos\phi + y\sin\phi \qquad\text{and}\qquad y' = -x\sin\phi + y\cos\phi \tag{A1.11}$$



Equation (A1.11) can also be obtained from Eq. (A1.2) for the above rotation by an angle
$\phi$ about the z-axis.

The plane polar frames are just a special case of both the spherical polar and cylindrical
polar frames, as they are confined to the equatorial plane. The set $(\rho, \phi)$ are the coordinates.

A1.1.4 Orthogonal Parabolic Coordinate Frame

We move from the cylindrical polar coordinate frame to an orthogonal parabolic coordinate
system having the coordinates $\chi$, $\zeta$, $\phi$. This is effected by the relations

$$z = \tfrac{1}{2}(\chi - \zeta) \qquad \rho = \sqrt{\chi\zeta} \tag{A1.12}$$

The coordinates $\chi$ and $\zeta$ take values from 0 to $\infty$. The surfaces of constant $\chi$ and $\zeta$ are
two families of paraboloids of revolution, with the z-axis still as the axis of symmetry. These
equations can be rewritten in terms of the spherical polar radial coordinate $r = \sqrt{\rho^2 + z^2} =
(\chi + \zeta)/2$, as

$$\chi = r + z \qquad\text{and}\qquad \zeta = r - z \tag{A1.13}$$
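
A round trip between the cylindrical and the parabolic coordinates makes the two sets of relations concrete. This is a minimal sketch with hypothetical function names; the parabolic coordinates are written `chi` and `zeta`.

```python
import math


def parabolic_from_cylindrical(rho, z):
    """(chi, zeta) = (r + z, r - z) with r = sqrt(rho^2 + z^2)."""
    r = math.hypot(rho, z)
    return (r + z, r - z)


def cylindrical_from_parabolic(chi, zeta):
    """(rho, z) = (sqrt(chi * zeta), (chi - zeta) / 2)."""
    return (math.sqrt(chi * zeta), 0.5 * (chi - zeta))
```

For $\rho = 3$, $z = 4$ one has $r = 5$, so $\chi = 9$ and $\zeta = 1$, and the inverse relations return $\rho = \sqrt{9} = 3$ and $z = (9 - 1)/2 = 4$.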

A1.1.5 Prolate Spheroidal Orthogonal Coordinate Frame

The prolate spheroidal coordinates $(\xi, \eta, \phi)$ are defined as

$$x = \frac{d}{2}\sqrt{(\xi^2 - 1)(1 - \eta^2)}\cos\phi \qquad y = \frac{d}{2}\sqrt{(\xi^2 - 1)(1 - \eta^2)}\sin\phi \qquad z = \frac{d}{2}\xi\eta \tag{A1.14}$$

where $1 \le \xi < \infty$, $-1 \le \eta \le 1$, $0 \le \phi \le 2\pi$, and $d$ = interfocal distance.

The surfaces of constant $\xi$ are confocal ellipsoids $4z^2/d^2\xi^2 + 4\rho^2/d^2(\xi^2 - 1) = 1$,
of which, say, F and F' are the foci. The surfaces of constant $\eta$ are the hyperboloids
$4z^2/d^2\eta^2 - 4\rho^2/d^2(1 - \eta^2) = 1$, also with the same foci F and F'. The distances $r_1$ and
$r_2$ to the points F and F' on the z-axis, for which $z = \pm d/2$, are given by

$$r_1 = \sqrt{\left(z - \tfrac{d}{2}\right)^2 + \rho^2} \qquad\text{and}\qquad r_2 = \sqrt{\left(z + \tfrac{d}{2}\right)^2 + \rho^2}$$

which means

$$r_1 = \frac{d(\xi - \eta)}{2} \qquad\text{and}\qquad r_2 = \frac{d(\xi + \eta)}{2} \tag{A1.15}$$
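
The relation between the prolate spheroidal coordinates and the distances to the two foci can be verified numerically. The sketch below assumes the standard relation $z = d\xi\eta/2$ together with the expressions for x and y above; the function names are hypothetical.

```python
import math


def prolate_to_cartesian(xi, eta, phi, d):
    """Cartesian (x, y, z) from prolate spheroidal (xi, eta, phi), interfocal distance d."""
    s = 0.5 * d * math.sqrt((xi * xi - 1.0) * (1.0 - eta * eta))
    return (s * math.cos(phi), s * math.sin(phi), 0.5 * d * xi * eta)


def focal_distances(x, y, z, d):
    """Distances (r1, r2) from the point to the foci at z = +d/2 and z = -d/2."""
    rho_sq = x * x + y * y
    r1 = math.sqrt((z - 0.5 * d) ** 2 + rho_sq)
    r2 = math.sqrt((z + 0.5 * d) ** 2 + rho_sq)
    return (r1, r2)
```

For $\xi = 1.5$, $\eta = 0.3$ and $d = 2$ the two distances come out as 1.2 and 1.8, in agreement with $d(\xi - \eta)/2$ and $d(\xi + \eta)/2$, and their sum $d\xi$ depends on $\xi$ alone, as expected for points on one ellipsoid.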


Some of the most important orthogonal curvilinear coordinate systems are introduced 
here in detail, because orthogonality will be found to be the most important criterion for 
separability of coordinates for solving the Hamilton-Jacobi partial differential equation of 
motion, to be discussed in chapter 10. 




A1.2 NONORTHOGONAL OR OBLIQUE COORDINATE FRAMES

When one or a few or all the coordinate lines of a coordinate system deviate from the 
condition of being mutually perpendicular, the coordinate system is called nonorthogonal 
or oblique. In such cases, the unit vectors defined along the coordinate lines are not mutually 
orthogonal at all points. We know that any three linearly independent vectors can form a 
basis set that is sufficient to describe any arbitrary vector in the 3-D Euclidean space. But 
the choice of the basis set is not quite unique for oblique coordinate frames. The coordinates 
will obviously depend on the choice of the base vectors. Hence one can have more than one 
choice of the set of coordinates for any given point, depending on the choice of the basis 
set. One obvious choice of the basis set is of course the standard one, namely the unit 
vectors along the coordinate lines, and the coordinates obtained correspondingly are called 
contravariant coordinates, and the base vectors are called covariant basis vectors. The other 
equally feasible choice of the basis vectors is to take the normal directions to the coordinate 
surfaces at the point under consideration (coordinate surfaces at a point are those surfaces 
over which a particular coordinate remains constant). In the case of orthogonal coordinate 
systems, the normal to the coordinate surface for any coordinate at any point is perfectly 
aligned with the local tangent to the coordinate line for the same coordinate. So one may 
argue, why should not the normals to the coordinate surfaces be considered for the candidate 
of the base vectors? In fact, if we have to deal with vectors of all possible kinds arising in 
diverse physical situations, one cannot help but accept two mutually complementary basis 
sets of vectors at each point of the space under any frame of reference, particularly for the 
oblique ones. The latter basis set of vectors, called the contravariant base vectors, are used 
to define the covariant coordinates of the same point, as well as the covariant components 
of any vector in general. 

A1.2.1 Oblique Cartesian Coordinate Frame

This is the simplest case of a nonorthogonal coordinate frame having all the axes as
straight lines fixed in space, but the angles between the axes are in general not right angles.
We demonstrate a case in two dimensions (see Fig. A.3). OX and OY are the rectangular
Cartesian axes. Let us choose OY' as an oblique axis that makes an angle $\alpha$ ($\ne \pi/2$) with
the axis OX, so that XOY' can represent an oblique Cartesian frame.


The directions of the two sets of basis vectors at an arbitrary point P are shown in the
figure. We have followed the standard conventions for denoting contravariant and covariant
components of any vector, say the position vector, with a superscript index for contravariance
and subscript index for covariance. The directions of the covariant base vectors are
denoted in the figure by $\mathbf{g}_1$ and $\mathbf{g}_2$, whereas for contravariant ones by $\mathbf{g}^1$ and $\mathbf{g}^2$, as we are
dealing with a 2-D case. The base vectors need not be unit vectors; in fact, in most cases
they are not. In this particular example, we have for the position coordinates of P

$$\begin{aligned}
x^1 &= x - y\cot\alpha & y^1 &= y\csc\alpha \\
x_1 &= x & y_1 &= x\cos\alpha + y\sin\alpha
\end{aligned} \tag{A1.16}$$


Fig. A.3 Oblique 2-D Cartesian coordinate frame showing covariant and
contravariant basis vectors

where $x, y$ are the rectangular Cartesian coordinates, $x^1, y^1$ are the contravariant position
coordinates, and $x_1, y_1$ are the covariant position coordinates in the oblique Cartesian
frame, of the same point P.

Obviously, for $\alpha = \pi/2$, all these definitions are equivalent. So there is no ambiguity
in defining the rectangular Cartesian coordinates of any point. In orthogonal curvilinear
coordinate frames, the directions of the two basis sets of vectors are also identical but these,
not always being unit vectors, have usually different magnitudes, the determination of which
will be considered in Appendix A2.
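
The two kinds of oblique coordinates can be compared numerically. The sketch below is an illustration under the assumption that the covariant base vectors are the unit vectors along OX and OY' (so that the components take the form $x^1 = x - y\cot\alpha$, $y^1 = y\csc\alpha$, $x_1 = x$, $y_1 = x\cos\alpha + y\sin\alpha$); the function name is hypothetical.

```python
import math


def oblique_components(x, y, alpha):
    """Contravariant and covariant coordinates of (x, y) for oblique axes at angle alpha."""
    contravariant = (x - y / math.tan(alpha), y / math.sin(alpha))
    covariant = (x, x * math.cos(alpha) + y * math.sin(alpha))
    return contravariant, covariant
```

A useful check is that the contraction $x^1 x_1 + y^1 y_1$ reproduces the squared distance $x^2 + y^2$ for any $\alpha$, and that both sets reduce to $(x, y)$ when $\alpha = \pi/2$.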

We cite just one example of nonorthogonal curvilinear coordinate frames. 

A1.2.2 A 2-D Nonorthogonal Parabolic Frame

Consider the following parabolic coordinates $u$ and $v$ in the first quadrant of a rectangular
Cartesian frame defined in a plane, given by

$$x = uv \qquad\text{and}\qquad y = \tfrac{1}{2}(u^2 + v^2) \tag{A1.17}$$





APPENDIX A2 


Vector Calculus 


The vector formulation started appearing in physics through the work of Josiah Willard
Gibbs (Vector Analysis 1901) and is extremely useful in representing and dealing with physical
quantities having both magnitude and direction. In this appendix we summarise the
necessary parts of vector calculus which will be used in the main text of the book. We shall
be dealing with ordinary vectors without making any distinction between the contravariant 
and covariant types until we come to section A2.9. In orthogonal coordinate systems such 
distinctions are not so important. 


A2.1 INTRODUCTION TO KRONECKER DELTA AND LEVI-CIVITA SYMBOLS

We would like to introduce Kronecker’s $\delta_{ij}$ symbol through the following elementary scalar
product relations:

$$\delta_{ij} = \hat{i}\cdot\hat{j} \tag{A2.1}$$

where $\hat{i}$ and $\hat{j}$ are any two of the orthogonal basis vectors (unit vectors) in a 3-D Euclidean
space, so that

$$\delta_{ij} = 1 \ \text{provided } i = j, \qquad \delta_{ij} = 0 \ \text{otherwise} \tag{A2.2}$$

So $\delta_{ij}$ represents the unit matrix of rank 3.

The following properties of $\delta_{ij}$ can easily be proved:

$$\delta_{ij} = \delta_{ji} \qquad \delta_{ij}\delta_{kl} \neq \delta_{ik}\delta_{jl} \qquad \delta_{ij}\delta_{jk} = \delta_{ik} \qquad \text{and} \qquad \delta_{ii} = 3 \tag{A2.3}$$

where Einstein’s summation convention, namely that a repeated index in a term implies a
sum over that index running from 1 to 3, is assumed. So when we use Einstein’s summation
convention to express any vector $\mathbf{a}$, it is simply $\mathbf{a} = a_i\hat{i}$, since $i$, being the repeated
index here, implies the sum $a_1\hat{1} + a_2\hat{2} + a_3\hat{3}$. One should keep in mind that $\hat{1} = \hat{i}$, $\hat{2} = \hat{j}$
and $\hat{3} = \hat{k}$, if we link them with the notation used in Appendix A1. Following this
convention, the scalar product of any two vectors $\mathbf{a}$ and $\mathbf{b}$ in a 3-D Euclidean space is
given as

$$\mathbf{a}\cdot\mathbf{b} = (a_i\hat{i})\cdot(b_j\hat{j}) \qquad i, j = 1, 2, 3$$

Bringing the scalar coefficients together and using Eqs (A2.1) - (A2.3), we get

$$\mathbf{a}\cdot\mathbf{b} = a_ib_j(\hat{i}\cdot\hat{j}) = a_ib_j\delta_{ij} = a_ib_i \tag{A2.4}$$




which obviously agrees with the usual definition of the scalar product of two vectors. Note
that $b_j\delta_{ij} = \delta_{ij}b_j = b_i$.

We would like to define the Levi-Civita symbol $\epsilon_{ijk}$ through the equation

$$\epsilon_{ijk} = \hat{i}\cdot(\hat{j}\times\hat{k}) \tag{A2.5}$$

where $\times$ symbolises the usual vector product.

It is easy to check that $\epsilon_{ijk}$ is antisymmetric in $i, j, k$, that is, it changes sign under
any odd permutation of $i, j, k$. Thus, $|\epsilon_{ijk}| = 1$ only if all the indices $i$, $j$ and $k$ are
different, while $\epsilon_{ijk} = 0$ if any two or all the indices are identical. So out of a total of
$3 \times 3 \times 3 = 27$ elements of $\epsilon_{ijk}$, only 6 survive, and the remaining 21 elements are simply
zeros. For a right-handed coordinate system, $\epsilon_{ijk} = +1$ if $i, j, k$ are in cyclic order of 1, 2, 3
and $\epsilon_{ijk} = -1$ if $i, j, k$ are in the reverse cyclic order of 1, 2, 3. For a left-handed system,
the above rules are exactly opposite.

Now the $i$th component of the vector (cross) product of any two arbitrary vectors $\mathbf{a}$ and
$\mathbf{b}$ can be written as

$$(\mathbf{a}\times\mathbf{b})_i = \hat{i}\cdot(\mathbf{a}\times\mathbf{b}) = \hat{i}\cdot\{(a_j\hat{j})\times(b_k\hat{k})\} \qquad i, j, k = 1, 2, 3$$

giving

$$(\mathbf{a}\times\mathbf{b})_i = \hat{i}\cdot(\hat{j}\times\hat{k})\,a_jb_k = \epsilon_{ijk}a_jb_k \tag{A2.6}$$

Here in Eq. (A2.6), $i$ is not a repeated index on the right-hand side, but both $j$ and
$k$ are. On expansion, this actually represents nine terms, but only two of them survive,
thus matching the expectation from a vector product. Levi-Civita symbols
are also known as ‘permutation symbols’, as they are used to define the determinant of any,
say, $3 \times 3$ matrix $A$, as follows


$$\det\|A\| = \epsilon_{ijk}A_{i1}A_{j2}A_{k3}$$

The Levi-Civita symbols are connected to the Kronecker delta symbols through the following
identity

$$\epsilon_{ijk}\epsilon_{lmn} = \det\begin{pmatrix}\delta_{il} & \delta_{im} & \delta_{in}\\ \delta_{jl} & \delta_{jm} & \delta_{jn}\\ \delta_{kl} & \delta_{km} & \delta_{kn}\end{pmatrix} \tag{A2.7}$$

Equation (A2.7) readily leads to the following identities

$$\epsilon_{ijk}\epsilon_{ilm} = \delta_{jl}\delta_{km} - \delta_{jm}\delta_{kl}$$

$$\epsilon_{ijk}\epsilon_{ijl} = 2\delta_{kl} \tag{A2.8}$$

$$\epsilon_{ijk}\epsilon_{ijk} = 2\delta_{kk} = 6$$

Using the first of the Eqs (A2.8), the following identity can easily be established

$$\epsilon_{ijk}\epsilon_{ilm} + \epsilon_{ijm}\epsilon_{ikl} + \epsilon_{ijl}\epsilon_{imk} = 0 \tag{A2.9}$$

It is also easy to check that $\epsilon_{ijk}\delta_{jk} = 0$. Most of these relations will be required in chapter
9. It may be noted that all polar vectors, such as position vector, velocity vector, etc.,
can be expressed without involving any $\epsilon_{ijk}$ symbol, but all axial vectors, such as angular
velocity, angular momentum, magnetic field, etc., will irreducibly involve an $\epsilon_{ijk}$ factor in
their expressions. In fact, the matrix representation of $\epsilon_{ijk}$ explicitly reflects the handedness
of the coordinate frame used. Since the product of two $\epsilon_{ijk}$ matrices is always expressible
in terms of the Kronecker delta symbols, the product of any two axial vectors gives a polar
vector. Arguing similarly, we can say that the scalar product between an axial vector and
a polar vector cannot give an absolute scalar, but a pseudoscalar, whose sign would depend
on the handedness (right or left) of the coordinate frame.
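
The index manipulations above can be checked by brute force. The sketch below is only an illustration, with hypothetical helper names and indices running over 0, 1, 2 rather than 1, 2, 3. It builds $\epsilon_{ijk}$ for a right-handed frame, forms the cross product as in Eq. (A2.6), evaluates a $3 \times 3$ determinant through the permutation symbol, and tests the first of the contraction identities (A2.8) over all index values.

```python
from itertools import product

def levi_civita(i, j, k):
    """epsilon_ijk for indices in {0, 1, 2}, right-handed frame."""
    return (i - j) * (j - k) * (k - i) // 2

def delta(i, j):
    """Kronecker delta."""
    return 1 if i == j else 0

def cross(a, b):
    """(a x b)_i = epsilon_ijk a_j b_k, as in Eq. (A2.6)."""
    return [sum(levi_civita(i, j, k) * a[j] * b[k]
                for j in range(3) for k in range(3))
            for i in range(3)]

def det3(A):
    """det A = epsilon_ijk A_i1 A_j2 A_k3 (columns stored as A[row][col])."""
    return sum(levi_civita(i, j, k) * A[i][0] * A[j][1] * A[k][2]
               for i, j, k in product(range(3), repeat=3))

def contraction_identity_holds():
    """Check epsilon_ijk epsilon_ilm = delta_jl delta_km - delta_jm delta_kl."""
    for j, k, l, m in product(range(3), repeat=4):
        lhs = sum(levi_civita(i, j, k) * levi_civita(i, l, m) for i in range(3))
        rhs = delta(j, l) * delta(k, m) - delta(j, m) * delta(k, l)
        if lhs != rhs:
            return False
    return True

print(cross([1, 0, 0], [0, 1, 0]), contraction_identity_holds())
```

The closed form used in `levi_civita` is one convenient way to generate the symbol for three indices; a lookup table of the six nonzero entries would serve equally well.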

Exercise 

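Use Kronecker delta and Levi-Civita symbols to prove the following identities

(i) $\mathbf{a}\cdot(\mathbf{b}\times\mathbf{c}) = \mathbf{b}\cdot(\mathbf{c}\times\mathbf{a}) = \mathbf{c}\cdot(\mathbf{a}\times\mathbf{b})$

(ii) $\mathbf{a}\times(\mathbf{b}\times\mathbf{c}) = (\mathbf{a}\cdot\mathbf{c})\mathbf{b} - (\mathbf{a}\cdot\mathbf{b})\mathbf{c}$

(iii) $(\mathbf{a}\times\mathbf{b})\cdot(\mathbf{c}\times\mathbf{d}) = (\mathbf{a}\cdot\mathbf{c})(\mathbf{b}\cdot\mathbf{d}) - (\mathbf{a}\cdot\mathbf{d})(\mathbf{b}\cdot\mathbf{c})$

(iv) $(\mathbf{a}\times\mathbf{b})\times(\mathbf{c}\times\mathbf{d}) = [\mathbf{a},\mathbf{c},\mathbf{d}]\mathbf{b} - [\mathbf{b},\mathbf{c},\mathbf{d}]\mathbf{a} = [\mathbf{a},\mathbf{b},\mathbf{d}]\mathbf{c} - [\mathbf{a},\mathbf{b},\mathbf{c}]\mathbf{d}$
where $[\mathbf{a},\mathbf{b},\mathbf{c}]$ stands for the scalar triple product $\mathbf{a}\cdot(\mathbf{b}\times\mathbf{c})$.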


A2.2 PARTIAL DIFFERENTIATION OF VECTORS AND SCALARS 


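Let $\phi(\mathbf{x})$ be a scalar valued function of a vector argument $\mathbf{x} = x\hat{i} + y\hat{j} + z\hat{k}$, where $(x, y, z)$
are the rectangular Cartesian coordinates of the point defining $\mathbf{x}$ with respect to a given
orthonormal basis $(\hat{i}, \hat{j}, \hat{k})$ of the 3-D Euclidean space. We define an operator $\nabla$ operating
on $\phi$ and yielding a vector $\nabla\phi$ through

$$\nabla\phi = \frac{\partial\phi}{\partial x}\hat{i} + \frac{\partial\phi}{\partial y}\hat{j} + \frac{\partial\phi}{\partial z}\hat{k} \tag{A2.10}$$

and therefore,

$$d\phi = \frac{\partial\phi}{\partial x}dx + \frac{\partial\phi}{\partial y}dy + \frac{\partial\phi}{\partial z}dz = \nabla\phi\cdot d\mathbf{r}$$

For a given magnitude of $d\mathbf{r}$, $d\phi$ is maximum only if $d\mathbf{r}$ is aligned parallel to $\nabla\phi$, or in
other words, the direction of the vector $\nabla\phi$ is that in which the space rate of change of $\phi$
is maximum. This is the reason why $\nabla\phi$ is often called ‘grad $\phi$’ or the gradient of $\phi$. It is
now obvious that

$$(\nabla)_i = \hat{i}\cdot\nabla = \frac{\partial}{\partial x_i} \qquad \text{and} \qquad \frac{\partial x_i}{\partial x_j} = \delta_{ij}$$

which is another definition of $\delta_{ij}$. Note that we have returned to the notation of $(x_1, x_2, x_3)$
for $(x, y, z)$. The divergence of a vector field $\mathbf{A}(\mathbf{x})$ is defined to be the scalar product of the
$\nabla$ operator and the vector field $\mathbf{A}(\mathbf{x})$:

$$\nabla\cdot\mathbf{A} = \frac{\partial A_i}{\partial x_i} \tag{A2.11}$$
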
Carl Gauss proved in 1839 that the divergence of a vector field at a given point represents 
the net flux of the vector field across any arbitrarily small closed surface around the point. 
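The $i$th component of the curl of a vector field $\mathbf{A}(\mathbf{x})$ at a given point is defined by the
vector product of the operator $\nabla$ and $\mathbf{A}$:

$$(\text{curl}\,\mathbf{A})_i = \hat{i}\cdot(\nabla\times\mathbf{A}) = \epsilon_{ijk}\frac{\partial A_k}{\partial x_j} \tag{A2.12}$$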


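In 1854, George Stokes gave a physical meaning to the curl operation. The quantity
corresponding to the curl of a vector is the circulation of the flow of any vector field (for
example, the circulation of fluid flow).

One can now easily prove all the $\nabla$ operator identities, using $\delta_{ij}$ and $\epsilon_{ijk}$. We
demonstrate here with an example.

$$\nabla\cdot(\mathbf{A}\times\mathbf{B}) = \frac{\partial}{\partial x_i}(\epsilon_{ijk}A_jB_k) = \epsilon_{ijk}\frac{\partial A_j}{\partial x_i}B_k + \epsilon_{ijk}A_j\frac{\partial B_k}{\partial x_i}$$

$$= (\nabla\times\mathbf{A})_kB_k - (\nabla\times\mathbf{B})_jA_j = \mathbf{B}\cdot(\nabla\times\mathbf{A}) - \mathbf{A}\cdot(\nabla\times\mathbf{B})$$

where $\mathbf{A}(\mathbf{x})$ and $\mathbf{B}(\mathbf{x})$ are any two vector fields defined over the same region of space.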
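
The identity just derived can also be checked numerically. The following is a rough sketch using central finite differences; the two fields `A` and `B` are arbitrary smooth functions invented for the test, and all helper names are hypothetical.

```python
import math

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def partial(f, x, i, h=1e-5):
    """Central-difference estimate of the derivative of vector field f along x_i."""
    xp, xm = list(x), list(x)
    xp[i] += h
    xm[i] -= h
    return [(p - m) / (2 * h) for p, m in zip(f(xp), f(xm))]

def divergence(f, x):
    return sum(partial(f, x, i)[i] for i in range(3))

def curl(f, x):
    d = [partial(f, x, i) for i in range(3)]  # d[i][k] approximates dF_k/dx_i
    return [d[1][2] - d[2][1], d[2][0] - d[0][2], d[0][1] - d[1][0]]

def A(x):  # arbitrary smooth test field (an assumption of this sketch)
    return [x[1] * x[2], x[0] ** 2, math.sin(x[2])]

def B(x):  # arbitrary smooth test field (an assumption of this sketch)
    return [x[0] * x[1], math.cos(x[0]), x[1] * x[2] ** 2]

def identity_residual(x):
    """div(A x B) - [B . curl A - A . curl B]; should vanish up to discretisation error."""
    lhs = divergence(lambda p: cross(A(p), B(p)), x)
    rhs = dot(B(x), curl(A, x)) - dot(A(x), curl(B, x))
    return lhs - rhs

print(identity_residual([0.3, -0.7, 1.1]))
```

The residual is not exactly zero because the derivatives are approximated, but it should be small compared with the individual terms.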
Exercise 

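Prove the following vector identities involving the $\nabla$ operator

(i) $\nabla(r^n) = nr^{n-2}\mathbf{r}$ (ii) $\nabla\cdot\mathbf{r} = 3$ (iii) $\nabla\times(\boldsymbol{\omega}\times\mathbf{r}) = 2\boldsymbol{\omega}$

(iv) $\nabla\cdot(\phi\mathbf{A}) = (\nabla\phi)\cdot\mathbf{A} + \phi(\nabla\cdot\mathbf{A})$ (v) $\nabla\times(\nabla\phi) = 0$

(vi) $\nabla\times(\phi\mathbf{A}) = (\nabla\phi)\times\mathbf{A} + \phi(\nabla\times\mathbf{A})$ (vii) $\nabla\cdot(\nabla\times\mathbf{A}) = 0$

(viii) $\nabla\times(\mathbf{A}\times\mathbf{B}) = (\mathbf{B}\cdot\nabla)\mathbf{A} - \mathbf{B}(\nabla\cdot\mathbf{A}) - (\mathbf{A}\cdot\nabla)\mathbf{B} + \mathbf{A}(\nabla\cdot\mathbf{B})$

(ix) $\nabla(\mathbf{A}\cdot\mathbf{B}) = (\mathbf{B}\cdot\nabla)\mathbf{A} + (\mathbf{A}\cdot\nabla)\mathbf{B} + \mathbf{B}\times(\nabla\times\mathbf{A}) + \mathbf{A}\times(\nabla\times\mathbf{B})$

Show that if both divergence and curl of a vector field $\mathbf{A}(\mathbf{r})$ are specified, that is, if
$\nabla\cdot\mathbf{A} = \rho(\mathbf{r})$ and $\nabla\times\mathbf{A} = \mathbf{H}(\mathbf{r})$ are given, the solution for $\mathbf{A}$ is unique apart from a trivial constant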
vector and is given by A(r) = r) — ~(r x H). 


A2.3 ORDINARY DIFFERENTIATION OF VECTORS 


Let $\mathbf{r} = \mathbf{r}(t)$ be a vector function depending on a single parameter $t$. One obvious
interpretation of $\mathbf{r}(t)$ in physical situations would be the position vector of a particle as an
explicit function of the time $t$. We can write $\mathbf{r}$ as $(x(t), y(t), z(t))$, where $(x, y, z)$ are the
rectangular Cartesian coordinates of the particle at time $t$. The instantaneous velocity and
acceleration of the particle are then defined as

$$\mathbf{v}(t) = \frac{d\mathbf{r}}{dt} = \left(\frac{dx(t)}{dt}, \frac{dy(t)}{dt}, \frac{dz(t)}{dt}\right) \tag{A2.13}$$

and

$$\mathbf{a}(t) = \frac{d^2\mathbf{r}(t)}{dt^2} = \left(\frac{d^2x(t)}{dt^2}, \frac{d^2y(t)}{dt^2}, \frac{d^2z(t)}{dt^2}\right) \tag{A2.14}$$




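If $\mathbf{A}(t)$ and $\mathbf{B}(t)$ are two vector functions of a single parameter $t$ and $\phi$ is a scalar
valued function, then

$$\frac{d}{dt}(\mathbf{A}\cdot\mathbf{B}) = \frac{d\mathbf{A}}{dt}\cdot\mathbf{B} + \mathbf{A}\cdot\frac{d\mathbf{B}}{dt} \qquad \frac{d}{dt}(\mathbf{A}\times\mathbf{B}) = \frac{d\mathbf{A}}{dt}\times\mathbf{B} + \mathbf{A}\times\frac{d\mathbf{B}}{dt}$$

$$\frac{d}{dt}(\phi\mathbf{A}) = \frac{d\phi}{dt}\mathbf{A} + \phi\frac{d\mathbf{A}}{dt} \tag{A2.15}$$

From the Eqs (A2.15), it is now easy to see the following vector conditions for constancy of
vectors and of their magnitudes:

(i) if $d\mathbf{A}/dt = 0$, then $\mathbf{A}$ = const.,

(ii) if $\mathbf{A}\cdot(d\mathbf{A}/dt) = 0$, then $|\mathbf{A}|$ = const., and

(iii) if $\mathbf{A}\times(d\mathbf{A}/dt) = 0$, then $d\mathbf{A}/dt$ is parallel to $\mathbf{A}$, implying that $\mathbf{A}$ has constant
direction.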


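Further, we have for $\mathbf{A} = \mathbf{A}(\mathbf{r}, t)$

$$d\mathbf{A} = \frac{\partial\mathbf{A}}{\partial x}dx + \frac{\partial\mathbf{A}}{\partial y}dy + \frac{\partial\mathbf{A}}{\partial z}dz + \frac{\partial\mathbf{A}}{\partial t}dt = (d\mathbf{r}\cdot\nabla)\mathbf{A} + \frac{\partial\mathbf{A}}{\partial t}dt$$

or

$$dA_i = \frac{\partial A_i}{\partial x_j}dx_j + \frac{\partial A_i}{\partial t}dt \tag{A2.16}$$


A2.4 VECTOR INTEGRATION
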


This involves integration of a vector field specified over a region of space in $R^3$, or a region on
a surface, or a segment of a curve. These integrals are expressed as multiple integrals over
the parameters describing the regions of space, which are then evaluated. We shall not pause
here to detail this theory of integration and the important integral theorems like Green’s
theorem (1828), Gauss’ divergence theorem (1838) and Stokes’ theorem (1854). We merely
give the statements of the above theorems. Their proofs and the problems based on them
have to be dealt with as a part of an independent course on vector calculus. In everything that
follows we shall assume that all functions are $C^1$, that is, these functions have continuous
partial derivatives.


Green’s Theorem

Let

$$\mathbf{F}(x, y) = (P(x, y), Q(x, y))$$

denote a vector field defined over a region in a plane. Let $C : [a, b] \to R^2$ be a curve in
this region parametrised by a parameter $t$. Then we have

$$\int_C \mathbf{F}\cdot d\mathbf{r} = \int_a^b \mathbf{F}(C(t))\cdot C'(t)\,dt = \int_C P(x, y)\,dx + Q(x, y)\,dy \tag{A2.17}$$

The last integral is abbreviated as $\int_C P\,dx + Q\,dy$.

Now the statement of Green’s theorem is as follows:

Let $P$, $Q$ be $C^1$ functions on a region $A$ which is the interior of a closed piecewise $C^1$ path




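Stokes’ Theorem:

Let $S$ be a smooth surface in $R^3$ bounded by a closed curve $C$. Assume that the surface
is orientable and that the boundary curve is oriented so that the surface lies to the left of
the curve. Let $\mathbf{F}$ be a $C^1$ vector field in a region containing $S$ and its boundary. Then

$$\int_S (\nabla\times\mathbf{F})\cdot\hat{n}\,da = \oint_C \mathbf{F}\cdot d\mathbf{l} \tag{A2.21}$$

There are other usable forms of Gauss’ and Stokes’ theorems, namely

$$\int_V (\nabla\times\mathbf{F})\,dV = \oint_S d\mathbf{S}\times\mathbf{F} \qquad \text{and} \qquad \oint_C \phi\,d\mathbf{r} = \int_S d\mathbf{S}\times\nabla\phi \tag{A2.22}$$

Also Green’s theorem can be developed into two identities, namely

$$\int_V (\phi\nabla^2\psi + \nabla\phi\cdot\nabla\psi)\,dV = \oint_S (\phi\nabla\psi)\cdot d\mathbf{S} \tag{A2.23}$$

and

$$\int_V (\phi\nabla^2\psi - \psi\nabla^2\phi)\,dV = \oint_S (\phi\nabla\psi - \psi\nabla\phi)\cdot d\mathbf{S} \tag{A2.24}$$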


A2.5 TANGENT, PRINCIPAL NORMAL AND BINORMAL OF ORBITS 

Let $\mathbf{r}(s)$ represent the orbit of a particle, which is a curve in the 3-D Euclidean space
parametrised by $s$, the arc length measured along the curve from some chosen point on
the orbit. Any neighbouring point on it has a position vector, say $\mathbf{r} + d\mathbf{r}$. Obviously,
$|d\mathbf{r}| = ds$, and hence $d\mathbf{r}/ds$ is a unit vector pointing along the tangent at the point and
toward the increasing value of $s$. Thus, the unit vector (see Fig. A.4)

$$\hat{t} = \frac{d\mathbf{r}}{ds} \tag{A2.25}$$

evaluated at a point on the orbit having the parameter value $s = s_0$ defines the direction
of the tangent to the curve at the point $\mathbf{r}(s_0)$. It is called the unit tangent vector.


Now since $\hat{t}\cdot\hat{t} = 1$, differentiation with respect to $s$ gives

$$\hat{t}\cdot\frac{d\hat{t}}{ds} = 0$$

which means that $d\hat{t}/ds$ is perpendicular to $\hat{t}$. Thus we can write

$$\frac{d\hat{t}}{ds} = \kappa\hat{n} \tag{A2.26}$$

where $\hat{n}$ is a unit vector in the direction of $d\hat{t}/ds$ and hence perpendicular to $\hat{t}$. This
definition of $\hat{n}$ also suggests that $\hat{n}$ must lie in the tangent plane passing through an
infinitesimal segment of the curve at the point concerned. Hence $\kappa = |d\hat{t}/ds|$ is called
the curvature of the orbit or of the curve $\mathbf{r}(s)$. $\kappa^{-1}$ would be, by definition, the radius of
curvature of the orbit. $\hat{n}$ is called the principal normal unit vector.

Fig. A.4 Directions of unit tangent, unit normal, and unit binormal vectors
at any point of the trajectory of a particle in 3-D space

Exercise 

Show that if an orbit lies in a plane and is described by a curve $\mathbf{r}(s)$ in terms of arc length
$s$ and its slope angle $\theta$, the curvature $\kappa = |d\hat{t}/ds| = d\theta/ds$.

Since we have been able to define two mutually orthogonal unit vectors at a point, the
third unit vector orthogonal to both these unit vectors can be uniquely defined through the
vector product $\hat{t}\times\hat{n}$. This is how the binormal unit vector $\hat{b}$ is defined:

$$\hat{b} = \hat{t}\times\hat{n}$$

Again, using the fact that $\hat{b}\cdot\hat{b} = 1$, we get $0 = \hat{b}\cdot(d\hat{b}/ds)$, implying that $d\hat{b}/ds$ is
perpendicular to $\hat{b}$. Also, since $\hat{t}\cdot\hat{b} = 0$, we get on differentiation

$$\frac{d\hat{t}}{ds}\cdot\hat{b} + \hat{t}\cdot\frac{d\hat{b}}{ds} = 0$$

of which the first term vanishes (because $d\hat{t}/ds = \kappa\hat{n}$ is perpendicular to $\hat{b}$), proving finally
the orthogonality of $\hat{t}$ and $d\hat{b}/ds$. This means that $d\hat{b}/ds$ must be parallel to $\hat{n}$ and we can write

$$\frac{d\hat{b}}{ds} = \tau\hat{n} \tag{A2.27}$$

The above equation defines another scalar $\tau$, which is called the torsion of the
orbit or curve $\mathbf{r}(s)$ at the given point. Further, since $\hat{t}$, $\hat{n}$ and $\hat{b}$ form an orthonormal
triad

$$\hat{n} = \hat{b}\times\hat{t}$$

we get

$$\frac{d\hat{n}}{ds} = \frac{d\hat{b}}{ds}\times\hat{t} + \hat{b}\times\frac{d\hat{t}}{ds} = -\tau\hat{b} - \kappa\hat{t} \tag{A2.28}$$

Equations (A2.25) to (A2.28) are known as the Frenet-Serret formulae.
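
To see these formulae at work, here is a small numerical sketch for a circular helix. The helix, its radius `a` and pitch parameter `c`, and all function names are assumptions made only for this illustration; derivatives are approximated by central differences. With the sign convention of Eq. (A2.27), $d\hat{b}/ds = \tau\hat{n}$, a right-handed helix comes out with negative torsion; many texts define the torsion with the opposite sign.

```python
import math

def helix(s, a=2.0, c=1.0):
    """Right-handed helix parametrised by its arc length s."""
    L = math.hypot(a, c)
    return [a * math.cos(s / L), a * math.sin(s / L), c * s / L]

def deriv(f, s, h=1e-3):
    """Central-difference derivative of a vector-valued function of s."""
    return [(p - m) / (2 * h) for p, m in zip(f(s + h), f(s - h))]

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def tangent(s):
    return deriv(helix, s)  # t = dr/ds, Eq. (A2.25)

def frame(s):
    """Return (t, n, b, kappa) at arc length s."""
    t = tangent(s)
    dt = deriv(tangent, s)
    kappa = math.sqrt(dot(dt, dt))       # kappa = |dt/ds|
    n = [comp / kappa for comp in dt]    # dt/ds = kappa n, Eq. (A2.26)
    b = cross(t, n)                      # b = t x n
    return t, n, b, kappa

def torsion(s):
    """tau from db/ds = tau n, the sign convention of Eq. (A2.27)."""
    db = deriv(lambda u: frame(u)[2], s)
    return dot(db, frame(s)[1])

t, n, b, kappa = frame(1.3)
print(kappa, torsion(1.3))
```

For this helix the exact values are $\kappa = a/(a^2 + c^2)$ and, in the present convention, $\tau = -c/(a^2 + c^2)$, which the finite-difference estimates should approach.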

We shall now use these formulae to study an important class of curves called evolutes
and involutes. At the moment, these concepts may appear quite abstract, but they will find a
place in chapter 7.

Definition 

If there is a one-to-one correspondence between the points of two curves $C_1$ and $C_2$ such
that the tangent at any point of $C_1$ is a normal at the corresponding point of $C_2$, then $C_1$
is called an evolute of $C_2$, and $C_2$ an involute of $C_1$.


Suppose the equation for the evolute curve $C_1$ is given as $\mathbf{r} = \mathbf{f}(s)$. We want to find
out the equation for its involute $C_2$. If the distance $P_1P_2$ (see Fig. A.5) is taken to
be $u$, the position vector $OP_2$ will be $\mathbf{R} = \mathbf{r} + u\hat{t}$, where $\mathbf{r} = \mathbf{f}(s)$ and $\hat{t} = d\mathbf{r}/ds$.
Therefore,

$$\frac{d\mathbf{R}}{ds} = \left(1 + \frac{du}{ds}\right)\hat{t} + u\kappa\hat{n}$$

Since $d\mathbf{R}/ds$ is parallel to the tangent at $P_2$, which is normal to $\hat{t}$, $d\mathbf{R}/ds$ must be
perpendicular to $\hat{t}$. Therefore,

$$1 + \frac{du}{ds} = 0 \qquad \text{or} \qquad u = u_0 - s \tag{A2.29}$$

Hence, the equation for the involute will be

$$\mathbf{R} = \mathbf{f}(s) + (u_0 - s)\hat{t}$$

for any given evolute $\mathbf{r} = \mathbf{f}(s)$. Actually, for each value of $u_0$, there will be an involute.
So for a given evolute, there exists a family of an infinite number of involutes. The same is
true for a given involute.
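
The construction can be tried out on the simplest evolute, a circle. The sketch below is a hedged illustration with hypothetical function names and an arbitrarily chosen radius: it builds the involute $\mathbf{R} = \mathbf{f}(s) + (u_0 - s)\hat{t}$ and checks numerically that $d\mathbf{R}/ds$ is perpendicular to the tangent $\hat{t}$ of the circle, as the derivation requires.

```python
import math

def evolute(s, a=1.0):
    """Circle of radius a, parametrised by its arc length s."""
    return (a * math.cos(s / a), a * math.sin(s / a))

def unit_tangent(s, a=1.0):
    """Unit tangent t = df/ds of the circle."""
    return (-math.sin(s / a), math.cos(s / a))

def involute(s, u0, a=1.0):
    """R = f(s) + (u0 - s) t, following Eq. (A2.29)."""
    f, t = evolute(s, a), unit_tangent(s, a)
    return (f[0] + (u0 - s) * t[0], f[1] + (u0 - s) * t[1])

def tangency_residual(s, u0, a=1.0, h=1e-6):
    """Dot product of dR/ds (central difference) with t; should be close to zero."""
    Rp, Rm = involute(s + h, u0, a), involute(s - h, u0, a)
    dR = ((Rp[0] - Rm[0]) / (2 * h), (Rp[1] - Rm[1]) / (2 * h))
    t = unit_tangent(s, a)
    return dR[0] * t[0] + dR[1] * t[1]

print(involute(0.5, 3.0), tangency_residual(0.5, 3.0))
```

Changing `u0` picks out a different member of the family of involutes of the same circle.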

The simplest realisation of involutes to a given evolute is the case of winding strings on the
surface of any object. The open end of the string, if forced to remain stretched during the
process of winding, will describe an involute to the curve on the body traced by the winding
thread. The latter is an evolute as the string touches it tangentially, and the open end of the
string must move in a direction perpendicular to the string itself. Equation (A2.29) suggests
that the length of the string $u$ used up in winding is just equal to the increase in arc length $s$
of the evolute along which the winding takes place. So the involute of a circle is a spiral (see
problem 1.39). In fact, there are families of curves such that both the evolutes and involutes
belong to the same family, such as cycloids, hypocycloids and epicycloids. The involutes of
any cycloid are themselves cycloids. Such families of self-replicating evolute-involute curves
are said to form tesserals. We shall see more of them in chapter 7.

Fig. A.5 A construction for finding the equation of an involute $C_2$ for a
given evolute $C_1$ and vice versa


A2.6 KINEMATICS OF PARTICLE MOTION 


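We can now express the kinematical quantities like velocity, acceleration and jerk of a
particle in terms of the complete set of orthonormal basis vectors $(\hat{t}, \hat{n}, \hat{b})$. Let the curve
traced by the particle be $\mathbf{r}(s)$, where $s$ is the measure of the arc length along the curve.
We then have for the velocity

$$\mathbf{v} = \frac{d\mathbf{r}}{dt} = \frac{d\mathbf{r}}{ds}\frac{ds}{dt} = \frac{ds}{dt}\hat{t} = v\hat{t} \tag{A2.30}$$

and the acceleration

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = \frac{d^2s}{dt^2}\hat{t} + \frac{ds}{dt}\frac{d\hat{t}}{ds}\frac{ds}{dt} = \frac{dv}{dt}\hat{t} + v^2\kappa\hat{n} \tag{A2.31}$$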




where $v = ds/dt$ is the speed or the magnitude of velocity of the particle. Similarly, the
‘jerk’ is

$$\mathbf{j} = \frac{d^3\mathbf{r}}{dt^3} = \left(\frac{d^2v}{dt^2} - \kappa^2v^3\right)\hat{t} + \left(3\kappa v\frac{dv}{dt} + v^2\frac{d\kappa}{dt}\right)\hat{n} - \kappa\tau v^3\hat{b} \tag{A2.32}$$

So one can easily see that the acceleration has two components: one is due
to the acceleration in the direction of motion, and the other is the centripetal
acceleration, and this is valid for all possible trajectories of the particle. The acceleration
does not involve the torsion of the orbit, but the jerk does.

The values of $\kappa$ and $\tau$ can be obtained from the following two expressions

\dr cPr d 3 rl , . 

rF-y"" U2 - 33) 


I dr (Pr I , , 

|s x M - , " i (A2M) 

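As an illustration, we consider the motion of a charged particle in the field of a constant
magnetic induction $\mathbf{B}$. We wish to find the curvature and the torsion of the spiralling path
of the charged particle at any instant. The equation of motion is

$$m\frac{d\mathbf{v}}{dt} = \frac{e}{c}(\mathbf{v}\times\mathbf{B})$$

which implies

$$\mathbf{v}\cdot\frac{d\mathbf{v}}{dt} = 0$$

so that $|\mathbf{v}|$ = constant = $v_0$, say. The solution to the equation of motion is

$$\mathbf{v} = \mathbf{v}_0 + \frac{e}{mc}\{(\mathbf{r} - \mathbf{r}_0)\times\mathbf{B}\}$$

giving

$$\mathbf{v}\cdot\mathbf{B} = \mathbf{v}_0\cdot\mathbf{B} = v_0B\cos\theta$$

where $\theta$ is the angle between $\mathbf{v}_0$ and $\mathbf{B}$. Taking the vector product with $\mathbf{v}$ on
both sides of the original equation of motion we get

$$\mathbf{v}\times\frac{d\mathbf{v}}{dt} = \frac{e}{mc}\,\mathbf{v}\times(\mathbf{v}\times\mathbf{B}) = \frac{e}{mc}\left[(\mathbf{v}\cdot\mathbf{B})\mathbf{v} - v^2\mathbf{B}\right]$$

Similarly, differentiating the same equation of motion once with respect to $t$, we get

$$\mathbf{j} = \frac{d^2\mathbf{v}}{dt^2} = \frac{e}{mc}\left(\frac{d\mathbf{v}}{dt}\times\mathbf{B}\right) = \frac{e^2}{m^2c^2}\left[(\mathbf{v}\cdot\mathbf{B})\mathbf{B} - B^2\mathbf{v}\right]$$

We can now obtain the curvature ($\kappa$) and the torsion ($\tau$) as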




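Taylor’s Series Expansion of r(s), Where s is Small

Let $\mathbf{r}(s)$ be a vector function of the scalar arc length parameter $s$. Then $\mathbf{r}(s)$ can be
expanded for small $s$ around $s = 0$ as

$$\mathbf{r}(s) = \mathbf{r}(0) + s\left.\frac{d\mathbf{r}}{ds}\right|_{s=0} + \frac{s^2}{2!}\left.\frac{d^2\mathbf{r}}{ds^2}\right|_{s=0} + \frac{s^3}{3!}\left.\frac{d^3\mathbf{r}}{ds^3}\right|_{s=0} + \cdots$$

$$= \mathbf{r}(0) + s\hat{t} + \frac{s^2}{2}\kappa\hat{n} + \frac{s^3}{6}\left[-\kappa^2\hat{t} + \frac{d\kappa}{ds}\hat{n} - \kappa\tau\hat{b}\right] + \cdots \tag{A2.36}$$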


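A2.7 KINEMATICS IN SPHERICAL POLAR AND OTHER COORDINATE FRAMES

Figure A.2 shows the orthogonal basis vectors $(\hat{r}, \hat{\theta}, \hat{\phi})$ for the spherical polar
coordinates $(r, \theta, \phi)$ respectively. The relation between the set $(\hat{r}, \hat{\theta}, \hat{\phi})$ and $(\hat{i}, \hat{j}, \hat{k})$, the set of
Cartesian basis vectors, is given in Eqs (A1.5) - (A1.7). On differentiation with respect to time, one
gets

$$\dot{\hat{r}} = \dot\theta\hat{\theta} + \sin\theta\,\dot\phi\hat{\phi} \qquad \dot{\hat{\theta}} = -\dot\theta\hat{r} + \cos\theta\,\dot\phi\hat{\phi} \qquad \text{and} \qquad \dot{\hat{\phi}} = -\dot\phi\,[\sin\theta\,\hat{r} + \cos\theta\,\hat{\theta}] \tag{A2.37}$$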


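Thus we can express the velocity and acceleration of a particle in spherical polar coordinates as

$$\mathbf{v} = \frac{d}{dt}(r\hat{r}) = \dot r\hat{r} + r\dot\theta\hat{\theta} + r\sin\theta\,\dot\phi\hat{\phi} \tag{A2.38}$$

and

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = (\ddot r - r\dot\theta^2 - r\sin^2\theta\,\dot\phi^2)\hat{r} + (r\ddot\theta + 2\dot r\dot\theta - r\sin\theta\cos\theta\,\dot\phi^2)\hat{\theta}$$

$$+ (r\sin\theta\,\ddot\phi + 2\dot r\dot\phi\sin\theta + 2r\dot\theta\dot\phi\cos\theta)\hat{\phi} \tag{A2.39}$$

The kinetic energy ($T$) and the moment of momentum ($\mathbf{L}$) can be written as

$$T = \tfrac{1}{2}mv^2 = \tfrac{1}{2}m[\dot r^2 + r^2\dot\theta^2 + r^2\sin^2\theta\,\dot\phi^2] \tag{A2.40}$$

$$\mathbf{L} = m\mathbf{r}\times\mathbf{v} = -mr^2\sin\theta\,\dot\phi\hat{\theta} + mr^2\dot\theta\hat{\phi} \tag{A2.41}$$

so that

$$L^2 = m^2r^4(\dot\theta^2 + \dot\phi^2\sin^2\theta) \tag{A2.42}$$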
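
As a sanity check on the spherical polar expressions for $T$ and $L^2$, the sketch below compares them with values computed directly from Cartesian coordinates by finite differences. The trajectory, the unit mass and all function names are hypothetical choices made for this illustration only.

```python
import math

def position(t):
    """Hypothetical trajectory in spherical polars: returns (r, theta, phi)."""
    return 1.0 + 0.5 * t, 0.8 + 0.3 * t, 2.0 * t

def rates(t):
    """Time derivatives (r_dot, theta_dot, phi_dot) of the trajectory above."""
    return 0.5, 0.3, 2.0

def cartesian(t):
    r, th, ph = position(t)
    return [r * math.sin(th) * math.cos(ph),
            r * math.sin(th) * math.sin(ph),
            r * math.cos(th)]

def cartesian_velocity(t, h=1e-6):
    return [(p - q) / (2 * h) for p, q in zip(cartesian(t + h), cartesian(t - h))]

def kinetic_energy_spherical(t, m=1.0):
    r, th, ph = position(t)
    rd, thd, phd = rates(t)
    return 0.5 * m * (rd**2 + r**2 * thd**2 + r**2 * math.sin(th)**2 * phd**2)

def kinetic_energy_cartesian(t, m=1.0):
    v = cartesian_velocity(t)
    return 0.5 * m * sum(c * c for c in v)

def ang_mom_sq_spherical(t, m=1.0):
    r, th, ph = position(t)
    rd, thd, phd = rates(t)
    return m**2 * r**4 * (thd**2 + phd**2 * math.sin(th)**2)

def ang_mom_sq_cartesian(t, m=1.0):
    x, v = cartesian(t), cartesian_velocity(t)
    L = [m * (x[1] * v[2] - x[2] * v[1]),
         m * (x[2] * v[0] - x[0] * v[2]),
         m * (x[0] * v[1] - x[1] * v[0])]
    return sum(c * c for c in L)

print(kinetic_energy_spherical(0.4), kinetic_energy_cartesian(0.4))
print(ang_mom_sq_spherical(0.4), ang_mom_sq_cartesian(0.4))
```

The two routes should agree up to the small error of the finite-difference velocity.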

The above expressions for various kinematical quantities can be reduced to the special case
of plane polar coordinates by forcing $\theta = \pi/2$ (the equatorial plane) and setting $r = \rho$.
Hence the velocity and acceleration in plane polar coordinates become

$$\mathbf{v} = \dot\rho\hat{\rho} + \rho\dot\phi\hat{\phi}$$

and

$$\mathbf{a} = (\ddot\rho - \rho\dot\phi^2)\hat{\rho} + (\rho\ddot\phi + 2\dot\rho\dot\phi)\hat{\phi} \tag{A2.43}$$




Note that for simple rotation about the z-axis, with $\dot\rho = 0$ and $\dot\phi\hat{k} = \boldsymbol{\omega}$ (a constant vector,
called the angular velocity vector), one obtains

$$\mathbf{v} = \boldsymbol{\omega}\times\boldsymbol{\rho} \tag{A2.44}$$

In the expression for acceleration (A2.43), one can easily recognise the radial, centripetal,
Euler (actually negative) and the Coriolis (actually negative) terms as one goes from left
to right.

Note also that acceleration in spherical polar coordinates no longer has any simple form
like that in the Cartesian coordinates. Thus the acceleration in the Cartesian coordinates
$\mathbf{a} = \ddot x\hat{i} + \ddot y\hat{j} + \ddot z\hat{k}$ does not take an equivalent simple form, namely $\ddot r\hat{r} + r\ddot\theta\hat{\theta} + r\sin\theta\,\ddot\phi\hat{\phi}$.

The fictitious forces like the centripetal, Euler and Coriolis forces appear for the rotating
frame. We shall see more of it in chapter 3.

The expressions for $\mathbf{r}$, $\mathbf{v}$ and $\mathbf{a}$ in cylindrical polar coordinates can be obtained by
adding $z\hat{k}$, $\dot z\hat{k}$ and $\ddot z\hat{k}$ to the respective expressions obtained in plane polar coordinates.

A few other useful vectorial quantities are defined below.

Areal Velocity 

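Given any two vectors $\mathbf{A}$ and $\mathbf{B}$, $\mathbf{A}\times\mathbf{B}$ gives the area of the parallelogram spanned by
$\mathbf{A}$ and $\mathbf{B}$. Hence $\mathbf{A}\times\mathbf{B}$ is called the ‘area vector’. This is an axial vector since its sign
depends on the sense of rotation or the handedness of the frame used to define it. In Fig.
A.6, $\mathbf{r}$ and $\mathbf{r} + d\mathbf{r}$ are the position vectors of two adjacent positions of the particle on its
path. By definition, $\mathbf{r}\times d\mathbf{r}$ = area of the parallelogram spanned by $\mathbf{r}$ and $d\mathbf{r}$. The areal
velocity is therefore given by

$$\text{Areal velocity} = \frac{d\mathbf{A}}{dt} = \frac{1}{2}\,\mathbf{r}\times\frac{d\mathbf{r}}{dt} = \frac{1}{2}(\mathbf{r}\times\mathbf{v}) = \frac{\mathbf{L}}{2m} \tag{A2.45}$$

Except for the factor $1/2m$, areal velocity is the kinematical counterpart of the moment
of momentum $\mathbf{L}$.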

Solid Angle 

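Referring to Fig. A.7 we have an element of area vector $d\mathbf{S}$ characterised by its magnitude
$dS$ and a direction chosen to be a normal $\hat{n}$ drawn on the surface element:

$$d\mathbf{S} = \hat{n}\,dS = dS_x\hat{i} + dS_y\hat{j} + dS_z\hat{k} \tag{A2.46}$$

where

$$dS_x = dS\cos\alpha \qquad dS_y = dS\cos\beta \qquad dS_z = dS\cos\gamma \qquad \hat{n} = \cos\alpha\,\hat{i} + \cos\beta\,\hat{j} + \cos\gamma\,\hat{k}$$

and $\cos\alpha$, $\cos\beta$ and $\cos\gamma$ are the direction cosines for $\hat{n}$.

Now, $\mathbf{r}\cdot d\mathbf{S} = dS(\hat{n}\cdot\mathbf{r}) = r\,dS\cos\theta$, where $\theta = \angle(\hat{n}, \mathbf{r})$. If $dS_n$ is the projection of $dS$
onto the plane perpendicular to $\mathbf{r}$, the solid angle $d\Omega$ subtended by $dS$ at O is defined by

$$d\Omega = \frac{dS_n}{r^2} = \frac{dS\cos\theta}{r^2} = \frac{\mathbf{r}\cdot d\mathbf{S}}{r^3} \tag{A2.47}$$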
Fig. A.6 Definition of an infinitesimal area vector, spanned by the changing
position vector

Fig. A.7 An element of solid angle subtended by a given
area element with respect to a given point, say O

A2.8 VECTORS IN ORTHOGONAL CURVILINEAR COORDINATE SYSTEMS

A system of curvilinear coordinates, say $(u, v, w)$, given as functions of rectangular Cartesian
coordinates $(x, y, z) \equiv \mathbf{r}$ in the form of either

$$u = u(\mathbf{r}) \qquad v = v(\mathbf{r}) \qquad w = w(\mathbf{r}) \qquad \text{or the inverse relations} \qquad \mathbf{r} = \mathbf{r}(u, v, w) \tag{A2.48}$$

is said to be orthogonal if the three coordinate surfaces (or equivalently the coordinate lines)
at every point are mutually perpendicular (see Fig. A.8). The relations (A2.48) are assumed
to be single-valued, continuous, continuous in first partial derivatives and with a nonzero
Jacobian determinant. Such transformation relations between two sets of coordinates are
called admissible coordinate transformations defined over a given region of space. Just to
refresh our memory, given any point, one can draw a curve passing through the point in such
a way that only one of the three curvilinear coordinates changes along the curve and the
values of the other two coordinates remain constant. For three curvilinear coordinates one
can draw three such curves, all passing through the given point and mutually intersecting
at right angles. Each is called a coordinate curve, and the surface passing through the point
and showing a constant value of a particular coordinate is called a coordinate surface.





Fig. A.8 The network of coordinate lines and coordinate surfaces 
at any arbitrary point, defining a curvilinear coordinate 
system 


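Let the equations for the coordinate curves at a point $(u = u_0, v = v_0, w = w_0)$ be

$$\mathbf{r} = \mathbf{r}(u, v_0, w_0) \qquad \mathbf{r} = \mathbf{r}(u_0, v, w_0) \qquad \text{and} \qquad \mathbf{r} = \mathbf{r}(u_0, v_0, w) \tag{A2.49}$$

The tangents to the coordinate curves are parallel to the vectors

$$\frac{\partial\mathbf{r}(u, v_0, w_0)}{\partial u}, \qquad \frac{\partial\mathbf{r}(u_0, v, w_0)}{\partial v}, \qquad \frac{\partial\mathbf{r}(u_0, v_0, w)}{\partial w}$$

respectively. Orthogonality of these vectors requires that

$$\frac{\partial\mathbf{r}}{\partial u}\cdot\frac{\partial\mathbf{r}}{\partial v} = \frac{\partial\mathbf{r}}{\partial u}\cdot\frac{\partial\mathbf{r}}{\partial w} = \frac{\partial\mathbf{r}}{\partial v}\cdot\frac{\partial\mathbf{r}}{\partial w} = 0 \tag{A2.50}$$

The differential displacement along any arbitrary path of a particle is given by

$$d\mathbf{r} = \frac{\partial\mathbf{r}}{\partial u}du + \frac{\partial\mathbf{r}}{\partial v}dv + \frac{\partial\mathbf{r}}{\partial w}dw \tag{A2.51}$$

thus giving the differential length of the path in the form of the line element

$$ds^2 = d\mathbf{r}\cdot d\mathbf{r} = \left(\frac{\partial\mathbf{r}}{\partial u}\right)^2du^2 + \left(\frac{\partial\mathbf{r}}{\partial v}\right)^2dv^2 + \left(\frac{\partial\mathbf{r}}{\partial w}\right)^2dw^2$$

$$= h_1^2\,du^2 + h_2^2\,dv^2 + h_3^2\,dw^2 \tag{A2.52}$$

where $h_1$, $h_2$ and $h_3$ are defined through the above equation.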


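Again let the coordinate surfaces be

$$u = u(x, y, z) \qquad v = v(x, y, z) \qquad \text{and} \qquad w = w(x, y, z)$$

where $(x, y, z)$ are the Cartesian coordinates with respect to some rectangular Cartesian
frame. The normals to the coordinate surfaces are given by $(\nabla u, \nabla v, \nabla w)$, which owing to
the orthogonality, must satisfy

$$0 = (\nabla u)\cdot(\nabla v) = (\nabla u)\cdot(\nabla w) = (\nabla v)\cdot(\nabla w) \tag{A2.53}$$

We can now take the fundamental triad as

$$\hat{a} = \frac{\nabla u}{|\nabla u|} \qquad \hat{b} = \frac{\nabla v}{|\nabla v|} \qquad \text{and} \qquad \hat{c} = \frac{\nabla w}{|\nabla w|} \tag{A2.54}$$

Let $d\mathbf{r}_1$, $d\mathbf{r}_2$, and $d\mathbf{r}_3$ be the differential displacements along the $u$, $v$ and $w$ coordinate
lines respectively. We then have

$$(\nabla u)\cdot(d\mathbf{r}_1) = |\nabla u||d\mathbf{r}_1| = du$$

$$(\nabla v)\cdot(d\mathbf{r}_2) = |\nabla v||d\mathbf{r}_2| = dv$$

and

$$(\nabla w)\cdot(d\mathbf{r}_3) = |\nabla w||d\mathbf{r}_3| = dw$$

Further,

$$ds^2 = |d\mathbf{r}|^2 = dr_1^2 + dr_2^2 + dr_3^2 = \frac{du^2}{|\nabla u|^2} + \frac{dv^2}{|\nabla v|^2} + \frac{dw^2}{|\nabla w|^2} \tag{A2.55}$$

Comparing the Eqs (A2.52) and (A2.55) we get

$$h_1 = \left|\frac{\partial\mathbf{r}}{\partial u}\right| = \frac{1}{|\nabla u|} \qquad \text{implying} \quad \hat{a} = h_1\nabla u$$

$$h_2 = \left|\frac{\partial\mathbf{r}}{\partial v}\right| = \frac{1}{|\nabla v|} \qquad \text{implying} \quad \hat{b} = h_2\nabla v \tag{A2.56}$$

$$h_3 = \left|\frac{\partial\mathbf{r}}{\partial w}\right| = \frac{1}{|\nabla w|} \qquad \text{implying} \quad \hat{c} = h_3\nabla w$$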
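
The scale factors can be estimated numerically straight from their definition $h_i = |\partial\mathbf{r}/\partial u_i|$. The sketch below is illustrative only; the function names are hypothetical and the derivative is a central difference. It does this for spherical polar coordinates, where the familiar values $h_r = 1$, $h_\theta = r$ and $h_\phi = r\sin\theta$ should be recovered.

```python
import math

def spherical_to_cartesian(u):
    """Map (r, theta, phi) to Cartesian (x, y, z)."""
    r, th, ph = u
    return [r * math.sin(th) * math.cos(ph),
            r * math.sin(th) * math.sin(ph),
            r * math.cos(th)]

def scale_factors(transform, u, h=1e-6):
    """Return [h_1, h_2, h_3] with h_i = |dr/du_i|, by central differences."""
    factors = []
    for i in range(3):
        up, um = list(u), list(u)
        up[i] += h
        um[i] -= h
        d = [(p - q) / (2 * h) for p, q in zip(transform(up), transform(um))]
        factors.append(math.sqrt(sum(c * c for c in d)))
    return factors

print(scale_factors(spherical_to_cartesian, [2.0, 0.7, 1.2]))
```

Any other admissible transformation can be passed in place of `spherical_to_cartesian`, for instance cylindrical polar coordinates.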





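The Laplacian operator in the curvilinear coordinates becomes

$$\nabla^2\phi = \nabla\cdot(\nabla\phi) = \frac{1}{h_1h_2h_3}\left[\frac{\partial}{\partial u}\left(\frac{h_2h_3}{h_1}\frac{\partial\phi}{\partial u}\right) + \frac{\partial}{\partial v}\left(\frac{h_3h_1}{h_2}\frac{\partial\phi}{\partial v}\right) + \frac{\partial}{\partial w}\left(\frac{h_1h_2}{h_3}\frac{\partial\phi}{\partial w}\right)\right] \tag{A2.60}$$

The volume element $dV$ in terms of curvilinear coordinates is

$$dV = |d\mathbf{r}_1||d\mathbf{r}_2||d\mathbf{r}_3| = h_1h_2h_3\,du\,dv\,dw \tag{A2.61}$$

Exercise

Obtain the expressions for $\nabla\phi$, $\nabla\cdot\mathbf{f}$, $\nabla\times\mathbf{f}$, $\nabla^2\phi$ and $dV$ in terms of spherical polar
and cylindrical polar coordinates.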


A2.9 VECTORS IN GENERAL CURVILINEAR COORDINATES 


Since we shall now be dealing with covariant and contravariant vectors, let us use the
curvilinear coordinates $u^i(x^1, x^2, x^3)$ as functions of rectangular Cartesian coordinates $(x^1, x^2, x^3)$
with superscript notations for the assumed contravariant nature of the coordinates, even
though for rectangular Cartesian coordinates this distinction does not exist. Let us assume
further that transformations between the two sets of coordinates are admissible. An
infinitesimal displacement or separation between two points at location P defined by the
rectangular Cartesian position vector $\mathbf{r}$ can be given as

$$d\mathbf{r} = \frac{\partial\mathbf{r}}{\partial u^i}du^i = \mathbf{g}_i\,du^i \tag{A2.62}$$

where the $\mathbf{g}_i$’s are defined through the above equation, and are called the covariant base vectors
at point P.

These must be the base vectors for the (differential) curvilinear coordinates $du^i$ because
the vector $d\mathbf{r}$ can be interpreted as having three components $(du^1, du^2, du^3)$ along the set
of base vectors $\mathbf{g}_i$ only. Now the Euclidean line element $ds^2$ becomes

$$ds^2 = d\mathbf{r}\cdot d\mathbf{r} = (\mathbf{g}_j\,du^j)\cdot(\mathbf{g}_k\,du^k) = (\mathbf{g}_j\cdot\mathbf{g}_k)\,du^j\,du^k = g_{jk}\,du^j\,du^k \tag{A2.63}$$

It is easy to see that the $g_{jk}$ as defined in Eq. (A2.63) are nine functions of the curvilinear
coordinates. As a $3 \times 3$ matrix, it can readily be checked that it is a symmetric matrix.
This is known as the Euclidean metric tensor for any system of curvilinear coordinates. The
expression for $g_{jk}$ is

$$g_{jk} = \frac{\partial x^i}{\partial u^j}\frac{\partial x^i}{\partial u^k} \tag{A2.64}$$

where the $x^i$’s are just the ordinary rectangular Cartesian coordinates of the point P at which
the metric tensor is determined.
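
Equation (A2.64) lends itself to a direct numerical check. The sketch below uses hypothetical helper names and central differences: it forms the covariant base vectors $\mathbf{g}_j = \partial\mathbf{r}/\partial u^j$ and the metric $g_{jk} = \mathbf{g}_j\cdot\mathbf{g}_k$ for two 2-D examples, plane polar coordinates and the parabolic coordinates of section A1.2.2. The form $y = \frac{1}{2}(u^2 + v^2)$ used for the latter is an assumption of this sketch.

```python
import math

def metric(transform, u, h=1e-6):
    """Return g_jk = (dr/du^j) . (dr/du^k) at the point u, by central differences."""
    n = len(u)
    basis = []  # basis[j] approximates the covariant base vector g_j
    for j in range(n):
        up, um = list(u), list(u)
        up[j] += h
        um[j] -= h
        basis.append([(p - q) / (2 * h)
                      for p, q in zip(transform(up), transform(um))])
    return [[sum(a * b for a, b in zip(basis[j], basis[k])) for k in range(n)]
            for j in range(n)]

def plane_polar(u):
    rho, phi = u
    return [rho * math.cos(phi), rho * math.sin(phi)]

def parabolic(u):
    # x = u v, y = (u^2 + v^2) / 2 (assumed form of the nonorthogonal frame)
    a, b = u
    return [a * b, 0.5 * (a * a + b * b)]

def is_orthogonal(g, tol=1e-6):
    """A coordinate system is orthogonal at a point if g_jk is diagonal there."""
    n = len(g)
    return all(abs(g[j][k]) < tol for j in range(n) for k in range(n) if j != k)

print(metric(plane_polar, [2.0, 0.5]), is_orthogonal(metric(plane_polar, [2.0, 0.5])))
print(metric(parabolic, [1.0, 2.0]), is_orthogonal(metric(parabolic, [1.0, 2.0])))
```

A diagonal result signals an orthogonal system at the chosen point, while nonzero off-diagonal entries signal a nonorthogonal one.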

Obviously, if the curvilinear coordinates are orthogonal, the $\mathbf{g}_j$’s will be orthogonal, and
therefore, the Euclidean metric tensor will be diagonal, otherwise not. So just by looking
at an expression for the Euclidean metric tensor in terms of any given set of curvilinear
coordinates, one can decide its orthogonality. This is extremely useful information for
solving dynamical problems. First of all, with the knowledge of the metric tensor, one can
straightaway express the kinetic energy term in the expression for the Lagrangian for a single
particle, as

$$T = \frac{1}{2}m\left(\frac{ds}{dt}\right)^2 = \frac{1}{2}m\,g_{jk}\,\dot u^j\dot u^k \tag{A2.65}$$

where $ds$ comes from the line element. Second, the orthogonality will guarantee the
separability of coordinates in Hamilton-Jacobi (HJ) theory, existence of action-angle variables, etc.
The velocity $\mathbf{v}$ of a particle at P is

$$\mathbf{v} = \frac{d\mathbf{r}}{dt} = \frac{\partial\mathbf{r}}{\partial u^i}\frac{du^i}{dt} = \mathbf{g}_i\,\dot u^i \tag{A2.66}$$


Similarly, one can find an expression for acceleration by differentiating Eq. (A2.66) once 
more with respect to time t. Third, the Jacobian determinant for the transformation from 
the curvilinear set of coordinates to the set of rectangular Cartesian coordinates is just the 
determinant of the matrix representing the Euclidean metric tensor, that is, 


= det 11^*11 (42.67) 

For admissible transformations, $J$ cannot be zero, but it can be either positive or negative.
If it is positive, the transformation is said to be proper, that is, a right-handed coordinate
system is transformed into a right-handed one; otherwise it is the opposite, an improper one.
This happens because of the presence of an $\epsilon_{ijk}$ factor sitting in the expression of the
Jacobian and keeping track of the handedness.

Now in many situations, the external forces $\mathbf{F}$ are conservative in nature, and are hence
derivable from a scalar potential function, say $\Phi(\mathbf{r})$. Using the transformation equations, the
arguments of $\Phi$ can easily be converted into curvilinear coordinates, and then an expression
for force in terms of the $u^i$’s can be obtained as follows

$$\mathbf{F} = -\nabla\Phi = -\frac{\partial\Phi}{\partial\mathbf{r}} = -\frac{\partial\Phi}{\partial u^i}\frac{\partial u^i}{\partial\mathbf{r}} \tag{A2.68}$$

The entry in (A2.68) after the last equality sign has its first factor resembling an $i$th
component of the ‘force’ related to the $i$th curvilinear coordinate, and the second factor, a full
vector, appears in the reverse order of partial differentiation when compared to the definition
of $\mathbf{g}_i$. It is therefore compelling to write the Eq. (A2.68) in the form of

$$\mathbf{F} = F_i\,\mathbf{g}^i \qquad \text{with} \qquad \mathbf{g}^i = \frac{\partial u^i}{\partial\mathbf{r}} = \nabla u^i \tag{A2.69}$$

thus defining the contravariant base vectors, which are most suitable for expressing force
components in curvilinear coordinates. Since $\mathbf{g}^i$ is the gradient of $u^i$, it means that $\mathbf{g}^i$
must point in a direction normal to the level surfaces represented by $u^i$ = constant, which
is precisely the definition of the coordinate surface for the coordinate $u^i$.

So it is now established that the $\mathbf{g}_i$’s correspond to the covariant base vectors pointing
tangentially to the coordinate lines, and that the contravariant base vectors $\mathbf{g}^i$’s point normally
to the coordinate surfaces. Also we have known that the components of a vector along the





APPENDIX A3 


Tensors 


A3.1 FORMAL CONCEPTS OF SCALARS AND VECTORS 

We are coming back to vectors again and again, but this time we shall study them from the 
point of view of the properties of coordinate transformations. Scalars and vectors are just 
special cases of tensors, the former being tensors of rank zero, and the latter of the rank 
one. 

It is often convenient to formulate laws of physics in some coordinate system. However, 
the equations expressing laws of nature should have an invariant meaning when we make 
coordinate transformations, just as we expect them to have an invariant meaning under 
changes of systems of units. 

In this section, we shall study 3-D Cartesian vectors and scalars. These geometric objects
are useful in the study of many natural phenomena.

A3.1.1 Scalars 

A quantity is called a scalar if it has only a single component, say $\phi$, in terms of any one
system of coordinates and a single component, say $\phi'$, in the variables of any other system
of coordinates (the two systems of coordinates being connected by an admissible
transformation), and if $\phi$ and $\phi'$ are numerically equal at the corresponding points. It can be
called a true scalar if the physical quantity can be represented by a single number, which
has the same value in all coordinate frames, and has no explicit or implicit dependence on
any coordinate frame. For its representation one needs only a system of units in order to
express its magnitude or value. To the best of our present-day knowledge, the examples of
true scalars are the speed of light in vacuum, Planck’s constant, the electric charge of an
electron, etc.

There are scalars which are independent of one particular type of coordinate frame, but
not of others. Such scalars are named after the general type of coordinate frame with respect
to which they are independent. So, the Cartesian scalars are those that are independent
of Cartesian frames, for example, the distance between two points in 3-D Euclidean space, 
speed, energy and mass of particles, etc., along with the list of true scalars. However, when 
we move to special relativistic dynamics, most of these above mentioned Cartesian scalars 
do not behave as scalars. The scalars in the domain of the theory of special relativity have 
to be Lorentzian scalars, supposedly independent of the Lorentzian transformations between 
4-D Minkowskian coordinate frames. Examples of Lorentzian scalars include the invariant 
line element or the 4-D Minkowskian distance between any two events, rest mass of particles,
etc. The 3-D distance between two points, mass and energy of particles are no longer scalars
in Minkowskian frames. In fact, the relativistic mass of a particle behaves as a tensor of 
rank two. 

A3.1.2 Cartesian Vectors 

The conceptual development of vectors depends intimately on how the idea of scalars is 
perceived. For example, when one wants to build the idea of Lorentzian vectors in the 4-D 
Minkowskian space, the central scalar in the development is either the speed of light in 
vacuum or the invariance of the 4-D Minkowskian line element, the 4-D distance between 
two neighbouring events. Naturally, for Cartesian vectors, some Cartesian scalar has to play 
the key role, and this is chosen to be the Euclidean line element, or in simple words, the 
distance between any two points in the 3-D Euclidean space. 

The positions of any two arbitrary points, say P and Q, in a given rectangular Cartesian
frame OXYZ, are expressed by two unique sets of ordered numbers, say $(x_1, x_2, x_3)$
and $(y_1, y_2, y_3)$. The Cartesian distance between these two points is defined as
$d(P,Q) = \sqrt{\sum_{i=1}^{3} (y_i - x_i)^2}$. Since this is a Cartesian scalar, it should be, by definition, independent
of all rectangular Cartesian frames. When we move to a new rectangular Cartesian frame
OX'Y'Z' having the origin still at O, we must have another unique set of three numbers for
each point in space, say $(x'_1, x'_2, x'_3)$ and $(y'_1, y'_2, y'_3)$ for P and Q respectively, such that

$$d^2(P,Q) = \sum_{i=1}^{3} (y_i - x_i)^2 = \sum_{i=1}^{3} (y'_i - x'_i)^2 \qquad (A3.1)$$

If the connection between the old and new sets of coordinates for any point, say P, with
$(x_1, x_2, x_3)$ and $(x'_1, x'_2, x'_3)$, can be expressed by any explicitly prescribed global function of
the coordinates, then such functions are said to represent a coordinate transformation. Since
the distance between two points is not allowed to change, the transformations should be
homogeneous functions of coordinates of degree not exceeding one. Homogeneous functions
of degree zero correspond to a globally uniform translation, which is prevented here by the
choice of keeping the origins of the two frames coincident at O. So the only alternative left
for the transformation is to be homogeneously linear in coordinates, that is,

$$x'_i = \sum_{j=1}^{3} a_{ij} x_j \quad \text{with all } a_{ij}\text{'s real constants} \qquad (A3.2)$$

Since this transformation is global, that is, valid for the entire 3-D space, we must have
$y'_i = \sum_j a_{ij} y_j$, and the condition (A3.1) can be satisfied (for details see the proof of Euler’s
theorem in section 12.2), provided

$$\sum_{i=1}^{3} a_{ij} a_{ik} = \delta_{jk} \qquad (A3.3)$$

where $\delta_{jk}$ is the unit matrix in Kronecker’s delta notation.
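
The condition (A3.3) can be checked numerically. The following is only a minimal sketch in plain Python, not part of the original derivation: the function names, the rotation angle, the two sample points and the shear matrix are all assumptions chosen for illustration. It compares a rotation about OZ, which satisfies (A3.3), with a shear, which is linear but does not.

```python
import math

def rotation_about_z(angle):
    """Coefficients a_ij for a rotation of the axes about OZ by `angle` (radians)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def transform(a, x):
    """x'_i = sum_j a_ij x_j, as in Eq. (A3.2)."""
    return [sum(a[i][j] * x[j] for j in range(3)) for i in range(3)]

def orthogonality_defect(a):
    """Largest deviation of sum_i a_ij a_ik from delta_jk, Eq. (A3.3)."""
    worst = 0.0
    for j in range(3):
        for k in range(3):
            s = sum(a[i][j] * a[i][k] for i in range(3))
            delta = 1.0 if j == k else 0.0
            worst = max(worst, abs(s - delta))
    return worst

def distance(p, q):
    """Cartesian distance d(P, Q) between two points."""
    return math.sqrt(sum((q[i] - p[i]) ** 2 for i in range(3)))

# Illustrative (assumed) data: two points and two linear transformations.
p, q = [1.0, 2.0, 3.0], [-2.0, 0.5, 4.0]
rotation = rotation_about_z(0.7)
shear = [[1.0, 0.5, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]  # linear, not orthogonal

d_old = distance(p, q)
d_rotated = distance(transform(rotation, p), transform(rotation, q))
d_sheared = distance(transform(shear, p), transform(shear, q))

print("defect (rotation):", orthogonality_defect(rotation))
print("defect (shear):   ", orthogonality_defect(shear))
print("distances:", d_old, d_rotated, d_sheared)
```

In this sketch the rotation leaves the distance unchanged to rounding accuracy, while the shear changes it, in line with the requirement that only transformations obeying (A3.3) preserve (A3.1).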

This is precisely the condition for the matrix $A = \{a_{ij}\}$ to be orthogonal, namely its
transpose being equal to its inverse. The message that comes out of this exercise is that the




So now we can define vectors in curvilinear coordinate systems on the model of the
differentials of the position coordinates, preserving still the Cartesian notion of invariance of
distances between neighbouring points, so that the components $\{P^i\}$ and $\{P'^i\}$ of any
vector P in the two frames are defined to follow the rule of transformation, given by

$$P'^i = \frac{\partial x'^i}{\partial x^j} P^j \qquad (A3.8)$$


But there are in general two different types of position coordinates for the same point P: one
contravariant and the other covariant. Hence, for any given vector also, there should
be contravariant and covariant forms of the vector with two different sets of components for
the same set of curvilinear coordinates. In section A2.9, we defined the covariant set of
coordinates as components with respect to the contravariant basis set, which point normal
to the coordinate surfaces. The equations for coordinate surfaces are scalar functions of
coordinates, say $\phi(x^1, x^2, x^3)$ = constant. Normals to such surfaces are decided by $\nabla\phi$. So
we should model the covariant vectors on the transformation of $\nabla\phi$, where $\phi$
is any Cartesian scalar point function of coordinates. So we have


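$$d\phi = \frac{\partial \phi}{\partial x'^i}\, dx'^i$$

and also

$$d\phi = \frac{\partial \phi}{\partial x^j}\, dx^j = \frac{\partial \phi}{\partial x^j} \frac{\partial x^j}{\partial x'^i}\, dx'^i$$

and comparing the two

$$\frac{\partial \phi}{\partial x'^i} = \frac{\partial x^j}{\partial x'^i} \frac{\partial \phi}{\partial x^j} \quad \text{or} \quad (\nabla'\phi)_i = \frac{\partial x^j}{\partial x'^i} (\nabla\phi)_j$$

Therefore, if $\{P_i\}$ and $\{P'_i\}$ are the covariant components of the vector P in the two
frames, we can define the covariant vector transformations given by

$$P'_i = \frac{\partial x^j}{\partial x'^i} P_j \qquad (A3.9)$$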


Equation (A3.8) is taken to be the definition of the contravariant vector
transformation. Unlike Eq. (A3.7), the matrix of the transformation in Eq. (A3.9)
corresponds to the Jacobian matrix for the inverse transformation of coordinates.


Thus a vector is said to be contravariant if it transforms like the differentials of the 
coordinates (but not as whole coordinates) of any coordinate system under consideration. 
A vector is covariant if it transforms as a gradient of a scalar potential function or scalar 
function. 
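
The two rules (A3.8) and (A3.9) can be tried out on a concrete change of coordinates. The sketch below is illustrative only: it assumes the familiar passage from plane Cartesian coordinates (x, y) to plane polar coordinates (r, theta), and the function names, the sample point, the displacement P and the scalar function phi = x^2 + y are all assumptions made for the example. A small displacement is transformed as a contravariant vector and a gradient as a covariant one; their contraction is then compared in the two systems.

```python
import math

def jacobian(x, y):
    """J[i][j] = d x'^i / d x^j for (x, y) -> (r, theta)."""
    r2 = x * x + y * y
    r = math.sqrt(r2)
    return [[x / r, y / r], [-y / r2, x / r2]]

def inverse_jacobian(x, y):
    """K[j][i] = d x^j / d x'^i for the same transformation."""
    r = math.hypot(x, y)
    theta = math.atan2(y, x)
    return [[math.cos(theta), -r * math.sin(theta)],
            [math.sin(theta), r * math.cos(theta)]]

def contravariant_transform(J, P):
    """P'^i = (d x'^i / d x^j) P^j, Eq. (A3.8)."""
    return [sum(J[i][j] * P[j] for j in range(2)) for i in range(2)]

def covariant_transform(K, Q):
    """Q'_i = (d x^j / d x'^i) Q_j, Eq. (A3.9)."""
    return [sum(K[j][i] * Q[j] for j in range(2)) for i in range(2)]

def contract(P, Q):
    """Sum over the repeated index: P^i Q_i."""
    return sum(p * q for p, q in zip(P, Q))

# Illustrative (assumed) data at the point (x, y) = (3, 4).
x, y = 3.0, 4.0
P = [0.2, -0.1]       # a small displacement: contravariant components
Q = [2.0 * x, 1.0]    # gradient of phi = x^2 + y: covariant components

P_new = contravariant_transform(jacobian(x, y), P)
Q_new = covariant_transform(inverse_jacobian(x, y), Q)

print("P^i Q_i in (x, y):     ", contract(P, Q))
print("P'^i Q'_i in (r, theta):", contract(P_new, Q_new))
```

The two components of Q_new agree with the partial derivatives of phi with respect to r and theta worked out directly, and the contraction is the same in both systems, although the individual components of P and Q change quite differently.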


A3.2 TENSORS

A3.2.1 Addition and Multiplication of Two Vectors 

Two vectors can be added if their physical dimensions and the covariance (or contravariance)
nature are identical. So any two covariant (or contravariant) vectors of the same physical
dimension are allowed to be added in order to give another covariant (or contravariant) 
vector. 

However, the multiplication of two ordinary Cartesian vectors, depending on the rule of
multiplication, can result in a scalar, a vector or a tensor of rank two. The scalar product is
defined as an inner product, and the vector product is a peculiar operation giving a vector
pointing in a direction perpendicular to the plane of the two vectors. There is however another
type of product, called the outer product of two vectors, that gives rise to a tensor of rank two.
These three somewhat arbitrary rules of multiplication were invented over the years
in order to understand the variety of natural processes. Such arbitrariness was unavoidable
because people did not make any distinction between covariant and contravariant vectors.
Once these distinctions were made clear, vector multiplication reduced to a single simple
rule: write the two vectors side by side, transform them individually
according to their transformation rules, and see how the product behaves.


A3.2.2 Tensors 


Any physical quantity whose transformation properties can be identified with those of the 
product of some number of vectors, all defined in the same region of space with the same 
specified rules of coordinate transformations, is by definition a tensor. Vectors are considered 
as tensors of rank one. The rank of a tensor is determined depending on the number of 
vectors and on their nature of covariance/contravariance. 

The product of a covariant vector with a contravariant one, both described in the same
region of space, can give a scalar (not always), but the product of two covariant (or
contravariant) vectors would give what is by definition a covariant (or contravariant) tensor
of rank two. It has nine components in 3-D, and sixteen in 4-D. Similarly, a product of
three vectors of the same type would give a tensor of rank three. So the product of two
vectors can give either a scalar or a tensor of rank two. Then you may ask what happens to
the conventional notion of a vector product. Well, it is easy to see that any antisymmetric
3 x 3 matrix has only three independent elements. So a 3-D antisymmetric tensor of rank
two can be viewed as an axial vector. This only shows the total integrity of the tensorial
operations in formulating the physical laws.
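
The remark about the vector product can be made concrete for ordinary Cartesian vectors. In the sketch below, which is only an illustration, the helper names, the sample vectors and the normalisation of the antisymmetric part (taken here as $P_i Q_j - P_j Q_i$, without a factor of one half) are assumptions. The three independent elements of the antisymmetric part of the outer product are read off and compared with the components of the usual vector product.

```python
def outer(p, q):
    """Outer product S_ij = P_i Q_j: nine components in 3-D."""
    return [[p[i] * q[j] for j in range(3)] for i in range(3)]

def antisymmetric_part(s):
    """A_ij = S_ij - S_ji (normalisation chosen for this example)."""
    return [[s[i][j] - s[j][i] for j in range(3)] for i in range(3)]

def axial_vector(a):
    """The three independent elements (A_23, A_31, A_12)."""
    return [a[1][2], a[2][0], a[0][1]]

def cross(p, q):
    """Conventional vector product P x Q."""
    return [p[1] * q[2] - p[2] * q[1],
            p[2] * q[0] - p[0] * q[2],
            p[0] * q[1] - p[1] * q[0]]

# Illustrative (assumed) integer components, so the comparison is exact.
P = [1, 2, 3]
Q = [4, 5, 6]

A = antisymmetric_part(outer(P, Q))
print("antisymmetric part:", A)
print("axial vector:      ", axial_vector(A))
print("P x Q:             ", cross(P, Q))
```

The diagonal of A vanishes and the elements below the diagonal are the negatives of those above it, so only three numbers survive, and these are the components of P x Q.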

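So one can define the transformations of the second rank contravariant, covariant and
mixed tensors as follows:

$$S'^{ij} = P'^i Q'^j = \frac{\partial x'^i}{\partial x^k} \frac{\partial x'^j}{\partial x^l} P^k Q^l = \frac{\partial x'^i}{\partial x^k} \frac{\partial x'^j}{\partial x^l} S^{kl} \qquad (A3.10)$$

$$S'_{ij} = P'_i Q'_j = \frac{\partial x^k}{\partial x'^i} \frac{\partial x^l}{\partial x'^j} P_k Q_l = \frac{\partial x^k}{\partial x'^i} \frac{\partial x^l}{\partial x'^j} S_{kl} \qquad (A3.11)$$

$$S'^i{}_j = P'^i Q'_j = \frac{\partial x'^i}{\partial x^k} \frac{\partial x^l}{\partial x'^j} P^k Q_l = \frac{\partial x'^i}{\partial x^k} \frac{\partial x^l}{\partial x'^j} S^k{}_l \qquad (A3.12)$$

If we now take the trace of the matrix representation of any mixed tensor,

$$S'^i{}_i = \frac{\partial x'^i}{\partial x^k} \frac{\partial x^l}{\partial x'^i} S^k{}_l = \delta^l_k S^k{}_l = S^k{}_k \qquad (A3.13)$$
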

it becomes totally independent of the coordinate frames, as the coordinate transformations 
are invertible and nonsingular. Eq. (A3.13) also defines the Kronecker delta symbol in 
the form of a mixed tensor. This is the correct form of the Kronecker delta symbol. That 
this is a mixed tensor can be proved by putting it back in Eq. (A3.12). However, the 
most important message that emerges from Eq. (A3.13) is that the trace of the matrix 
representation of any mixed tensor is a scalar. This process of reducing a mixed tensor 
into a scalar is called the ‘contraction of indices’. A contravariant index, usually denoted 
by an upper index, can always be contracted with a covariant index, usually denoted by a 
lower one, by taking a trace with reference to the indices contracted. While playing with 
the tensors, scalars are always formed by the process of contraction of indices alone, leaving 
behind no indices that are not repeated. Scalars are obviously tensors of rank zero. 
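
The invariance of the trace of a mixed tensor, and its failure for a purely covariant tensor, can be seen in a small numerical experiment. The sketch below is illustrative only: it assumes a linear, non-orthogonal change of coordinates x' = J x in two dimensions, so that the Jacobian matrices are constant, and the matrices J and S as well as the helper names are assumptions. The same array of numbers S is transformed once as a mixed tensor and once as a covariant tensor.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse_2x2(m):
    """Inverse of a nonsingular 2 x 2 matrix."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]

def transpose(m):
    return [list(row) for row in zip(*m)]

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

# Illustrative (assumed) data.
J = [[2.0, 1.0], [0.0, 1.0]]   # J[i][k] = d x'^i / d x^k
K = inverse_2x2(J)             # K[l][j] = d x^l / d x'^j
S = [[1.0, 2.0], [3.0, 4.0]]   # components in the old frame

# Mixed tensor: S'^i_j = J^i_k S^k_l K^l_j
mixed_new = matmul(matmul(J, S), K)
# Covariant tensor: S'_ij = K^k_i S_kl K^l_j
covariant_new = matmul(matmul(transpose(K), S), K)

print("trace before:              ", trace(S))
print("trace after, mixed rule:   ", trace(mixed_new))
print("trace after, covariant rule:", trace(covariant_new))
```

Only the mixed rule keeps the trace unchanged, because only there does a Jacobian matrix meet its own inverse when the indices are contracted.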


A3.2.3 Properties of Tensors 

(i) If all the components of a tensor vanish in one coordinate frame, then they vanish in 
all coordinate frames, which are in one-to-one correspondence with the given system. 

(ii) The sum or difference of two tensors of the same physical dimension and rank is again 
a tensor of the same physical type and rank. 

(iii) If a tensor equation is true in one coordinate system, then it is true in all other
coordinate systems, which are in one-to-one correspondence.

So it is now apparent that most physical quantities have a tensorial character. If the laws of 
nature are expressed in the form of equations or relations in terms of tensors, their physical 
reality and transformation properties can immediately be inferred, and applied or adapted 
to the physical conditions under which they can be tested or further experimented. 

A3.2.4 Concept of Tensors Simplified 

The way we have introduced the concept of tensors here might appear quite formal. We can 
perhaps tone down a little bit, and talk about these in a more informal way. 

Let P denote a Cartesian vector in the 3-D Euclidean space with components $(P_1, P_2, P_3)$
with respect to a rectangular Cartesian coordinate system. Let this coordinate system be
rotated (that is, transformed orthogonally) to a new coordinate frame with respect to which
the components of P are $(P'_1, P'_2, P'_3)$. We know that these two sets are related by

$$P'_i = A_{ik} P_k$$




to the components of a single vector. Thus, given three vectors P, Q and S, we may have

$$P_i = T_{ijk} Q_j S_k \qquad (A3.18)$$

If we now require that the form of Eq. (A3.18) remain invariant under the aforementioned
coordinate transformation, we get

$$T_{ijk} = A^{-1}_{il}\, T'_{lmn}\, A_{mj} A_{nk} \qquad (A3.19)$$

where again A is the transformation matrix for the orthogonal coordinate transformation.
Equation (A3.19) defines a 3-D Cartesian tensor of the third rank. The above procedure can
be generalised in a straightforward way to define 3-D Cartesian tensors of arbitrary rank.
The general transformation defining an rth rank Cartesian tensor is

$$T_{ijkl\ldots} = A^{-1}_{im}\, T'_{mnop\ldots}\, A_{nj} A_{ok} A_{pl} \cdots \qquad (A3.20)$$

Any physical quantity labeled by r indices and having $3^r$ components, which transform
according to Eq. (A3.20) under the orthogonal transformation of Cartesian coordinates, is
called a Cartesian tensor of rank r.
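
The third rank transformation law can be tested numerically. The sketch below is illustrative only. It uses the rule in the equivalent form $T'_{ijk} = A_{il} A_{jm} A_{kn} T_{lmn}$, which assumes that A is orthogonal so that its inverse equals its transpose; the rotation angle, the sample tensor and vectors and all function names are assumptions. It checks that $P_i = T_{ijk} Q_j S_k$ keeps its form in the rotated frame, and that the permutation symbol, treated as a third rank tensor, is left unchanged by a proper rotation.

```python
import math

R3 = range(3)

def rotation_about_z(angle):
    """Orthogonal matrix A for a rotation of the axes about OZ."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def rotate_vector(a, v):
    """P'_i = A_ik P_k."""
    return [sum(a[i][k] * v[k] for k in R3) for i in R3]

def rotate_rank3(a, t):
    """T'_ijk = A_il A_jm A_kn T_lmn (A assumed orthogonal)."""
    return [[[sum(a[i][l] * a[j][m] * a[k][n] * t[l][m][n]
                  for l in R3 for m in R3 for n in R3)
              for k in R3] for j in R3] for i in R3]

def apply_rank3(t, q, s):
    """P_i = T_ijk Q_j S_k."""
    return [sum(t[i][j][k] * q[j] * s[k] for j in R3 for k in R3) for i in R3]

def levi_civita():
    """Permutation symbol epsilon_ijk."""
    eps = [[[0.0] * 3 for _ in R3] for _ in R3]
    for i, j, k in [(0, 1, 2), (1, 2, 0), (2, 0, 1)]:
        eps[i][j][k] = 1.0
        eps[i][k][j] = -1.0
    return eps

def max_difference(t1, t2):
    return max(abs(t1[i][j][k] - t2[i][j][k]) for i in R3 for j in R3 for k in R3)

# Illustrative (assumed) data.
A = rotation_about_z(0.4)
T = [[[float(i + 2 * j + 3 * k + 1) for k in R3] for j in R3] for i in R3]
Q, S = [1.0, -2.0, 0.5], [0.3, 4.0, -1.0]

P = apply_rank3(T, Q, S)
P_rotated = rotate_vector(A, P)
P_from_primed = apply_rank3(rotate_rank3(A, T), rotate_vector(A, Q), rotate_vector(A, S))
form_error = max(abs(P_rotated[i] - P_from_primed[i]) for i in R3)

eps = levi_civita()
eps_error = max_difference(eps, rotate_rank3(A, eps))

print("form invariance error:", form_error)
print("epsilon change under rotation:", eps_error)
```

Both errors come out at the level of rounding, so the components of P computed in the new frame from the primed quantities coincide with the rotated components of P.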

A3.2.5 Importance of Tensor Analysis 

Every big discovery in physics or in any other branch of science leads essentially to a claim 
for having found a link between two different ideas or phenomena. A physical law is merely 
a statement of this new link between two not-so-obvious ideas or phenomena. In order to 
express the law in an abstract form or say in terms of mathematical logic, an equation is 
developed, the left-hand side of which comes from one set of assumptions and logic, and the 
right-hand side originates from another set. First of all, the two sides must have the same 
physical dimensions. It is a necessity. Second, it is desirable that the form of the equation 
have general validity with respect to any frame of reference. This is possible only if every 
term in the equation has the same tensor characteristics. If this condition is not satisfied, a 
simple change of coordinate system will destroy the form of the relationship. 

Tensor analysis is in fact as important as dimensional analysis in formulating physical
laws. Two physical quantities cannot be equated unless they have the same dimensions, that
is, no physical equation can be correct unless it is invariant with respect to a change of
fundamental units.

The use of tensors enables us to have a unified treatment, which is good for any curvilinear 
coordinates — orthogonal or nonorthogonal. This is achieved at the expense of recognising 
the distinction between contravariance and covariance. Each physical vector has two tensor 
images: one contravariant and the other covariant, depending on how the components are 
resolved. The distinction between contravariance and covariance disappears if the
transformations are restricted to only rectangular Cartesian coordinate systems. So when only the
rectangular Cartesian coordinates are considered, sometimes all the indices are written as 
subscripts. 

Given any tensor of second or higher rank, it is possible to define vectors and scalars from
it by merely devising some process of contraction of the indices. So even if a quantity on
one side of the equation representing a law to be proposed has a rank apparently mismatching
that of the other side, a valid scheme of contracting tensorial indices and/or
coupling with the metric tensor is of paramount significance. It helps one build a valid
tensor equation out of tensorial quantities which have non-agreeing ranks to start with. It
is much like the process of matching the dimensions between the two sides of an equation by
invoking a constant factor which absorbs the required balance of the physical dimensions.

All the transformation laws defining tensors, Cartesian or otherwise, are based on some 
given coordinate transformation, which, in turn, is valid in some region of space. This, in 
general, makes the components of a tensor change as one scans through the points of the 
underlying space. Thus each component of a tensor becomes a function, whose domain is the 
region of space over which the tensor is defined and whose range is the set of real numbers 
R. When this function is specified for every tensor component (explicitly or implicitly), we 
say that we have defined a tensor field. For example, the moment of inertia tensor of a rigid 
body is a Cartesian tensor field defined over the finite extension of the rigid body. 





APPENDIX B 


Sample of Short Questions 


These questions were actually set for the first year students of the TIFR - PU joint M.Sc.
programme in physics during Aug - Dec 1987. We adopted a system of evaluation giving
40% of the course credit to an internal assessment and the rest to the end of semester final
examination. The internal assessment itself had two parts: three quarters of
it was based on the average of performances in monthly tests, and a quarter on problem
solving and attendance at weekly tutorials for problem solving. One had to qualify in both
the internal and end of semester assessments. Out of about 60 students, about one fifth
used to be nationally selected, who were usually graded A, while the local students had
to struggle to secure the minimum qualification.


Class Test I 


All questions are to be attempted. 

1. A simple pendulum is swinging without any sign of damping. Justify whether it can 
be regarded as a closed and/or a conservative system. Which are the quantities that 
are not conserved? 

2. A rigid cylinder rolling on the horizontal floor of a hall comes across a vertical wall and 
starts slipping against the wall and the floor, with the coefficient of dynamical friction 
$\mu_k$. Find the forces of friction and show that the effective weight is reduced. Do these
reaction forces depend on the instantaneous angular velocity of the cylinder? 

3. Determine the kinetic energy of a tractor crawler belt of mass m, if the tractor is 
moving with speed v and the crawler, having been geared with two wheels, rolls over 
the road without slipping.

4. Using the principle of conservation of linear momentum, derive the equation of motion 
of a rocket of (continuously varying) mass m, burning fuel at a rate dm/dt and ejecting 
the exhaust gas with velocity with respect to the rocket. Under what condition 
of the rocket orientation will the speed of the rocket remain unchanged in spite of the 
rocket action? 

5. A pencil of length $l$ placed vertically falls down. What will be the angular velocity at
the end of the fall? 

6. How can one derive the constraint forces from a given set of constraint relations? 

7. Define virtual displacement and show that virtual work done by holonomic constraint 
forces is zero. 




8. An incline is accelerated horizontally in order to prevent the frictionless sliding of a
block sitting on the incline. Using D’Alembert’s principle, calculate how much
horizontal acceleration is needed, if the incline makes an angle $\alpha$ with the horizontal.

9. Show that all the gyroscopic forces can be included in the definition of generalised 
potential. Just cite an example. 

10. Construct the Lagrangian of a bicycle rolling down an incline. 

11. A rigid rod of length l is constrained to move in such a way that both of its ends 
always maintain physical contact with the inside surface of a hemispherical bowl of 
radius R (> r). Find the degrees of freedom of the system. Why do nonholonomic 
systems require a larger number of generalised coordinates than the number of degrees 
of freedom? 

12. Give an example where the constraint is not only holonomic but also rheonomic and 
bilateral. Are all the scleronomic systems conservative? 

13. Why should a Lagrangian be computed only with reference to inertial frames? How is 
this requirement satisfied when one wants to construct the Lagrangian for the motion 
of a particle in a rotating frame? 

14. Using $\delta_{ij}$ and $\epsilon_{ijk}$, evaluate the gradients of the following scalars: $|\omega \times r|^2$ and
$v \cdot (\omega \times r)$, where $\omega$ is a constant vector and $v$ does not explicitly depend on $r$.

15. Compare the qualitative effects of the Coriolis forces in the two geographical
hemispheres of the earth for (a) vertical free fall of any object, (b) rivers flowing east, and
(c) a pendulum swinging in a vertical plane.

16. Show that the time rate of change of any vector quantity A in a rotating frame differs
from its inertial counterpart by a term $\omega \times A$, where $\omega$ is the angular velocity of
rotation of the rotating frame with respect to the inertial frame.

17. Why does a plumbline usually indicate the true vertical line at any place, even though 
the earth is not truly spherical in shape and the centrifugal forces due to earth’s rotation 
are not negligible compared to the gravitational forces? 

18. Derive the condition for Galilean invariance and show that the Lagrangian for a free 
particle changes under Galilean transformation only by a totally differentiable function 
with respect to time. 

19. Following Noether’s symmetry arguments, prove that the total angular momentum of 
any closed system is conserved due to the isotropy of space. 

20. Write down the Lagrangian of a free particle in the rectangular Cartesian, spherical 
polar and cylindrical polar coordinates, and interpret the significance of the changes in 
the number of cyclic coordinates in each of them. 

Class Test II 

All questions are to be attempted. 

1. What do you mean by closure, boundedness and stability of orbits under central forces? 
State their necessary conditions. 

2. Analyse the following forces to find whether they are central or not : uniform gravity
field, tidal force field of moon acting on the ocean water on earth, and force of impact
during isotropic elastic collision of two neutral atoms.

3. How many independent integrals of motion are there for a planet orbiting around the 
sun? State them, both in terms of the orbital elements and in terms of the first integrals 
of motion. 

4. Explain the fact that the actual force exerted by moon on any drop of ocean water on 
earth is about 30 times larger than the tidal force that is responsible for the origin of 
oceanic tides. 

5. A heavy horizontal turntable is rotating with a uniform angular velocity $\omega$. A sphere
is set rolling on it without slipping. If its initial centre of mass velocity is $v_0$ with
respect to an inertial observer, how will the force F acting on it change with time in 
the same inertial frame? Is this a central force? 

6. Explain with diagrams what are meant by differential scattering cross-section,
extinction and impact parameter.

7. What is the virial of an interacting N-particle system? Using the virial theorem derive
the equation of state of any ideal monatomic gas. 

8. State the conditions under which Hamilton’s and Maupertuis’ principles of least
action are valid. What are the independent arguments of Hamilton’s characteristic and
principal functions?

9. Prove the following: $d(\delta q) = \delta(dq)$ and $\delta \dot{q} = d(\delta q)/dt$. Explain what you mean
by the quantity $\delta H$.

10. Find the equation of the curve if it satisfies $\delta \int_1^2 y\, ds = 0$, where s is measured along
the curve (in two dimensions only) and y is measured vertically downward.

11. What are brachistochrones and tautochrones? Under what conditions do the two
become identical? Define and draw cycloids and hypocycloids.

12. Find the Hamiltonian $H(r, p)$ for a relativistic particle that has its Lagrangian
$L(r, v) = -m_0 c^2 (1 - v^2/c^2)^{1/2} - V(|r|)$.

13. Show that $L(q, \dot{q}, t)$ and $L'(q, \dot{q}, t) = L(q, \dot{q}, t) + dF(q, t)/dt$ produce the same
equations of motion. Show that the transformation is canonical. Use
$F_2(q, P) = qP - F(q, t)$.

14. What is a Routhian? Find the expression for energy in terms of the Routhian function 
only. When is the knowledge of Routhian important? 

15. Show that the Runge-Lenz vector for planetary motion is a constant of motion. Why 
is it also called the eccentricity vector? 

16. The Hamiltonian of a charged particle moving in an electromagnetic field is
$H(r, p, t) = (p - eA)^2/(2m) + e\phi$, where $A(r, t)$ and $\phi(r, t)$ are the electromagnetic potentials.
Show that the energy is $E = mv^2/2 + e\phi$.

17. Show that for a simple 2-D phase space transformation from $(q, p)$ to $(Q, P)$, the
Jacobian is unity if the transformation is canonical. Why is it an important result?

18. Mention any six examples of physically interesting canonical transformations, and at 
least one which is much used as a point transformation but is itself not canonical. Show 
that an exchange transformation is canonical. What is its significance? 

19. Show that under any point transformation $q_i = q_i(Q_1, \ldots, Q_n, t)$, $i = 1, \ldots, n$, the
momentum and energy change in the following way:

$$P_j = p_i \frac{\partial q_i}{\partial Q_j} \quad \text{and} \quad E' = E - p_i \frac{\partial q_i}{\partial t}$$

20. Find the generating function $F_2(q, P)$ and the transformed Hamiltonian $K(P, Q)$, if
$F_1(q, Q) = \frac{1}{2}\sqrt{km}\, q^2 \cot Q$ and $H(p, q) = (p^2/2m) + (kq^2/2)$.


Class Test III 

All questions are to be attempted. 

1. Show that for a 3-D isotropic harmonic oscillator, the Runge-Lenz tensor
$A_{ij} = p_i p_j/2m + k x_i x_j/2$ is a constant of motion, that is, its PB with the Hamiltonian
$H = |p|^2/2m + k|r|^2/2$ is zero.

2. If $H = |p|^2/2m + V(x_1, x_2, x_3)$ and L is the angular momentum, show that the
Poisson bracket $[L, H] = T$ = torque about the origin = $r \times F = -r \times \nabla V$.

3. Prove Jacobi’s theorem for the time-dependent Hamilton-Jacobi theory. 

4. What are the advantages of having the H-J equation separated in all variables? 

5. What are action variables and under what conditions are they integrals of motion? 
Why should they be called ‘angle’ and ‘action’ variables? 

6. For planetary orbits, define $J_r$, $J_\theta$ and $J_\phi$ and specify the suitable range of integrations
for $r$, $\theta$ and $\phi$ respectively. Show that the frequencies of oscillation in $r$, $\theta$, $\phi$ are
degenerate, since $(J_r + J_\theta + J_\phi)^2 E + 2\pi^2 G^2 M^2 m^3 = 0$ is satisfied. How would you
remove these degeneracies?

7. What do you mean by libration, rotation and adiabatic invariants? 

8. Cite any three examples of adiabatic invariants. 

9. Show that potential energy has to be minimum for a stable equilibrium. 

10. Mercury fills a U tube up to a total column length L. The tube is rocked so that 
mercury begins to oscillate. Find the period of oscillation. 

11. Find the conditions under which a normal mode of small amplitude oscillation is
possible for a system having n degrees of freedom. The characteristic equation is
$(b_{ij} - p^2 a_{ij}) A_j = 0$, where the $A_j$'s are the complex amplitudes and p is the frequency.

12. What do you mean by normal coordinates, principal oscillations and the normal modes 
of small amplitude oscillations? 

13. Find the number of degrees of freedom of the following rigid bodies : 

(a) Your ball point pen, when you are drawing a diagram on a plane sheet of paper 
(will there be any difference if you use a fountain pen?), 

(b) a table lamp stand that has two totally flexible joints both connected via rigid rods 
(the base being immovable), 

(c) a rigid cylinder rolling without slipping on a plane surface. 

14. Find the location of the instantaneous axis of rotation of the following bodies : 

(a) the front wheel of your bicycle while in motion, 

(b) a spinning sphere at a great distance showing you no more than half of its surface, 

(c) a bus taking a left turn along any gentle curve of a road. 




15. Interpret different meanings of the expression v = * + u) x r for the description of 
any rigid body moving in any manner. 

16. Suppose you know the moment of inertia tensor of a given rigid body about a given 
point which is not its centre of mass. Find the moment of inertia tensor about the 
centre of mass. 

17. Find the principal moments of inertia and the corresponding principal axes about the 
centre of a Christian ‘cross’ made up of two thin bars of unequal lengths. 

18. Draw the inertial ellipsoid of a thin circular disc about any point on the rim of the disc. 

19. Derive the conditions under which the usual torque angular momentum relation, namely 
T = dL/dt , is valid. 

20. A disc is rolling vertically without slipping on a horizontal plane. Why is its total
kinetic energy taken to be the sum of its translational and rotational components?
Why should the mixed component be zero? Is there any point on the disc about which 
only the rotational component exists? 

Class Test IV 

All questions are to be attempted. 

1. Explain why a cyclist does not fall under the action of gravity when taking a turn
either to the left or to the right. How could an expert cyclist dare to give up control of
the steering even when taking a turn?

2. Under what conditions does a polhode reduce to a point and a circle? Show that 
herpolhodes are in general bounded curves. 

3. Why is it said that a body cone always lies outside the space cone for any freely rotating 
symmetric top? 

4. If a spinning ball is made to bounce from a rough horizontal surface with an angle of
incidence $\theta_1$ with the horizontal, find its angle of reflection.

5. Given the Lagrangian of a system

$$L = \frac{1}{2} A (\dot{\theta}^2 + \dot{\phi}^2 \sin^2\theta) + \frac{1}{2} C (\dot{\phi} \cos\theta + \dot{\psi})^2 - mgh \cos\theta$$

find the effective potential $V_{\rm eff}(\theta)$ for its $\theta$-motion.

6. Interpret the condition for sleeping top, $C\omega_3 > 2\sqrt{Amgh}$. Differentiate between a
strong and a weak top. What is the mathematical argument for the stability of a
sleeping top? (Just mention it.)

7. What are the effects of friction on a fast spinning top? 

8. What is a gyroscope? State the essential property of a gyroscope. How did Foucault 
use it to demonstrate the rotation of the earth? 

9. An isotropic elastic body (Lamé’s elastic constants $\lambda$, $\mu$) is under tension in one
direction. Find its Young’s modulus and the dilation.

10. If $u$ is the deformation vector for a homogeneous elastic solid, show that $\nabla \cdot u$ is its
dilation. How will you find the shearing part of its strain $e_{ik}$? What happens if $e_{ik}$ is
tried for diagonalisation?




11. Explain the meanings of $e_{yx}$ and $\sigma_{yx}$. Show that the condition $\sigma_{yx} = \sigma_{xy}$ is good
enough for the rotational equilibrium about the x-axis. Demonstrate geometrically
that $e_{yx} = -e_{xy}$ corresponds to pure rotation, but that $e_{yx} = e_{xy}$ produces a pure
shear given by an angle $2e_{yx}$.

12. Show that a general elastic solid can have at most 21 independent elastic constants, 
if the deformation follows the generalised Hooke’s law. How is it that a cubic crystal 
is less symmetric than an isotropic body, although cubic crystals are known to exhibit 
isotropic optical properties? 

13. Starting from the equation

$$\rho \frac{\partial^2 u}{\partial t^2} = (\lambda + \mu) \nabla (\nabla \cdot u) + \mu \nabla^2 u$$

show that two transverse and one longitudinal mode of wave propagation are possible
through any isotropic elastic medium.

14. Show that the motion of the phase fluid under conservative forces is equivalent to that 
of an incompressible fluid. 

15. Show that (i) $Dr/Dt = q$, (ii) pressure potential for barotropic fluids is either the
enthalpy or Gibbs’ free energy function, and (iii) for incompressible fluid flows $\nabla \cdot q = 0$.

16. State and prove Bernoulli’s theorem of fluid dynamics. Define stream tubes and vortex 
tubes. 

17. Show that in any supersonic jet, the speed of the fluid increases as the fluid is allowed 
to expand laterally. 

18. Show that the group velocity of deep water gravity waves is exactly one half of their
phase velocity, the latter being given by $c = \sqrt{g\lambda/2\pi}$, but for the ripples the group
velocity is one and a half times the phase velocity $c = \sqrt{2\pi T/\lambda\rho}$, where
T is the surface tension of the medium.

19. Draw the two dimensional flow pattern for a steady irrotational flow of an
incompressible fluid, represented by the complex analytic function $f(z) = z^3$, where
z = x + iy, x and y being real.

20. With regard to the shape of an aeroplane, explain the origin of the lift force against 
gravity, while in motion. What are its basic orientational manoeuvrings, called yaw, 
pitch and roll? 


Final Examination 

Attempt any ten questions. 

1. State only the conditions under which 

(a) the work done by the constraint forces vanishes 

(b) the total energy becomes an integral of motion. 

Show that all the gyroscopic forces can be included in the generalised potential $U(q, \dot{q}, t)$.
Demonstrate this with an example. 

2. Find the effective potentials due to centrifugal and Coriolis forces. 

Show that the plane of oscillation of Foucault’s pendulum rotates in the opposite sense 
to that of the rotation of the earth. 

Assume that the river Mutha is flowing northward (say, near the Mhatre bridge in 
Pune) with a speed of 10 km/hr (latitude $\lambda$ = 18° N). What will be the height 
difference of its water level between the two banks due to Coriolis force, if the width of 
the river is 100 m? 
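
A rough numerical estimate for the river problem can be sketched as follows. It assumes the usual small-slope balance, in which the horizontal Coriolis acceleration 2Ωv sin λ tilts the free surface by the angle (2Ωv sin λ)/g. The function name and the values of Ω and g are assumptions for illustration.

```python
import math

def coriolis_height_difference(speed_kmh, latitude_deg, width_m,
                               omega=7.292e-5, g=9.81):
    """Height difference between the banks for a surface tilted by the Coriolis force."""
    v = speed_kmh / 3.6                                   # km/hr -> m/s
    a_horizontal = 2 * omega * v * math.sin(math.radians(latitude_deg))
    return a_horizontal * width_m / g                     # small-angle slope times width

dh = coriolis_height_difference(10.0, 18.0, 100.0)
print(f"height difference = {dh * 1000:.2f} mm")
```

Under these assumptions the difference comes out at roughly a millimetre.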

3. If L be the total angular momentum of a planet, show that $L^2 = p_\theta^2 + p_\phi^2/\sin^2\theta$. 
Find the location on the orbit where the radial component of the orbital velocity is 
maximum. Find the eccentricity of the transfer orbit for launching a geostationary 
satellite from a circular orbit around the earth, that has a radius 6600 km. (Radius of 
the earth is 6380 km, and mass $6 \times 10^{24}$ kg.) 
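
One way to estimate the eccentricity asked for here is sketched below. It assumes a two-impulse transfer ellipse whose perigee lies on the 6600 km circular orbit and whose apogee lies on the geostationary orbit, and it takes the sidereal day as 86164 s; both choices are assumptions, not statements from the book.

```python
import math

G = 6.674e-11          # gravitational constant, SI units

def geostationary_radius(mass, period=86164.0):
    """Circular-orbit radius from Kepler's third law for the given period."""
    return (G * mass * period**2 / (4 * math.pi**2)) ** (1.0 / 3.0)

def transfer_eccentricity(r_perigee, r_apogee):
    """Eccentricity of an ellipse with the given perigee and apogee distances."""
    return (r_apogee - r_perigee) / (r_apogee + r_perigee)

r_geo = geostationary_radius(6e24)
ecc = transfer_eccentricity(6.6e6, r_geo)
print(f"r_geo = {r_geo / 1e3:.0f} km, e = {ecc:.3f}")
```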

4. Show that the virial of a system of particles moving under mutual central force 
potentials satisfies the translational invariance. Apply the virial theorem to prove that a 
star becomes unstable if the ratio of the two specific heats $c_p/c_v$ of its constituent gas 
is less than 4/3. 

What is the form of the virial theorem for particles moving in their mutual gravitational 
fields? 

5. Find the Lagrangian and the Hamiltonian for a ray of light traveling in any optical 
medium having refractive index n(r,p). Find the equation of motion of its path. Show 
that the path followed by the light ray is a brachistochrone. 

6. Find the frequency of small oscillation of any rigid homogeneous hemisphere kept upside 
down on a horizontal table. Find the frequencies of the normal modes of oscillation 
of a system if its Lagrangian is given by $L(x, y, \dot{x}, \dot{y}) = (\dot{x}^2 + \dot{y}^2)/2 - (k_1 x^2 + 
k_2 y^2)/2 + \alpha x y$; $k_1$, $k_2$ and $\alpha$ being constants. 
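
For the second part, a minimal sketch of the normal-mode calculation is given below. It assumes the Lagrangian reads L = (ẋ² + ẏ²)/2 - (k₁x² + k₂y²)/2 + αxy, so that the squared frequencies are the eigenvalues of the symmetric matrix [[k₁, -α], [-α, k₂]]. The function name is hypothetical.

```python
import math

def normal_mode_frequencies(k1, k2, alpha):
    """Frequencies of x'' = -k1 x + alpha y,  y'' = -k2 y + alpha x."""
    mean = (k1 + k2) / 2
    split = math.sqrt(((k1 - k2) / 2) ** 2 + alpha ** 2)
    w2_low, w2_high = mean - split, mean + split
    if w2_low < 0:
        # alpha**2 > k1*k2: the equilibrium is unstable, no oscillation about it
        raise ValueError("unstable equilibrium: alpha**2 exceeds k1*k2")
    return math.sqrt(w2_low), math.sqrt(w2_high)

print(normal_mode_frequencies(1.0, 1.0, 0.5))
```

For k₁ = k₂ = k the two squared frequencies reduce to k - α and k + α, which correspond to the in-phase and out-of-phase combinations of x and y.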

7. Show that all the univalent canonical transformations do form a group. Mention at 
least three methods of testing whether a given phase space transformation is canonical 
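or not. Show that the electromagnetic gauge transformation $\mathbf{A}' = \mathbf{A} + \nabla f(\mathbf{r}, t)$, $\phi' = 
\phi - \partial f/\partial t$ effected by a generating function $F_2(\mathbf{r}, \mathbf{P}) = \mathbf{r}\cdot\mathbf{P} - e f(\mathbf{r}, t)$ can be 
regarded as a canonical transformation from (r, p) to (r, P). 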

8. Differentiate between libration and rotation. 

Determine the action-angle variables for a one-dimensional harmonic oscillator. Is there 
any adiabatic invariant of the system? 

Evaluate the Poisson bracket for a charged particle moving with velocity v in a 

field of magnetic induction B. 

9. How are the time-dependent and time-independent solutions of the Hamilton-Jacobi 
theory for any conservative system related to each other? What are Hamilton’s principal 
and characteristic functions? What kind of action principles do they follow from? 
State the conditions under which Hamilton’s principle is valid. Just mention what sort 
of curves you get from the following variational principles: $\delta\int_1^2 ds = 0$, $\delta\int_1^2 dt = 0$ 
and $\delta\int_1^2 y\,ds = 0$. 

10. Show that a dynamical system having n degrees of freedom can have at most $2n - 1$ 
independent constants of motion. How are they determined by the methods such as 
due to (i) Euler-Lagrange (ii) Hamilton (iii) Hamilton-Jacobi and (iv) by the method 
of Poisson brackets? Outline the Hamilton-Jacobi method to find all the constants of 
motion for a particle moving in the potential field $V = \mathbf{a}\cdot\mathbf{r}/r^3$, a being a constant 
vector. 

11. Show that (i) polhodes are closed curves, (ii) herpolhodes are bounded curves and (iii) 
steady rotations about the principal axes of any asymmetric top are stable except for 
the intermediate principal axis. 

12. Explain with diagrams the properties of the following : body cone and space cone, 
Foucault’s gyroscope, sleeping top, tippe top, Chandler’s wobbling of the earth and 
boomerang. 

13. A bicycle rider is moving with a constant speed $v_0$ along a horizontal circular track of 
radius R. The total mass of the bicycle and the rider is M, the radius of the wheels 
$r_0$ and the mass of the two wheels m. Find the angular velocity of precession and the 
total kinetic energy of the system. Why does the bicycle rider not fall due to the action 
of the gravitational couple? 

14. Give reasons why 

(i) any elastic body which is capable of producing homogeneous elastic deformation 
has 12 degrees of freedom, 

(ii) isotropic solids have only two independent elastic constants and 

(iii) P-wave component of the earthquakes should always travel faster than the S-wave 
component. 

Find the rotation, dilation and shear due to an elastic displacement function $\mathbf{u} = 
\epsilon[(2 - 3x + 4y - 2z)\mathbf{i} + (1 + 2x - 5y + 7z)\mathbf{j} + (x - 2y - 2z)\mathbf{k}]$, $\epsilon$ being a small 
dimensionless quantity. 
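
The decomposition asked for can be sketched numerically. The coefficients below are an assumed reading of the displacement field u = ε[(2 - 3x + 4y - 2z)i + (1 + 2x - 5y + 7z)j + (x - 2y - 2z)k]; all results are in units of ε. The gradient matrix is split into its symmetric part (strain), the rotation vector (one half of curl u) and the trace (dilation).

```python
# G[i][j] = d u_i / d x_j in units of epsilon (assumed reading of the problem)
GRADIENT = [[-3.0,  4.0, -2.0],
            [ 2.0, -5.0,  7.0],
            [ 1.0, -2.0, -2.0]]

def decompose(G):
    """Return (strain tensor, rotation vector, dilation) of a displacement gradient."""
    strain = [[(G[i][j] + G[j][i]) / 2 for j in range(3)] for i in range(3)]
    rotation = ((G[2][1] - G[1][2]) / 2,      # (1/2) curl u, x component
                (G[0][2] - G[2][0]) / 2,      # y component
                (G[1][0] - G[0][1]) / 2)      # z component
    dilation = G[0][0] + G[1][1] + G[2][2]    # div u
    return strain, rotation, dilation

strain, rotation, dilation = decompose(GRADIENT)
print("dilation:", dilation)
print("rotation:", rotation)
print("shear components e_xy, e_xz, e_yz:", strain[0][1], strain[0][2], strain[1][2])
```

The constant terms (2, 1, 0) of the displacement represent a rigid translation and do not enter the gradient.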

15. (i) Explain the significance of Bernoulli’s theorem for fluid motions. 

(ii) Apply it to find the speed of gravity waves propagating in a deep ocean and having 
a wavelength of 100 m. 

(iii) Draw the pattern of the planar, steady and irrotational flow of an incompressible 
fluid given by the complex potential $f(z) = z^2$, where $z = x + iy$. 





APPENDIX C 


Hints and Answers to 
Selected Problems 


Introduction 

1.1 Solve for the four unknowns in four independent dimensional equations. The Planck mass $m_P = 
\sqrt{\hbar c/G} \sim 10^{-8}$ kg, Planck length $l_P = \sqrt{G\hbar/c^3} \sim 10^{-35}$ m, Planck time $t_P = l_P/c$, and Planck 
temperature $T_P = m_P c^2/k$. All the wavelengths are comparable to one another. 
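
The orders of magnitude quoted in this hint can be reproduced with a few lines. This is a sketch using rounded SI values of the constants; the variable and function names are chosen here for illustration.

```python
import math

HBAR = 1.054571817e-34   # J s
C = 2.99792458e8         # m/s
G = 6.67430e-11          # m^3 kg^-1 s^-2
K_B = 1.380649e-23       # J/K

def planck_units():
    """Return (mass, length, time, temperature) built from hbar, c, G and k."""
    mass = math.sqrt(HBAR * C / G)
    length = math.sqrt(G * HBAR / C**3)
    time = length / C
    temperature = mass * C**2 / K_B
    return mass, length, time, temperature

m_p, l_p, t_p, T_p = planck_units()
print(f"m_P = {m_p:.3e} kg, l_P = {l_p:.3e} m, t_P = {t_p:.3e} s, T_P = {T_p:.3e} K")
```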

1.2 Solve for the motions of the car and ball and show that they meet. If wheels have got appreciable mass, 
it shares some kinetic energy of rotation, so the car will proceed slower than the ball along the incline. 

1.3 Set up the equations of motion of the chair and of the rider separately, assuming masses m and M for 
the chair and rider respectively, and T as the tension in the rope and a as the acceleration of the rider. 
Eliminate T to get $N = (M - m)(a + g)/2$. For upward acceleration, we need $M > m$, but if $m > M$, 
the rider will be lifted out of the chair. 

1.4 Set up the equation of motion of the massive object in terms of tensions in two segments, $T_1 - (T_2 + 
Mg) = Ma$. Now, $a \approx 0$ for quasistatic pull, but for sudden jerk, it is possible to produce $a < -g$, and 
hence the results. 

1.5 The spring term is 0 for $r < L$, the equilibrium length, and is $-K(r - L)$ otherwise. Solve the equations 
of motion. In x and y, the motion is simple harmonic about the origin, but for the z-axis, the motion is that 
of a displaced harmonic oscillator. 

1.6 Collision means identical position vectors at some instant t, and this leads you to arrive at a vector 
condition of the form $\mathbf{a} = \mathbf{b}\,t$. Define a vector product in order to eliminate t. 

1.7 You ought to get an answer virtually independent of the original height of dropping. Terminal velocity 
$v_T$ = 13 m/s. After bounce it has to again encounter drag forces and the height is reduced by a factor of ln 
2, giving the final height at bounce 5.4 m. 

1.9 1.3 watt 

1.10 First, from the given temperature difference, calculate the radiation loss per day using Stefan’s law of 
radiation. (In section 12.28.1 the surface area of human beings is given.) Second, calculate the total heat of 
evaporation of sweat per day. Now compare the sum with the total calorie value of the food you take every 
day. From your daily intake of calorie, calculate how many grammes of carbon can be burned, which becomes 
$\mathrm{CO_2}$ on oxidation, e.g., for 2000 kcal, the total amount of oxygen to be inhaled is about 16.6 moles, for which 
so many (?) litres of air to be inhaled? The required lung capacity turns out to be about 270 cc. 

1.11 One million volts! Find the mass of rain drop. Every 18 gm of water contains Avogadro’s number of water 
molecules. Each water molecule contains how many electrons and hence how much total negative charge? The 
earth receives the equivalent amount of positive charge, and then apply the formula for the gain in potential 
of a uniformly charged spherical conductor. 

1.12 This problem can be formulated and solved in three different ways. Each turn of the tape can be viewed 
as a term of an AP series, or as a differential increment in length as well as in thickness of the tape (that is, 
handle the problem through formulating integrals), or by simply considering the increase in the volume of the 
disk as the tape winds. Ans. The radii are 1.80 cm and 2.26 cm. 






1.13 This problem is given in order to test your ability to formulate a differential equation. Set up the 
differential equation in the form of dv/dt + kv = kat and solve it. 

1.14 Don’t be afraid of black holes. You may learn a lot of black hole thermodynamics from this problem. You 
have the temperature and surface area of the black hole. Use Stefan’s law of radiation to calculate the power 
of emission of radiation. Energy loss also means mass loss ($E = Mc^2$). Written fully, it is nothing but a 
differential equation in M, whose solution will give you the lifetime of the black hole $t_0 = 5120\pi G^2 M^3/\hbar c^4$. 

1.15 Take the mass-radius relation of black holes from the previous problem. The density of the universe 
$\rho = 3c^2/8\pi G R^2 \sim 8 \times 10^{-27}$ kg/m$^3$, and the total number of galaxies about 2 x 10 1 . 

1.16 The collapse time is $t = \sqrt{3\pi/32G\rho}$. If this problem is inverted, it corresponds to the big bang, instead 
of big crunch. The age-density relation for the hot big bang universe is just the same as the above one! So, 
no wonder, you would get the age of the universe if you plug in the present mass density of the universe in 
the above formula. Ans. 29.5 min, 14.9 min, 19.2 min and 21 billion years. 
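
The collapse-time formula is easy to evaluate for any density. The sketch below assumes the free-fall form t = sqrt(3π/32Gρ) quoted in this hint; the sample density of 1e-26 kg/m³ is an assumed illustrative value for the present universe, not a figure from the problem.

```python
import math

G = 6.674e-11                 # m^3 kg^-1 s^-2
SECONDS_PER_YEAR = 3.15576e7  # Julian year

def collapse_time(density):
    """Free-fall collapse time (in seconds) of a uniform sphere of the given density."""
    return math.sqrt(3 * math.pi / (32 * G * density))

age_years = collapse_time(1e-26) / SECONDS_PER_YEAR
print(f"collapse time for 1e-26 kg/m^3: {age_years:.2e} years")
```

The time depends on the density alone, not on the size of the collapsing body.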

1.17 The angular momentum remains conserved, $I_1\omega_1 = I_2\omega_2$, before and after collapse. During the 
collapse, gravitational energy is released. The minimum radius of the contracting sun will be achieved when 
the centrifugal force becomes strong enough to completely neutralise the force due to gravitational attraction 
on the equator. This gives the radius of the pulsar to be $R > 4\pi^2 R_\odot^4/GM_\odot T_\odot^2 \sim 15$ km, and its period 
$T > R^2 T_\odot/R_\odot^2 \sim 1$ millisecond. The increase in rotational kinetic energy is by a factor of 2 x 10 . 
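
The numbers in this hint can be reproduced with the short sketch below. It assumes present-day solar values (rotation period of about 25 days, radius 6.96e8 m, mass 1.989e30 kg) and a mass distribution that scales uniformly, so that the moment of inertia goes as R². These inputs are assumptions chosen for illustration.

```python
import math

G = 6.674e-11
M_SUN = 1.989e30           # kg
R_SUN = 6.96e8             # m
T_SUN = 25 * 86400.0       # s, assumed solar rotation period

def minimum_radius():
    """Radius at which equatorial centrifugal force balances gravity."""
    return 4 * math.pi**2 * R_SUN**4 / (G * M_SUN * T_SUN**2)

def period_at_radius(radius):
    """Rotation period after collapse, from I*omega = const with I ~ R^2."""
    return T_SUN * (radius / R_SUN) ** 2

r_min = minimum_radius()
t_min = period_at_radius(r_min)
ke_factor = T_SUN / t_min   # KE = L*omega/2 at fixed L, so KE scales as omega
print(f"R = {r_min / 1e3:.1f} km, T = {t_min * 1e3:.2f} ms, KE factor = {ke_factor:.2e}")
```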

1.18 Since all the forces involved are conservative in nature, you can solve the problem by setting up the 
energy equation. On hitting the ground the stone loses all its energy content, about 5.85 cal. 

1.19 Solve the equations of motion in (x, y) coordinates, eliminate t, and express $v_0^2$ as a function of $\theta_0$, h 
and L. Minimise $v_0$ with respect to $\theta_0$, which gives you the required relation. 

1.21 Power = magnitude of force x speed. Power lost to combat the force of drag with water during swimming 
is ~ 150 W, but the swimmer actually loses more than 10 times that for coordinating the limbs for swimming. 

1.22 It is a lengthy calculation, but will surely give you an opportunity to recapitulate certain integrations 
and tricky algebra. Best of luck! 

1.23 The effect of rotation is to effectively increase the mass! This is the lesson, if you have not already 
realised. Ans. 253 J. 

1.24 However quietly you step on the platform of any spring type weighing machine, the initial deflection 
is always twice the one that it will finally settle in! Very strange, isn’t it? Solve the problem first for h = 
1 m, and get the correct answer x = 0.221 m. The general formula is a solution to the quadratic equation 
$x^2 - 2x_0(x + h) = 0$ (this comes from the energy equation). 
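
The quadratic in this hint can be solved directly. In the sketch below x₀ is read as the final (static) deflection of the spring; the value x₀ = 0.02 m is not given in the hint and is assumed here only because it reproduces the quoted answer for h = 1 m.

```python
import math

def max_deflection(x0, h):
    """Positive root of x**2 - 2*x0*(x + h) = 0."""
    return x0 + math.sqrt(x0**2 + 2 * x0 * h)

print(max_deflection(0.02, 1.0))   # dropped from h = 1 m
print(max_deflection(0.02, 0.0))   # stepping on gently, h = 0
```

For h = 0 the root is exactly 2x₀, which is the "twice the final deflection" statement made in the hint.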

1.25 Carbon. Find the mass ratio of the target and the projectile from the given data. 

1.27 This is a practical method of isotope separation followed in most nuclear plants. Heavier isotopes 
experience higher centrifugal force and hence have lower centrifugal potential energy. In gas phase with an 
equilibrium thermal distribution at a given temperature, the particles with higher energy must have lower 
density (Boltzmann’s $e^{-E/kT}$ rule). So towards the rim of the centrifuge, hexafluorides of $\mathrm{U}^{238}$ will be more 
populated than those of $\mathrm{U}^{235}$. The abundance ratio of isotopes at radius r is given by 



Since $(N_1/N_2)_0 = 139 : 1$, $m_1 = 352$ amu and $m_2 = 349$ amu, the enrichment factor for the heavier isotope 
near the wall is 1.27 (in a single run). So the higher isotope fraction should be separated about 20 times in 
succession in the centrifuge in order to obtain a mixture containing up to 80% of the light uranium isotope. 

1.28 Change in the length of the spring $\Delta l = 4a\sin^2(\alpha/2)$. Work done for the compression of the springs must 
equal the work done by the centrifugal forces that have moved the weights through a distance of $a \sin\alpha$, 
thus giving $\tan(\alpha/2) = \sqrt{m\omega^2/2k}$. You could also set up the force equations for the balls of the governor, the 
component forces being the centrifugal force, the tension in the links and the elastic forces in the spring (the 
governor is rotating with constant angular speed with constant $\alpha$). The equilibrium corresponds to the zero force 
condition. Since the expression does not contain g, the acceleration due to gravity, the device should equally 
work in the condition of weightlessness. 







1.29 (i) $v = \sqrt{2mgh/(m + I/r^2)}$, (ii) $I_1 = mr^2/2$, $I_2 = m(r_1^2 + r_2^2)/2$, (iii) $t = 2h/(v \sin\alpha)$, giving 
$t_1$ = 0.78 s and $t_2$ = 0.88 s. 

1.30 Preventing from sliding downward means the frictional force is acting upward. Equate the balancing 
forces to obtain $\omega^2 = g\tan(\alpha - \phi)/R\sin\alpha$, where $\tan\phi = \mu$. Note that preventing from sliding upward 
would give you another condition on $\omega$. What would be the state of the object if the value of $\omega$ were set in 
between? 

1.31 Just at the time when the drop breaks from the wire, force due to surface tension must equal the weight 
of the drop. Ans. 34 cm 

1.32 Write down the equation of motion of the centre of mass (include the force of friction ($-f\,\mathbf{i}$) and the 
force of reaction ($N\,\mathbf{j}$), where $f = \mu N$). Write also the torque equation for the cm and solve for $\omega(t)$. Show 
that the motion of cm reverses at time $t_1 = v_0/\mu g$. Then sliding stops and rolling begins at, say, time $t_2$, 
when $\omega R = v$ = instantaneous velocity of cm. Now show that $t_2 > t_1$ is possible only if $\omega_0 > v_0/R$. 

1.33 At the two lines of contact, calculate the normal forces and the respective forces of friction from which 
the net torque can be calculated. Form the torque equation and solve it. Remember that the frictional force 
developed at the vertical line of contact changes the normal force at the horizontal line of contact, and vice 
versa. So you either apply the principle of balancing forces, or end up in summing two series of terms. Try 
both the methods. 

1.34 Cylinder will detach if the component of the centrifugal force, in the vertical direction, exceeds its weight. 
Use the principle of energy conservation in order to calculate the magnitude of the centrifugal force, while 
rolling over the edge. The energy equation for rolling over the edge gives $v^2 = v_0^2 + 4gR(1 - \cos\theta)/3$. 

1.35 Find the condition for detachment and use energy conservation ($\Delta T = -\Delta V$) or balance of forces. Note 
that the absolute angular speed of the rolling ball is not the same as the one inferred from the motion with 
respect to the point of contact. 

1.36 Set up the equation of motion from the fact that at any instant only a part of the chain is accelerated 
uniformly due to the driving force provided by the weight of another part of the chain. You must express 
acceleration in terms of $dv/dx$ and integrate the equation of motion. Ans. $v = \sqrt{2gh\ln(l/h)}$. 

1.37 Apply Eq. (1.27) for the variable mass problem. Since the magnitude of force is constant and acts in a 
direction perpendicular to the motion, the curvature of the path is constant. Ans. $\alpha = (u/v_0)\ln(m_0/m)$. 

1.38 Use Eq. (1.27) with $\mathbf{F}_{ext} = 0$, $-mg\,\mathbf{k}$, $-\mathbf{k}\,GMm/z^2$, for the three cases. 

1.39 It is a problem of evolute and involute. See Appendix A2 of the book. Ans. $t = l^2/2v_0R$. 

1.40 If the bit of mud leaves the rim at height y from the ground and with the vertical component of the 
speed $v_y$, the maximum height it gains is $h = y + (v_y^2/2g)$. Now maximise h with respect to the initial 
location, say $\theta$. Why does $h \to \infty$ as $v_0 \to 0$? There comes the critical speed $v_0 = \sqrt{bg}$. 

1.41 During hitting, the angular momentum about the hinge remains conserved. Maximum swing can be 
obtained from the energy balance. Answer to part (a): $v \approx (M/m)\sqrt{2gl/3}\,\sin(\alpha/2)$, part (b) $mvx(mx + 
Ml/2)/(mx^2 + Ml^2/3)$. The impact transfers momentum to the support through the reaction at the fulcrum. 

1.43 Curvature $\kappa = 2bs/a^3$ and the total acceleration $= a\sqrt{1 + (4bs^2/a^3)^2}$. 

1.44 If $v_0$ and $v$ are the velocities of the small ball before and after collision with the big ball, then 
$A = v/v_0$, where $A^2$ is the height amplification factor. Speed before bouncing $v_0 = \sqrt{2gh}$; the larger ball 
bounces with a speed $V_1 = e_1 v_0$; momentum balance after collision with small ball on top of the large ball gives 
$e_1 M v_0 - m v_0 = MV + mv$, and their relative velocities before and after collision must obey the relation 
$v - V = e_2(V_1 + v_0)$, by definition of the coefficient of restitution. Thus, if $e_1 = e_2 = 1$, and $m \ll M$, 
$A = v/v_0 = 3$. For three balls in succession, show that $A = 7$. 
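
The two relations in this hint can be solved for A in closed form, as sketched below. Eliminating V from the momentum balance and the restitution relation gives A = [e₁ + e₂(1 + e₁) - m/M]/(1 + m/M). The second function assumes the idealised limit of perfectly elastic balls, each far lighter than the one below it; both function names are hypothetical.

```python
def amplification(e1, e2, mass_ratio):
    """Speed amplification A = v/v0 for two stacked balls; mass_ratio = m/M."""
    return (e1 + e2 * (1 + e1) - mass_ratio) / (1 + mass_ratio)

def ideal_chain_amplification(n_balls):
    """Elastic, strongly graded stack: each collision maps v_below -> 2*v_below + v0."""
    speed = 1.0                      # lowest ball rebounds from the floor at v0
    for _ in range(n_balls - 1):
        speed = 2 * speed + 1.0
    return speed

print(amplification(1.0, 1.0, 0.0))      # two balls, m << M
print(ideal_chain_amplification(3))      # three balls
```

As a sanity check, equal masses with elastic collisions give A = 1: the two balls simply exchange velocities.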

1.45 Express $F_{linear}$ and $F_{quadratic}$ in terms of v and respective terminal speeds $v_T$, and show that their ratio 
is just $v/v_T$, and since $v < v_T$, hence the conclusion. 

1.46 Show that $dg(r)/dr = 4\pi G\{\rho(r) - 2\bar{\rho}(r)/3\}$, $\bar{\rho}(r)$ being the mean density inside radius r, from which the result follows. 

1.47 Show that at a depth z, the gravity anomaly Ag(z) = 4wGoopa{l + 0.5A(1 - e"*^}. 

1.48 Characteristic vectors for diagonalizing M represent the lab to cm transformation of velocities. 







1.49 Calculate the work done by frictional forces. 

1.50 The matrix solution $e^{At}$ represents the most general solution for forced damping, and as $\omega \to \lambda$, the 
solution approaches that in the limit of critical damping. 


Chapter 1 

1.1 (i) $x_1^2 + y_1^2 + z_1^2 = x_2^2 + y_2^2 + z_2^2 = R^2$, $(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2 = l^2$ 

(ii) d(0P) + d(O'P) = l» - l 

(iii) d(OP) = R(t): holonomic, rheonomic, nonconservative and bilateral. In the second case, equality is 
replaced by >, which makes the constraint unilateral. 

(iv) See the problem solved in section 2.16 

(v) Each hinge acts as a centre with its emerging stem as the radius of the sphere over which its other end 
of the stem can freely move. The subtle difference between a coiled and a straight filament is that when 
rotated about their axes the former changes configuration but not the latter. So the latter would have an 
extra constraint relation compared to the former. 

(vi) At the hinge point, the motion of the crank is planar (instead of spherical when compared to the hinges 
of the previous problem), which means one constraint relation for this hinge. 

1.2 $\sum_i m_i\ddot{\mathbf{r}}_i \cdot d\mathbf{r}_i = dT$, hence the result. 

1.3 If you have solved problem 1.35, translate it into Cartesian coordinates. 

1.4 Write down the constraint relation in 2-D Cartesian coordinates and proceed. 

1.5 The first part is solved in many text books. For the second part, force of sliding friction is involved. 
Without slipping, the loss in pot. energy = gain in translational and rotational kin. energy. But during 
slipping, a constant frictional torque acts, the work done by the torque is simply the product of the torque 
and the net angular slip. The angular slip can be calculated from solving the rotation under constant torque. 

1.6 The length of the wire is manipulated to change periodically. The constraint is rheonomic. Virtual 
displacement will not have any radial component. Take the general expression for acceleration $\mathbf{a} = \ddot{\mathbf{r}}$ in, say, 
spherical polar coordinates, $\mathbf{F}_{ext} = -mg\mathbf{k}$, and then apply D'Alembert’s principle $(\mathbf{F}_{ext} - m\mathbf{a}) \cdot \delta\mathbf{r} = 0$. 
Now in this equation substitute $r = a + b\cos\omega t$. 

1.7 You may consult Whittaker’s book. 

1.8 Since $\mathbf{r}(\lambda)$ is given, set up Lagrange’s equation of the first kind, by doubly differentiating $\mathbf{r}(\lambda)$ with 
respect to time and so on, and arrive at an equation of the form $\ddot{\lambda}$ = constant. Obtain a solution in the form 
$\lambda = \lambda(t)$, and substitute in $\mathbf{r}(\lambda)$ in order to get the complete solution. Repeat the procedure for the second 
example. 


Chapter 2 

2.1 The number of constraints can be found from the hints to 1.1, and hence the number of DOFs. Wherever 
constant distances occur as constraints, one may use polar coordinates as generalised coordinates. 

2.2 The concept of DOF of molecular systems is extremely useful in thermodynamics of gas and of phase 
transitions. Remember that helium is monatomic, nitrogen is diatomic and a water molecule triatomic. The 
number of DOFs released during vaporisation is 3, 5 and 6 respectively. Heat of vaporisation includes the effect 
of releasing the molecular bonds between molecules (the bond energy ~ 5 eV/bond), the effect of releasing 
the DOFs (translation and rotational), and the work done by the vapour while rapidly expanding its volume 
under constant pressure. 

2.3 Start from D’Alembert’s principle. $\delta T + Q_i\delta q_i = d(p_i\delta q_i)/dt$. 

2.4 Convert into generalised coordinates in steps quite similar to ones you follow in order to derive 
Euler-Lagrange’s equations of motion. Instead of T, you come up with S; both are initially defined in Cartesian 
coordinates only. The equivalence of the two will become obvious. 







2.5 Do this problem in order to have clear idea of generalised forces. 

2.6 Define the usual generalised coordinates $\theta$ and $\phi$ for the double pendulum. Find the expressions for 
generalised forces from infinitesimal virtual works due to virtual displacements $\delta\theta$ and $\delta\phi$. 

2.7 (i) $L = (m_1 + m_2)\dot{r}^2/2 + m_1 r^2\dot{\theta}^2/2 - m_2 g(r - l)$ 

(ii) $L = M\dot{x}^2/2 + m(\dot{x}^2 + l^2\dot{\theta}^2)/2 - ml\dot{x}\dot{\theta}\cos\theta + mgl\cos\theta$ 

(iii) 2 DOF, say the angle $\phi$ made at the centre of the wire by the instantaneous position of the bead on the 
wire and the rotating line joining the centre of rotation (a point on the wire) and the centre of the wire, and 
the angle $\theta$ made by the rotating centre at the hinge of rotation with respect to a fixed direction in space. 
Show that $L = T = \frac{1}{2}mR^2\{(\dot{\theta} + \dot{\phi})^2 + \dot{\theta}^2 + 2\dot{\theta}(\dot{\theta} + \dot{\phi})\cos\phi\}$. 

(iv) A nonholonomic case with 2 DOF and 2 nonholonomic constraints in 4 generalised coordinates, say (x, y) 
the Cartesian planar coordinates of the cm, $\phi$ the coordinate angle of rotation in the plane of the disc, and 
$\psi$ the coordinate angle that defines the orientation of the plane of the disc with respect to the space-fixed 
x-axis. The constraint relations are $\dot{x}^2 + \dot{y}^2 - R^2\dot{\phi}^2 = 0$, and $\dot{x}\sin\psi - \dot{y}\cos\psi = 0$. This gives the 
Lagrangian as $L = (I\dot{\phi}^2 + J\dot{\psi}^2)/2 + m(\dot{x}^2 + \dot{y}^2)/2$, I and J being moments of inertia about the axes 
defining the angles $\phi$ and $\psi$ respectively. The final solution corresponds to the displaced circular orbit of the 
CM. 

(v) Can be found in many text books. 

2.8 The problem is not trivial, particularly if you take the gradual drop in the height of cm of the winding roll 
from the surface of the incline. This will also contribute to both kinetic and potential energy terms. Divide 
the whole length of the tape into two parts: unwound and to be unwound. The remaining number of turns 
(n) and the length ($L_2$) to be unwound are related to the thickness of the tape k by a relation $L_2 \simeq \pi n^2 k$. 
Follow the moving location of the cm both along the incline (x-axis) and perpendicular to it (y-axis). Note 
that unfolding of the tape is possible even when o is negative with cr > tan _l (3/2jrn e ). 

2.9 The inclusion of the rotational kinetic energy term in the problem leads to the expected difference. 
$2L = m(R - r)^2\dot{\theta}^2 + IR^2\dot{\theta}^2/r^2 - 2mg(R - r)(1 - \cos\theta)$, $\theta$ being the angle of the bob with the vertical. 

2.10 At some stage expand in terms of a small parameter, say $\alpha = k/2m_0c^2$, and finally obtain the period 
$P \approx 2\pi\sqrt{m_0/k}\,(1 + 3\alpha a^2/8)$, a being the amplitude of oscillation. 
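
The first-order result can be checked against a direct quadrature, as in the sketch below. It assumes energy conservation in the form γm₀c² + kx²/2 = const, so that γ = 1 + α(a² - x²); with x = a sin φ the quarter-period integral becomes regular. The period is expressed in units of 2π sqrt(m₀/k), and eps stands for αa². Function names are hypothetical.

```python
import math

def period_ratio_numeric(eps, n=2000):
    """P / (2*pi*sqrt(m0/k)) by midpoint quadrature; eps = alpha * a**2."""
    step = (math.pi / 2) / n
    total = 0.0
    for i in range(n):
        c2 = math.cos((i + 0.5) * step) ** 2
        # integrand of the quarter period after the substitution x = a*sin(phi)
        total += (1 + eps * c2) / math.sqrt(1 + 0.5 * eps * c2)
    return 4 * total * step / (2 * math.pi)

def period_ratio_series(eps):
    """First-order expansion quoted in the hint."""
    return 1 + 3 * eps / 8

print(period_ratio_numeric(0.01), period_ratio_series(0.01))
```

The two values differ only at second order in αa², as expected from a first-order expansion.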

2.11 DOF = 1, $\theta$ = polar angle with respect to the vertical axis of rotation. $V = -gr(1 - \cos\theta)$, $T = 
r^2(\dot{\theta}^2 + \omega^2\sin^2\theta)/2$. Hence, $\dot{\theta}(\partial L/\partial\dot{\theta}) - L$ = constant $\neq T + V = E$. 

2.12 If necessary, consult Greenwood’s book. 

2.13 You must know how to differentiate a definite integral with respect to a variable that occurs as one of 
its limits. This complicated looking Lagrangian gives the same equation of motion as that for a 1-D simple 
harmonic oscillator. 

2.14 The force of friction guides the translational motion of the cm; it also provides the torque for rolling; 
and rolling without slipping obeys the standard nonholonomic constraint relations. With the help of these 
three vector equations, find the frictional force to be given by $\mathbf{F} = \{mk^2/(1 + k^2)\}\,\boldsymbol{\omega} \times \mathbf{v}$, k being the 
radius of gyration of the ball. Then show that motion under this F is always circular. In the mid 1940’s, 
Einstein was asked about this problem, namely the motion of a marble on a turntable. A careful experiment 
was performed in 1979. It is now your turn to work out the theory. 

2.15 1 DOF, say the angle of swing ($\theta$) with respect to the vertical. The length of suspension l(t) is variable: 
the constraint is rheonomic. The Lagrangian $L = m(\dot{l}^2 + l^2\dot{\theta}^2)/2 - mgl\cos\theta$. In the equation of motion $\dot{l}$ 
controls the effect of damping. When $\dot{l}$ is negative, it becomes a case of negative damping, implying that the 
oscillation should increase with time. In a child’s swing, pulling of the cord at the right time will increase its 
amplitude of oscillation. 

2.16 For any closed system in n-dimensional space, the number of energy integrals = 1, number of angular 
momentum integrals (each for two axes at a time) = $^nC_2 = n(n - 1)/2$, number of linear momentum integrals 
= n and number of centre of mass motion integrals = n. 

2.17 Potentials are either given or can be formulated. You have to construct the kinetic energy terms either 
in spherical polar, or cylindrical polar coordinates, as it seems appropriate. It would be fun to derive these 
integrals of motions, not all being trivial. If you cannot derive them by the Lagrangian method, simply 
differentiate with respect to time and show that they vanish. 







2.18 Use the expressions for transformation of generalised momenta and Jacobi integral under generalised 
coordinate transformations, namely $J' = J - p_j\,\partial q_j/\partial t$, and $P_i = p_j\,\partial q_j/\partial Q_i$. 


Chapter 3 

3.1 The problem is similar to problem 2.14, but here it is a case of sliding with negligible friction. Start 
with the general equation of motion given by Eq. (3.6). The general solution in terms of complex numbers: 
$z(t) = \{z(0) + v_0 t + z(0)\,i\omega t\}\exp(-i\omega t)$. If the particle is thrown from the centre of the disc with initial 
speed $v_0$, the track on the disc would thus appear as a spiral. 

3.2 Represent rotation by a matrix of transformation. Obviously, $a_{ij}b_{jk} \neq b_{ij}a_{jk}$ even though both the 
matrices are orthogonal. But for infinitesimal transformations, x\ = Now demonstrate the 

commutability of two infinitesimal rotations. For the rest of the problem, all necessary guidelines are given in 
the text of the problem. 
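
The contrast between finite and infinitesimal rotations can be seen numerically. The sketch below composes a rotation about the x-axis with one about the z-axis in both orders and reports the largest element of the difference; the choice of axes and the helper names are assumptions made for illustration.

```python
import math

def rot_x(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def rot_z(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def commutator_size(angle):
    """Largest |element| of Rx*Rz - Rz*Rx for equal rotation angles."""
    ab = matmul(rot_x(angle), rot_z(angle))
    ba = matmul(rot_z(angle), rot_x(angle))
    return max(abs(ab[i][j] - ba[i][j]) for i in range(3) for j in range(3))

print(commutator_size(math.pi / 2))   # finite rotations: order matters
print(commutator_size(1e-4))          # tiny rotations: difference is second order
```

The difference shrinks as the square of the angle, which is why rotations commute to first order in the infinitesimal limit.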

3.3 Set 0 = 0, and rename the variables t>, t and 2u as R. 6 and - » respectively. 

3.4 In order to calculate the kinetic energy of the system, one has to find out the true inertial velocities of the 
two point masses of the dumbbell. Apply the most general formula (3.33) to the case, and deal the problem 
in spherical polar coordinates about both cm of the dumbbell and the centre of the circular track. 

3.5 The conservation of angular momentum gives $m\omega_0(R + h)^2 = m\omega(R + h - gt^2/2)^2$, which on integration 

yields 9 = jjudta u a T -f /3JL Hence the result. 

3.8 Use the formulae (3.33) and (3.34). For example, for the tip of instantaneously located horizontal blade 
($\mathbf{r} = r_0\mathbf{j}$, $\mathbf{R} = R_0\mathbf{i}$, $\mathbf{i}$ pointing towards the outward axis of blades and $\mathbf{k}$ vertically upward), the inertial 
velocity and accelerations are 

t» 0 = u 0 Ro cos, Sit] - u 0 r 0 cos Sit] + u/\ r„t, and 

a,, = (u> 0 ilr 0 sin {It - w^Ro cos 2 ftt)2 - (u 2 r 0 coa 2 ilt + wjr 0 )J 

For the tip of an instantaneously located vertical blade (r = r a k and R = R 0 i) 

»o = (wofioCosQt - u>\r 0 )j. and o„ = (2uo»ir 0 co &fit - ui 2 # 0 co8ftt)» - w\r a k 

3.9 These are Eulerian rotations. Try without looking at section 12.21 

3.10 The KE does not change in this example, whereas in Foucault’s pendulum, the average KE is one half 
of the maximum speed at the centre. However, you should solve the equation of motion, which just contains 
the Coriolis term. 


Chapter 4 

4.1 This problem is often referred to as ‘Jack and the sky hook’, because of its similarity with ‘Jack and the 
bean stalk’, the bean stalk appearing to be self-supported. So Jack must be located on the equator, and the total 
centrifugal force applied over the entire bean stalk has to balance its total weight. Since gravity changes with 
distance as $r^{-2}$ and the centrifugal force as $\omega^2 r$, the length of the bean stalk is obtained by solving for L in 
the equation $GM\{R^{-1} - (R + L)^{-1}\} = \omega^2(L^2 + 2RL)/2$, R and M being the radius and mass of the 
earth. The tension T would satisfy the differential equation $dT/dr = \mu\ \times$ the net downward 
force per unit mass, $\mu$ being the mass per unit length of the rope. For free support, the tension should vanish 
at both the ends of the rope. Similarly, calculate the total energy E = KE + PE. You would get E > 0, 
which means that given a slight perturbation, the rope will finally escape! 
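
The length equation in this hint reduces to a quadratic. Writing x = R + L and dividing by L gives x(x + R) = 2GM/(ω²R), which the sketch below solves for the positive root. The values of GM, R and ω for the earth are standard rounded figures assumed here.

```python
import math

def beanstalk_length(gm=3.986e14, radius=6.378e6, omega=7.292e-5):
    """Length L solving GM*(1/R - 1/(R+L)) = omega**2 * (L**2 + 2*R*L) / 2."""
    rhs = 2 * gm / (omega**2 * radius)
    x = (-radius + math.sqrt(radius**2 + 4 * rhs)) / 2   # x = R + L
    return x - radius

L_rope = beanstalk_length()
print(f"L = {L_rope / 1e3:.0f} km")
```

Under these assumptions the rope comes out at roughly 144,000 km long.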

4.2 Take the origin at the centre of mass and proceed. Express the equations of motion of each body in terms 
of the position vector of that body. $E_2 = m_1E_1/m_2$, $L_2 = m_1L_1/m_2$. 

4.3 (a) Use Eqs (4.15) and (4.16). In the second case, you have to derive the equations of motion, satisfying 
the given constraint and then an expression for $V_{eff}(r)$, in plane polar coordinates. Ans. r < the 
famous golden ratio. For further discussion read Hestenes’s book. 

(b) The equation for a vertically placed cone (apex pointing downward) with semivertex angle $\alpha$ is given by 
$r = z\tan\alpha$, kinetic energy $T = \frac{1}{2}m(\dot{r}^2 + r^2\dot{\theta}^2 + \dot{z}^2)$, potential energy $V = mgz$, angular momentum 
$h = mr^2\dot{\theta}$. Find $V_{eff}(r)$ and analyse the stability of circular orbits. 

4.4 For the origin at a point on the circumference of a circle, the equation of the circle in plane polar coordinates 
becomes $r = 2a\cos\theta$. Find the angular momentum integral $H = r^2\dot{\theta}$, and show that the radial component 
of acceleration $f(r) = \ddot{r} - r\dot{\theta}^2 = -8a^2H^2r^{-5}$, an attractive force varying as the inverse of the fifth power 
of the radial separation. Find that the total energy = KE + PE = 0! What does it mean? 

4.5 Some more examples similar to the above one. Exactly similar procedure, (i) inverse cubic, (ii) inverse 
quintic power law of radial distance. 

4.6 (i) $\mathbf{j} = 2KHr^{-5}(dr/d\theta)\,\hat{\mathbf{r}} - KHr^{-4}\,\hat{\boldsymbol{\theta}}$ 

(ii) $\cos\theta_m = (-1 \pm \sqrt{1 + 48e^2})/8e$ 

(iii) The sidereal period of revolution $P = S/(S - 1)$, for superior planets (a > 1 AU), and $P = S/(S + 1)$, 
for inferior planets, S being the synodic period (= the time interval between successive return of any planet 
with respect to the moving sun-earth line). Use Kepler’s third law. 
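
Part (iii) can be turned into a small calculator. The sketch assumes periods measured in years (so the earth's period is 1) and uses Kepler's third law in the form a³ = P² with a in AU. The synodic periods used in the example, about 2.135 yr for Mars and 1.599 yr for Venus, are approximate values assumed for illustration.

```python
def sidereal_period(synodic, superior):
    """Sidereal period in years from the synodic period in years."""
    if superior:
        if synodic <= 1.0:
            raise ValueError("a superior planet has a synodic period above 1 year")
        return synodic / (synodic - 1.0)
    return synodic / (synodic + 1.0)

def semi_major_axis(period_years):
    """Kepler's third law, a in AU for P in years."""
    return period_years ** (2.0 / 3.0)

p_mars = sidereal_period(2.135, superior=True)
p_venus = sidereal_period(1.599, superior=False)
print(p_mars, semi_major_axis(p_mars))
print(p_venus, semi_major_axis(p_venus))
```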

4.7 Use the standard conic equation for parabola, and show $\dot{\theta} = Hr^{-2} = 4\sqrt{K/p^3}\cos^4(\theta/2)$, p being the 
semilatus rectum of the parabolic orbit. Integrate this equation to obtain the time spent by the comet inside 
earth's orbit, $t = \frac{2}{3}\sqrt{(2r - p)/K}\,(p + r) \sim 76.6$ days for the given problem. 

4.8 (a) Differentiate Eq. (4.43), substitute in Eq. (4.24), use Eq. (4.40), eliminate v and r by means of Eq. 
(4.42), in order to obtain $2\pi\,dt/P = (1 - e\cos E)\,dE$, which on integration gives Kepler’s equation. 

(b) $\nu = g + 2e\sin g + \frac{5}{4}e^2\sin 2g + \frac{1}{12}e^3(13\sin 3g - 3\sin g) + \frac{1}{96}e^4(103\sin 4g - 44\sin 2g) + \cdots$, 

$E = g + e\sin g + \frac{1}{2}e^2\sin 2g + \frac{1}{8}e^3(3\sin 3g - \sin g) + \frac{1}{6}e^4(2\sin 4g - \sin 2g) + \cdots$ 
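
The series for E can be compared with a direct numerical solution of Kepler's equation g = E - e sin E. The sketch below uses Newton's iteration for the reference value; the coefficients are the standard fourth-order expansion in the eccentricity, with g the mean anomaly. Function names are hypothetical.

```python
import math

def eccentric_anomaly_series(g, e):
    """Fourth-order series in e for the eccentric anomaly."""
    return (g + e * math.sin(g)
            + e**2 * math.sin(2 * g) / 2
            + e**3 * (3 * math.sin(3 * g) - math.sin(g)) / 8
            + e**4 * (2 * math.sin(4 * g) - math.sin(2 * g)) / 6)

def eccentric_anomaly_newton(g, e, tol=1e-14, max_iter=50):
    """Solve g = E - e*sin(E) by Newton's method, starting from E = g."""
    E = g
    for _ in range(max_iter):
        step = (E - e * math.sin(E) - g) / (1 - e * math.cos(E))
        E -= step
        if abs(step) < tol:
            break
    return E

g, e = 1.0, 0.1
print(eccentric_anomaly_series(g, e), eccentric_anomaly_newton(g, e))
```

For small eccentricity the two agree to about the size of the first neglected term, of order e⁵.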

4.9 Gravitational force $F_g = GM_\odot m_e/r^2$, force due to radiation pressure $F_p = \sigma_{Th}\langle a\rangle/c$, $\sigma_{Th}$ = Thomson
cross-section for electron $= 8\pi(e^2/m_ec^2)^2/3$, $\langle a\rangle$ = average energy flux from the sun $= L_\odot/4\pi r^2$, $L_\odot$ =
luminosity of sun $= 3.86 \times 10^{26}$ W. See that $F_g/F_p \simeq 40$; the electron cannot escape from the solar system.
Setting the radiation pressure on an electron equal to the weight of a proton, one obtains the Eddington limit
of maximum possible equilibrium stellar luminosity, corresponding to L./M. = 1.5 / e*.

4.10 Assuming a circular orbit of earth, $v_{\rm esc}$ from the orbit $= \sqrt{2}\,v_{\rm orb}$. The excess speed required in the
direction of earth's orbital motion is $(\sqrt{2} - 1)v_{\rm orb} = v_{\rm excess}$. In order to achieve this, the speed at which the
object has to be thrown from the surface of the earth is $v = \sqrt{v_{\rm excess}^2 + v_{\rm esc,earth}^2} = \sqrt{(12.1)^2 + (11.2)^2} =$
16.4 km/s, since $v_{\rm orb} = 29.8$ km/s.

4.11 Because $\sqrt{M_\odot/M_\oplus} = 574 >$ (dist of sun/dist of moon) = 390, the force on the moon due to earth
< force on the moon due to the sun. Hence, the net centripetal force on moon is always acting towards the
sun (or its vicinity) irrespective of the location of the earth. The orbit of moon is therefore always concave
towards the sun.
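The comparison above can be turned into a force ratio. This is a small sketch assuming inverse-square gravity and using only the two numbers quoted in the hint (574 and 390); the function name is illustrative.

```python
def sun_to_earth_force_ratio(sqrt_mass_ratio, distance_ratio):
    """F_sun / F_earth on the moon for inverse-square gravity.

    sqrt_mass_ratio: sqrt(M_sun / M_earth)
    distance_ratio:  (moon-sun distance) / (moon-earth distance)
    """
    return (sqrt_mass_ratio / distance_ratio) ** 2

ratio = sun_to_earth_force_ratio(574.0, 390.0)
print(ratio)  # a value above 1 means the sun's pull on the moon dominates
```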

4.12 $\langle r\rangle_t = \int_0^P r\,dt/P = a(1 + e^2/2)$, $\langle r\rangle_\theta = a\sqrt{1 - e^2}$, $\langle r\rangle_s = \oint r\,ds/\oint ds = a$. For the
last one you should use Eq. (4.42) and change the variable of integration to the eccentric anomaly E.
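The three averages can be verified by numerical quadrature over the eccentric anomaly. The sketch assumes the standard relations $r = a(1 - e\cos E)$, $dt \propto (1 - e\cos E)\,dE$ and $d\theta/dE = \sqrt{1 - e^2}/(1 - e\cos E)$, which are not quoted in the hint; the function name and default arguments are illustrative.

```python
import math

def orbit_averages(a=1.0, e=0.5, n=2000):
    """Return (<r>_t, <r>_theta, <r>_s) for a Kepler ellipse.

    Each average is a ratio of midpoint-rule sums over the eccentric
    anomaly E, with r = a (1 - e cos E) measured from the focus.
    """
    b = a * math.sqrt(1.0 - e * e)
    num_t = den_t = num_th = den_th = num_s = den_s = 0.0
    for k in range(n):
        E = 2.0 * math.pi * (k + 0.5) / n
        r = a * (1.0 - e * math.cos(E))
        w_t = 1.0 - e * math.cos(E)                          # dt/dE up to a constant
        w_th = math.sqrt(1.0 - e * e) / w_t                  # dtheta/dE
        w_s = math.hypot(a * math.sin(E), b * math.cos(E))   # ds/dE
        num_t += r * w_t
        den_t += w_t
        num_th += r * w_th
        den_th += w_th
        num_s += r * w_s
        den_s += w_s
    return num_t / den_t, num_th / den_th, num_s / den_s

print(orbit_averages(a=2.0, e=0.5))
```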

4.13 Transfer orbit is assumed to be Keplerian. In order to meet the counter-earth, its period has to be
$T = T_\oplus(2n + 1)/2n$, n being any positive integer. Most satisfactory choice is n = 1. Answer to the last
part is approximately 4e radian or 3.3 degrees of arc.

4.14 $r_1 = 1.0$ AU, $r_2 = 1.51$ AU. The required speed for sending from the surface of the earth = 11.56 km/s.
In-flight time = 256.8 days.
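Both numbers can be reproduced with a short calculation. The sketch assumes a Hohmann-type transfer ellipse touching both orbits, a year of 365.25 days, and the speeds 29.8 km/s (earth's orbital speed) and 11.2 km/s (escape speed from earth's surface) used in hint 4.10; the function name is illustrative.

```python
import math

def hohmann_transfer(r1_au, r2_au, v_orbit=29.8, v_escape=11.2):
    """Flight time (days) and launch speed from the surface (km/s).

    Assumes a transfer ellipse with perihelion r1 and aphelion r2,
    and that v_orbit is the circular orbital speed at r1.
    """
    a = 0.5 * (r1_au + r2_au)               # semi-major axis in AU
    flight_days = 0.5 * a**1.5 * 365.25     # half a period, from a^3 = P^2
    # vis-viva at perihelion, written in units of the circular speed at r1
    v_perihelion = v_orbit * math.sqrt(2.0 * r2_au / (r1_au + r2_au))
    v_excess = v_perihelion - v_orbit
    v_launch = math.sqrt(v_excess**2 + v_escape**2)
    return flight_days, v_launch

days, speed = hohmann_transfer(1.0, 1.51)
print(days, speed)
```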

4.15 Since the intended orbit was a bound one and the launching speed (hence KE, and also total energy) did
not change, it will still be an elliptical one. Calculate the total energy for the intended orbit, which gives using
Eq. (4.38) the length of the semi-major axis $a = R_\oplus + h$. Since the distance between two foci via any point
on the orbit is 2a, and given that the distance from one focus = a, the satellite must be on the semi-minor
axis. Then from the property of reflection angle (see section 4.10) $\tan\phi = b/ae$, giving $e = \cos\phi$.

4.16 Distance of nearest approach from prime focus = radius of the star, and the initial speed of the particle
= the speed of light = c. So, $R_* = a(e - 1) = |GM_*m/2E'|\,(\csc(\Theta/2) - 1)$. Taking the given values of $R_*$
and $M_*$, the classical deflection of light $\Theta = 2\sin^{-1}(GM_*/(GM_* + R_*c^2)) = 20^\circ.96$.




4.17 For hard sphere of radius r, the scattering cross-section $\sigma(\Theta) = r^2/4$. Effective focal length of Rutherford
scattering is $f = |K/4E'|$.

4.18 First calculate how many neutrinos (actually $\bar\nu_e$) are emitted by the supernova = total energy
available/energy per neutrino. If they are emitted equally in all directions, then at such a far off distance, how
many of them passed through per square metre of normal area (that is, integrated flux at earth)? Say N
particles/m$^2$. Now if there are N' protons in the Cerenkov detector and $\sigma_\nu$ is the reaction cross-section
between protons and $\bar\nu_e$, then the number of antineutrinos caught in the detector $N_c = NN'\sigma_\nu$. Knowing
N, N' and $N_c$, show that $\sigma_\nu \sim 10^{-47}$ m$^2$.

4.20 (i) fl, = o a (l - e 3 )u>p/2dP , (ii) 0, =■*,5(1 - vT^?)/ e », (iil) fl, = 6(Jf7*<» 3 )(l + e*/4)/(l - 

e 2 ) 3 

4.21 See Am. J. Phys. 44, p. 687 (1976).

4.22 Ans. A sphere of radius 15 fm.

4.23 Use the formula (4.79). Answer is 6.7 x 10 13 kg/year. 

4.24 This is basically due to a simple geometrical property of prolate spheroids. 

4.25 The Roche lobe radius $R_{RL} = (16\rho_p/\rho_s)^{1/3}R_p \simeq 2.51R_p$. A satellite is broken into pieces to form rings
if it comes closer than $R_{RL}$.

4.26 Add the rotational part, due to centrifugal potential energy. 


Chapter 5 

5.1 If the Lagrangian is linearly dependent on velocity, the Hamiltonian vanishes, and vice versa! The latter
is true for light rays and ultrarelativistic particles. 

5.2 All $\dot q_i$'s have to be expressed as explicit functions of the $p_i$'s. For this invert the equations obtained from
$p_i = \partial L/\partial\dot q_i$

5.3 These strongly resemble the creation and annihilation operators defined in the context of quantised
harmonic oscillators. Then, how about the Hamiltonian equations of motion expressed in terms of creation and
annihilation operators?

5.4 These look very similar to Dirac's formulation of constraint dynamics. We shall talk more about them in
chapter 9. 

5.5 These are six independent constants of motion. The first three correspond to the conservation of angular
momentum, the case being one of the central forces. 

5.6 (i) $H(x,p) = p^2/2(1 + 2\beta x)^2 + \omega^2x^2/2 + \alpha x^3$

(ii) If you treat z(t) as rheonomic variable, $H(\theta,p_\theta) = (p_\theta + ml\dot z\sin\theta)^2/2ml^2 - mgl\cos\theta - m\dot z^2/2 - mgz$,
otherwise it would be a very complicated expression for $H = H(\theta,z,p_\theta,p_z)$

(iii) $H(q,p,t) = [p - F(q,t)]^2/2G(q,t) + V(q,t)$

(iv) $H(x,p) = p^2/[2m\{1 + (df/dx)^2\}] + mgf(x)$

5.7 (i) $R(r,\dot r,p_\theta) = -\mu\dot r^2/2 + p_\theta^2/2\mu r^2 - GMm/r$, the last two terms being the effective potential for
r-motion.

(ii) $R(\theta,\dot\theta,p_\phi,p_\psi) = p_\psi^2/2I_3 + (p_\phi - p_\psi\cos\theta)^2/2I_1\sin^2\theta - I_1\dot\theta^2/2 + mgl\cos\theta$, the second and fourth
terms constitute the effective potential for $\theta$-motion.

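5.8 $\mathbf r_i' = \mathbf r_i + \boldsymbol\epsilon$, and $\mathbf p_i' = \mathbf p_i$ for the i-th particle, $\boldsymbol\epsilon$ being a small constant infinitesimal translation.

The condition for invariance: $H(\mathbf r_i,\mathbf p_i) = H(\mathbf r_i',\mathbf p_i')$. On substitution for $\mathbf p_i'$ and $\mathbf r_i'$, the rhs becomes

$H(\mathbf r_i,\mathbf p_i) - \boldsymbol\epsilon\cdot\sum_i\dot{\mathbf p}_i$, which means $\sum_i\mathbf p_i$ = constant, as $\boldsymbol\epsilon$ is arbitrary. In a similar way prove the second part.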

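5.9 (i) $\mathbf v = \dot{\mathbf r} = \partial H/\partial\mathbf p = c^2(\mathbf p - e\mathbf A)/D$, where $D \equiv \sqrt{c^2(\mathbf p - e\mathbf A)^2 + m_0^2c^4}$,

$\dot{\mathbf p} = \{c^2e((\mathbf p - e\mathbf A)\cdot\nabla)\mathbf A + c^2e(\mathbf p - e\mathbf A)\times(\nabla\times\mathbf A)\}/D - e\nabla\phi = e(\mathbf E + \mathbf v\times\mathbf B)$

(ii) $\dot q = \partial H/\partial p = p/(2m\omega\sin^2\omega t)$ and $\dot p = -(\partial H/\partial q) = p\omega\cot\omega t + m\omega^2q$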

5.10 From Eq. (5.26), it is easy to see that p changes only in the direction of $\nabla\mu$, so that only $p_z$ will be
affected due to the vertical gradient of $\mu$. For horizontally grazing incidence of light, $p_z \ll p_x$, and one can
take p to be practically constant. Differentiating Eq. (5.25) with respect to time and eliminating $\dot p_z$ using
Eq. (5.26), one can easily establish $\ddot z \simeq (c^2/\mu^3)(d\mu/dz)$. Thus, for $\mu$ decreasing upwards, the curvature of the
trajectory will be convex upward (inverted mirage) and vice versa.

5.11 Just algebra. 


Chapter 6 

6.1 You have to minimise potential energy, which can be formulated to appear in the form $\delta\int y\,ds = 0$.
The solution corresponds to catenary: $y = a\cosh(x/a)$

6.2 This is a problem of shortest route on the surface of a sphere: $\delta\int ds = 0$, $ds^2 = R^2(d\lambda^2 +
\cos^2\lambda\,d\phi^2)$, $\lambda$ = geographical latitude, and $\phi$ = geographical longitude. Define $\lambda' = d\lambda/d\phi$, and arrive at the
equation of geodesic in the form $\int d\phi = \int C_0\,d\lambda/(\cos\lambda\sqrt{\cos^2\lambda - C_0^2})$, $C_0$ being a constant of integration.
Acceptable solution exists for $\cos^2\lambda > C_0^2 = \cos^2\lambda_0$, say. Thus at $\lambda = \lambda_0$, the path will always be convex
towards the nearer pole. One can also find $\lambda''$, and show that $\lambda'' < 0$ at $\lambda' = 0$ and $\lambda > 0$.

6.3 Express $L = \tfrac{1}{2}m\dot z^2 - V$, and minimise $\int_0^{t_0} L\,dt$ with respect to variations of A and C.

6.4 Here both x and t are transformed, with the result that the Lagrangian would transform according to Eq. 
(6.29). It is not so apparent that the form of the Lagrangian would remain unchanged, but not surprisingly, 
covariance would demand it. 

6.5 This is also an interesting property of the given Lagrangian. Most useful in the field theories.
Straightforward algebra, $dt = \lambda\,d\tau$, $\dot q = \lambda^{-1}dq/d\tau$, and set $\delta W = (\partial W/\partial\lambda)\delta\lambda = 0$

6.6 Since at the surface of discontinuity, the potential is changed abruptly, the normal component of momentum 
would change abruptly and the tangential component remains continuous, that is, $v_1\sin i = v_2\sin r$. For
electromagnetic waves, the boundary conditions for electric and magnetic fields are such that the speed relation 
is just the opposite. 

6.7 Use $\mathbf r' = \mathbf r + \epsilon\mathbf u$, $t' = t + \epsilon$, $\epsilon$ being a small quantity. This leaves the argument of V invariant. Then
apply Noether’s theorem (Eq. (6.40)) 

6.8 For infinitesimal transformations, choose $\alpha = 1 + \epsilon$, and $\beta = 1 + \gamma\epsilon$. Now, $\delta W = 0$ implies $\gamma = 2$
and $n = -2$. The final result follows from the application of Noether's theorem.

6.9 We know that for free particles, $L\,dt = -m_0c\,ds$. In the general theory of relativity, all motions are free
and follow geodesics, even though gravity is present (gravity is considered to be as fictitious as centrifugal or 
Coriolis forces are). So the form of the Lagrangian from the above relation is something that varies as the 
ratio ds over dt. From the given metric, find ds/dt, and hence the form of the Lagrangian L (per unit mass) 
for motion of test particles in the gravitational field of the heavy object, given by 


Once L is known, you can derive the equation of motion. You should, however, remember that $dr/dt = v_r$,
not v, and that $v^2 = \dot r^2 + r^2\dot\theta^2 + r^2\sin^2\theta\,\dot\phi^2$

6.10 Assume that $g_{ij}$ are functions of coordinates. Write down Lagrange's equations of motion. For each
term do the necessary differentiations and finally arrange the equations of motion in the form prescribed in
the problem. The relevance of the last two exercises will be clear only when you study tensor calculus
and the general theory of relativity. However, remember that L is defined in a configuration space, but $ds$ and
$g_{ij}$ for metric spaces only. So the analogy that this problem tries to impress upon you is valid only for free
particle dynamics, for which the configuration space is indeed a metric space.


Chapter 7 
7.1 Ans. I = 4 R. 




7.2 Use $ds^2 = dx^2 + dy^2$, $r^2 = x^2 + y^2$ and $s = 0$ at $r = r_0$.

7.3 Show that the relation (7.10) satisfies the differential equation (7.8) 

7.4 Ans. $L = 2\theta(2\pi - \theta)R/\pi^2$, $H = R\theta/\pi$, $T_{\rm hypocycloid} : T_{\rm straight} = \sqrt{\theta(2\pi - \theta)} : \pi$

7.5 Derive the Lagrangian for the motion of the disc: $L = 3m\dot u^2/2 - mgy$, u being the measure of distance
along the curve, y the height of the cm. For tautochronous motion, we must have the potential energy term
$\propto u^2$, suggesting y to have a form $y = a + ku^2$. Since $dx^2 = du^2 - dy^2$, this gives an idea of the nature
of $dx^2$. Show that they satisfy the equation of a cycloid.

7.6 The effective radius increases, which becomes infinite when the effective gravity vanishes on the surface
of the earth for $\omega \simeq 17\omega_\oplus$

7.7 This is a case for which brachistochrone is a circular arc! Travel time $= 2\sqrt{m/c}\,\ln[(x_0 + \sqrt{x_0^2 + y_0^2})/y_0]$

7.8 $ds^2 = \rho_0^2\,d\phi^2 + dz^2$, $v^2 = 2g(z_0 - z)$. Define a parameter $\zeta = \rho_0\phi$, and proceed. The
travel time along the brachistochrone $t = \theta_0\csc(\theta_0/2)\sqrt{z_0/2g}$, where $\theta_0$ is the solution of the equation
$(1 - \cos\theta_0)/(\theta_0 - \sin\theta_0) = z_0/\zeta_0$, $\zeta_0 = \rho_0\phi_0$


Chapter 8 


8.1 Use Eqs (8.10) and (8.12)-(8.14)

8.2 Follow a method similar to ones used in section 8.3 

8.3 $P = (-p_1, p_2, p_3)$

8.4 Fj = RijXjj/j and p, = fi.jp' 

8.5 The Jacobian for the first transformation is not unity, $p^2 = p_r^2 + p_\theta^2/r^2 + p_\phi^2/(r^2\sin^2\theta)$

8.6 (i) Make use of the relations (8.15). (ii) See Whittaker’s book. 

8.7 (i) Use the condition (8.17). 

(ii) Use the first of the conditions (8.15) and finally obtain $\alpha\beta q^{2\alpha - 1} = 1$. Since q is arbitrary, $\alpha = 1/2$
and $\beta = 2$

(iii) Use the elementary PB relations as given in problem 8.6 (i). 

(iv) The same conditions as above. 

(v) Keeping t = constant, use the conditions (8.17).

(vi)-(x) Use the elementary PB relation.

8.8 (i) $Q = \tan^{-1}(m\omega q/p)$, $P = (p^2 + m^2\omega^2q^2)/2m\omega$, $K = \omega P(1 + \dot\omega\sin Q\cos Q/\omega^2)$

(ii) $Q = \tan^{-1}(m\omega q/p - F(t)/\omega p)$, $P = p^2/2m\omega + m\omega(q - F(t)/m\omega^2)^2/2$,
$K = \omega P + \sqrt{2P/m\omega^3}\,[\omega F(t)\sin Q - \dot F(t)\cos Q] + F^2(t)/2m\omega^2$

(iii) $Q = p + m\omega q$, $P = (p - m\omega q)/2m\omega$, $K = (Q^2 + 4m^2\omega^2P^2)/4m$

(iv) $Q = \ln(1 + \sqrt{q}\cos p)$, $P = 2(1 + \sqrt{q}\cos p)\sqrt{q}\sin p$,

$K = \frac{1}{2m}\{\tan^{-1}[P/2e^Q(e^Q - 1)]\}^2 + \frac{1}{2}m\omega^2[(e^Q - 1)^2 + P^2/4e^{2Q}]^2$
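That the transformation in (iv) is canonical can be checked numerically by evaluating the Poisson bracket $[Q, P]$ with respect to $(q, p)$, which should equal 1. The sketch below estimates the partial derivatives by central differences; the helper names and the step size are illustrative assumptions.

```python
import math

def Q(q, p):
    """Q = ln(1 + sqrt(q) cos p)."""
    return math.log(1.0 + math.sqrt(q) * math.cos(p))

def P(q, p):
    """P = 2 (1 + sqrt(q) cos p) sqrt(q) sin p."""
    s = math.sqrt(q)
    return 2.0 * (1.0 + s * math.cos(p)) * s * math.sin(p)

def poisson_bracket(F, G, q, p, h=1e-5):
    """[F, G] = dF/dq dG/dp - dF/dp dG/dq, by central differences."""
    dFq = (F(q + h, p) - F(q - h, p)) / (2.0 * h)
    dFp = (F(q, p + h) - F(q, p - h)) / (2.0 * h)
    dGq = (G(q + h, p) - G(q - h, p)) / (2.0 * h)
    dGp = (G(q, p + h) - G(q, p - h)) / (2.0 * h)
    return dFq * dGp - dFp * dGq

for q, p in ((0.7, 0.4), (1.5, 1.0)):
    print(q, p, poisson_bracket(Q, P, q, p))
```

The sample points are chosen so that $1 + \sqrt{q}\cos p > 0$, where the logarithm is defined.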

8.10 Since the particle is moving under $V = -mgz$, $Q(t) = q(t + \tau) = q(t) + p(t)\tau/m + g\tau^2/2$ and
$P(t) = p(t + \tau) = p(t) + mg\tau$, satisfying the relations $p = \partial F_1/\partial q$ and $P = -\partial F_1/\partial Q$

8.11 The transformed Hamiltonian is desired to have the form $K(Q,P) = P^2/2 + \omega^2Q^2/2$. From the given
$F_2$, one can obtain using (8.12) and rearranging the terms with Taylor expansions, $p \simeq P + 2aQP$ and
$q \simeq Q - aQ^2 - 3bP^2$. Substituting in H and equating K with H, obtain $a = \alpha$ and $\beta = 3b - 2a$. Now
the solutions in terms of Q and P are perfectly harmonic, and the anharmonic solutions for q and p can be
obtained from those for Q and P by the transformation $q = A\cos\omega t - aA^2\cos^2\omega t - (\beta + 2a)\omega^2A^2\sin^2\omega t$,
and so on.

8.12 $dy - p\,dx = d(y - px) + x\,dp = \lambda(dY - P\,dX)$ say, where $(x,y,p) \to (X,Y,P)$ so
that the contact nature is preserved. Taking $\lambda = 1$, one can have $X = p$, $Y = y - px$,
$x = -P$, $y = -PX + Y$, which make a contact transformation between (x,y) and (X,Y)


Chapter 9 

9.1 All the rules given in Eqs (9.1)-(9.7) are in fact common. Ans. 0 and [[A, C], [B, D]]




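9.2 (i) $\mathbf a\cdot\mathbf b$, (ii) $2\mathbf a(\mathbf a\cdot\mathbf r)$, (iii) $-\mathbf r\times\nabla V = \mathbf T$ = torque, (iv) $(\mathbf g\times\mathbf f)\cdot\mathbf L + [f_i, g_j]L_iL_j$.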

9.3 [Li,Ajk] = UjlAik + UklAij, [Ajk,Au] = (f*/m 6*j + (jlmbik + Ckimfy + (jimfkl)Lm 

9.4 The starting point is the relation $m\mathbf v = \mathbf p - e\mathbf A(\mathbf r,t)$.

9.5 To show A as a constant of motion, prove [A, H] = 0 

$[A_i,A_j] = \epsilon_{ijk}L_k$, $[L_i,A_j] = \epsilon_{ijk}A_k$ and $[A_i,H] = 0 = [L_i,H]$. These are in fact generators of Lie algebra
O(4) = O(3) × O(3) for E < 0 and L(3,1) for E > 0.

9.6 Write the usual Taylor series expansion and use Poisson’s theorem (Eq. (9.14)). 

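9.7 Show that $\Delta p_i = (\partial p_i/\partial\theta)\Delta\theta = -(\partial\omega/\partial q_i)\Delta\theta = [p_i,\omega]\Delta\theta$. Similarly for $\Delta q_i$. Then
use the relation (9.7) for any $f = f(q,p)$:

$\Delta f = \sum_i\left(\frac{\partial f}{\partial q_i}\Delta q_i + \frac{\partial f}{\partial p_i}\Delta p_i\right) = [f,\omega]\Delta\theta$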


9.8 Starting relations are Poisson's theorem (Eq. (9.14)) for $Q_i$ and $P_i$, and the definitions of $Q_i$ and $p_i$
from $F_2$. Then evaluate using the basic definition of PB relation (Eq. (9.1)) the PB $[Q_i,\partial F_2/\partial t]$ and at
an intermediate stage you should obtain $[Q_i,\partial F_2/\partial t] = \partial^2F_2/\partial P_i\partial t + (\partial^2F_2/\partial q_j\partial t)(\partial q_j/\partial P_i)$, which will
finally reduce to $\partial Q_i/\partial t$

9.9 $dQ/dT = -dq/dt = -\partial H(q,p)/\partial p = -\partial H(Q,-P)/\partial(-P) = \partial K/\partial P$. Similarly, $dP/dT =
dp/dt = -\partial K/\partial Q$

So Hamilton's equations of motion are preserved. But the PB relation $[Q,P] = -1 \neq [q,p]$


Chapter 10 

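10.1 Show that $(d/dt)(\partial S/\partial\alpha_i) = (\partial H/\partial\alpha_i)$. Since $H = E = \alpha_n$, $\partial S/\partial\alpha_i = 0$ for $i = 1,\ldots,n-1$
and $\partial S/\partial\alpha_n = t$ + const.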

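10.2 $W(x,y,z,t,p_1,p_2,E) = -Et + p_1x + p_2y - (2\sqrt{2m}/3mg)(E - p_1^2/2m - p_2^2/2m - mgz)^{3/2}$ and $\beta_1 = \partial W/\partial E$,
$\beta_2 = \partial W/\partial p_1$, $\beta_3 = \partial W/\partial p_2$ are also constants by Jacobi's theorem.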

10.3 The Hamiltonian is $H(p,x) = (p^2\,{\rm sech}^2x + x^2)/2$ and the complete integral for W is
$W(x,E,t) = -Et + \int\sqrt{2E - x^2}\cosh x\,dx$. Now find and solve the equations of motion by both the
methods.

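10.4 The HJ equation in $F_2 = S(q,P)$ would be $(1/2m)(\partial S/\partial q)^2 + m\omega^2q^2/2 = f(P)$ with $p =
\partial S/\partial q$ and $Q = \partial S/\partial P$. Solution to HJ equation gives $S(q,P) = \{f(P)/\omega\}\sin^{-1}\sqrt{m\omega^2q^2/2f(P)} +
\tfrac{1}{2}q\sqrt{2mf(P) - m^2\omega^2q^2}$. Now $Q = \partial S/\partial P = \{f'(P)/\omega\}\sin^{-1}\sqrt{m\omega^2q^2/2f(P)}$, thus giving the CT
$q = \sqrt{2f(P)/m\omega^2}\,\sin(\omega Q/f')$, $p = \sqrt{2mf(P)}\,\cos(\omega Q/f')$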

10.5 Write the Hamiltonian. On comparison with Eq. (10.17) $a(r) = 0$, $b(\theta) = \alpha\cos\theta$, and therefore the
complete integral for HJ equation takes the form (see Eq. (10.18))

$W(r,\theta,\phi,E,\beta,p_\phi,t) = -Et + p_\phi\phi \pm \int\sqrt{\beta - 2\mu\alpha\cos\theta - p_\phi^2/\sin^2\theta}\,d\theta \pm \int\sqrt{2\mu E - \beta/r^2}\,dr$

where the first integrals of motion are $E$, $p_\phi$ and $\beta$. Since the angular momentum $L = \sqrt{p_\theta^2 + (p_\phi^2/\sin^2\theta)}$,
$\beta = L^2 + 2\mu\alpha\cos\theta$, and therefore L is not conserved but its z-component $p_\phi$ is. A particle can fall
into the centre if $p_r = m\dot r = -\sqrt{2mE - (\beta/r^2)} \to -\infty$ as $r \to 0$, that is, if $\beta < 0$. Hence the impact
parameter satisfies $s^2 < \alpha\cos\theta/E$ leading to the cross-section $\sigma = (\pi\alpha/E)\cos a$, $a = \angle(\mathbf v_\infty,\boldsymbol\alpha)$. Averaging
over all possible orientations of $\boldsymbol\alpha$, $\langle\sigma\rangle = \pi\alpha/4E$

$L^2 > 0$, but for a given $\beta$ and E, $L^2$ becomes negative for $\cos\theta > \beta/2\mu\alpha$, and all such particles are absorbed.




10.6 Derive the Hamiltonian and show that the HJ equation is 


dW 2 

dt m(u + v) 


f dw Y ± f dw \ 3 . 1 

+v \to) u« + v)[w) 


1 + f («’ ■ 


) =0 


and proceed for the separation of variables as demonstrated in section 10.4. Note that E is not energy here, 
but the magnitude of the electric field.

10.7 The action variable $J = (1/\pi)\int_{q_1}^{q_2}\sqrt{2m(E - V_0\tan^2(aq))}\,dq$, where the limits $q_1$ and $q_2$ are given by
$q_1 = -q_2$, and $\tan^2(aq_2) = E/V_0$. On integration, this results in $J = (\sqrt{2m(E + V_0)} - \sqrt{2mV_0})/a$,
and the angle variable $w = \nu t + \beta$, where the frequency $\nu = dE/dJ = a\sqrt{2(E + V_0)/m}$ and the angular
frequency $= 2\pi\nu$
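The closed form for J can be checked against direct quadrature of the action integral. The sketch below assumes the normalisation $J = (1/\pi)\int_{q_1}^{q_2}p\,dq$ and uses the substitution $q = q_2\sin\varphi$, an illustrative choice that removes the square-root behaviour at the turning points; the function names and sample parameters are likewise illustrative.

```python
import math

def action_numeric(E, V0, a, m=1.0, n=20000):
    """(1/pi) * integral of sqrt(2m(E - V0 tan^2(a q))) dq between turning points."""
    q2 = math.atan(math.sqrt(E / V0)) / a      # turning point: tan^2(a q2) = E/V0
    h = math.pi / n
    total = 0.0
    for k in range(n):
        phi = -0.5 * math.pi + (k + 0.5) * h   # midpoint rule in phi
        q = q2 * math.sin(phi)
        radicand = 2.0 * m * (E - V0 * math.tan(a * q) ** 2)
        # max() guards against tiny negative round-off near the turning points
        total += math.sqrt(max(radicand, 0.0)) * q2 * math.cos(phi)
    return total * h / math.pi

def action_closed_form(E, V0, a, m=1.0):
    """J = (sqrt(2m(E + V0)) - sqrt(2m V0)) / a, as quoted in the hint."""
    return (math.sqrt(2.0 * m * (E + V0)) - math.sqrt(2.0 * m * V0)) / a

print(action_numeric(1.0, 1.0, 1.0), action_closed_form(1.0, 1.0, 1.0))
```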

10.8 $J_\phi = 2\pi\alpha_\phi$ and $J_r = \oint\sqrt{2mE + 2mk/r - \alpha_\theta^2/r^2}\,dr$. These are the same as those for the
Kepler problem (see Eqs (10.41)-(10.43)). But $J_\theta = \oint\sqrt{\alpha_\theta^2 - \alpha_\phi^2\csc^2\theta - \beta^2\sec^2\theta}\,d\theta$ which is different.
Substitute $u = \tan^2\theta$ and do a contour integral (branch cut from $u_1$ to $u_2$) to evaluate $J_\theta = \pi(\alpha_\theta - \alpha_\phi - \beta)$
and finally $J_r = -2\pi\alpha_\theta + \sqrt{2\pi^2mk^2/(-E)}$. Show that $E = -2\pi^2mk^2/(J_r + 2J_\theta + J_\phi + 2\pi\beta)^2$,
giving $\nu_r = \nu_\phi = 4\pi^2mk^2/(J_r + 2J_\theta + J_\phi + 2\pi\beta)^3 = \nu_\theta/2$, implying that the particle completes two
oscillations in $\theta$ during each orbit.

10.9 The form of the Hamiltonian would be like the one shown in Eq. (10.16) with the condition that
$V(r) \neq 0$ only at $r = a$, the radius of the sphere. It is the last integral in Eq. (10.18) that would appear
in the integral for $J_r$. $J_r = (1/2\pi)\oint\sqrt{2mE - L^2/r^2}\,dr$, L = angular momentum, and E = energy. Thus
$J_r = J_r(Ea^2, L)$. Thus for adiabatic changes in the radius a, we must have $Ea^2$ = const., and for the angle
of incidence $\alpha$, $\sin\alpha = r_m/a = L/(a\sqrt{2mE})$ = const.

10.10 Set up the HJ equation. The constant of separation for the $\phi$-variable will be $\alpha_\phi = p_\phi^2/2mr^2\sin^2\theta -
eBp_\phi/2m$ and its corresponding action variable $J_\phi = 2\pi\alpha_\phi$. Other action variables are $J_\theta = J_\theta(p_\phi,\beta)$, $J_r =
J_r(E + eBp_\phi/2m,\beta)$. When B is slowly changed, $p_\phi$, $\beta$ and $E + eBp_\phi/2m$ should remain constant.


Chapter 11 

11.1 Ans. $\omega_0 = \sqrt{2pV/md^2}$ for isothermal oscillations, and $= \sqrt{2\gamma pV/md^2}$ for adiabatic oscillations, V =
volume of the enclosed gas, 2d = length of the gas column, p = gas pressure at equilibrium, and $\gamma = c_p/c_v$

11.2 The equation of motion of the particle along the curve is given by $m\ddot s = -mg\sin\theta = -mg\,dy/ds =
-m\omega^2s$, hence the required equation of the curve becomes $y = (\omega^2/2g)s^2$.

11.3 The flywheel of the watch drags some amount of air with it due to the viscosity of air, and hence the 
effective moment of inertia of the flywheel is slightly higher on the ground than at a high altitude.

11.4 Let $r_0$ = radius of the ball, $R_c$ = radius of the circular track for the ball-track contact, $r_c$ = contact
radius of the ball on the track $= r_0/\sqrt{2}$ for say 90° trough, m = mass of the ball, $R_g$ = radius of gyration
of the track, $\phi$ = angular displacement of the CM of the track from the vertical, $\psi$ = instantaneous angular
displacement of the centre of the ball from the vertical. There are 2 DOF (no nonholonomic constraints).

KE $= T = \tfrac{1}{2}MR_g^2\dot\phi^2 + \tfrac{1}{2}mr^2\dot\psi^2 + \tfrac{1}{5}m(r_0/r_c)^2(R_c\dot\phi - r\dot\psi)^2$, and PE $= V = MgR(1 - \cos\phi) + mgr(1 -
\cos\psi)$

Set up Euler-Lagrange's equations of motion and seek the normal mode solutions $\phi = A_1\cos\omega t$, $\psi =
A_2\cos\omega t$. Because of the coupled nature of the equations of motion, there would be dramatic starts and stops.

11.5 Set up the Lagrangian $L = I\dot\theta^2/2 + mgr\cos(\theta + \phi) - m'gr\theta$ + constant, giving the period
$P = 2\pi\sqrt{I/mgr\cos\phi}$

11.6 Ans. $P \simeq 2\pi\sqrt{m_0/k}\,[1 + 3kb_0^2/16m_0c^2]$ for the first case.

$P = 2\pi\sqrt{l/g}\,[1 + \theta_0^2/16 + 11\theta_0^4/3072 + 173\theta_0^6/(45\times2^{14}) + \cdots]$
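The pendulum series can be compared with the exact period. The sketch assumes the standard closed form $P/(2\pi\sqrt{l/g}) = 1/{\rm AGM}(1, \cos(\theta_0/2))$, where AGM is the arithmetic-geometric mean; this formula is not part of the answer above, and the function names are illustrative.

```python
import math

def period_ratio_series(theta0):
    """P / (2 pi sqrt(l/g)) from the series, truncated after theta0^6."""
    return (1.0 + theta0**2 / 16.0
            + 11.0 * theta0**4 / 3072.0
            + 173.0 * theta0**6 / (45.0 * 2**14))

def period_ratio_exact(theta0):
    """P / (2 pi sqrt(l/g)) = 1 / AGM(1, cos(theta0 / 2))."""
    x, y = 1.0, math.cos(0.5 * theta0)
    for _ in range(20):            # AGM converges quadratically; 20 steps is ample
        x, y = 0.5 * (x + y), math.sqrt(x * y)
    return 1.0 / x

for theta0 in (0.1, 0.5, 1.0):
    print(theta0, period_ratio_series(theta0), period_ratio_exact(theta0))
```

The difference between the two columns should grow roughly as $\theta_0^8$, the first neglected term.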

11.7 Set up the Lagrangian and the Routhian in order to find the effective potential for the $\theta$-motion given
by $V_{\rm eff}(\theta) = p_\phi^2/2m^2l^4\sin^2\theta - (g/l)\cos\theta$. For steady motion, $V'_{\rm eff}(\theta_0) = 0$ giving $\sec\theta_0\sin^4\theta_0 =
(p_\phi^2/m^2l^3g)$ and the angular frequency of oscillation in $\theta$ about $\theta_0$ is given by $\omega = \sqrt{d^2V_{\rm eff}(\theta_0)/d\theta^2} =
\sqrt{g(1 + 3\cos^2\theta_0)/l\cos\theta_0} \to \sqrt{4g/l}$ for $\theta_0 \to 0$.

11.8 Ans. wi,j,3 = w 0 \/3, — 3, 0 where w 0 = y/k/m. 

11.9 The Lagrangian for the system is $L = \tfrac{1}{2}m(\dot u_1^2 + \dot u_2^2) - \tfrac{1}{2}k(u_1^2 + (u_2 - u_1)^2)$, so that the b-matrix is
not diagonal. This problem is quite similar to the one worked out in section 11.2.5.

11.10 In the first case, the b-matrix is not diagonal and in the second case the a-matrix is not diagonal. Procedures
are once again straightforward.

11.11 Make the bob out of any ferromagnetic substance and apply a strong magnetic field in order to change
the spring constant. You would need about 0.03 Tesla of magnetic field which can be created inside a solenoid 
of diameter about 0.1 m carrying a dc electric current of about 5 A, to be applied on a bob made up of soft 
iron of mass about 8 g. 

11.12 The pyramid can be suspended from a spring of known spring constant, say $k_1$, and the system is made
to settle in a normal mode oscillation given by $\omega^4 - \omega^2\{(k_1 + k_0)/m_1 + k_0/m_0\} + k_1k_0/m_1m_0 = 0$, where
$m_0$ = mass of the bob, $m_1$ = mass of the pyramid, $k_0$ = spring constant of the support spring, $\omega_0 = \sqrt{k_0/m_0}$.
Measurement of $\omega_0$, $\omega$, and the total mass $M = m_1 + m_0$ gives $m_0$.

11.13 You have to rock it vertically with a frequency twice the natural frequency of the non-inverted pendulum.


Chapter 12 

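12.1 The inertial velocity of the point of contact is given by $\dot{\mathbf r} = \dot{\mathbf r}_{cm} + \boldsymbol\omega\times(\mathbf r - \mathbf r_{cm})$. The equation for
rolling constraint (without slipping) is (i) $\dot{\mathbf r} = 0$ and (ii) $\dot{\mathbf r} = \boldsymbol\Omega\times\mathbf r$, $\boldsymbol\Omega$ = angular velocity of the rotating
platform.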

12.2 If G be the centre of the sphere, its velocity v = x (ftirj + ftjrj) = |(fti + ft2)4[(ftirj + 

+ ft2)] = 0 x R, which corresponds to circular motion with angular velocity j(ftj + ih)k 
and a radius R. 

12.3 Ans. $\omega = -(2\pi/T)\cot\alpha\tan2\alpha$.

12.4 Use the results from the problems 3.2 and 3.3, and/or see Konopinski’s book. 

12.5 (iii) $D_{ij} = I_{kk}\delta_{ij} - 3I_{ij}$

12.6 Euler's equation of motion gives $I\ddot\theta = Fd$, giving $\theta(t) = (Fd/2I)t^2$. The equation of motion of the
cm: $\ddot x = (F/m)\cos\theta(t)$ and $\ddot y = (F/m)\sin\theta(t)$. These are the parametric equations for Cornu's spiral.

12.7 For cylinder $h = \sqrt{3}\,a$ and for the cone $h = a/2$.

12.9 Poynting vector $\mathbf N = \mathbf E\times\mathbf H$ = energy density × c, where the electric field $\mathbf E = -e\mathbf r/(4\pi\epsilon_0r^3)$,
the magnetic induction due to a magnetic dipole $\mathbf B = (1/4\pi\epsilon_0c^2)[3(\boldsymbol\mu\cdot\mathbf r)\mathbf r/r^5 - \boldsymbol\mu/r^3]$. The volume density of
linear momentum $\mathbf p$ = energy density/c $= \mathbf N/c^2$, the volume density of angular momentum about the centre
of the electron is $\mathbf l = \mathbf r\times\mathbf p$, and hence, the total angular momentum along the z-axis $L_z = \int\mathbf l\cdot\hat{\mathbf k}\,dV = S$

12.10 At saturation of magnetisation, each atom of iron has roughly two electrons with lined-up spins. The
sum total of all the spin angular momentum of the lined up electrons results in the increase in the total angular 
velocity of the body. Ans. 0.008 rad/s. 

12.11 Let B be the point fixed on the rim of the disc about which the disc can pivot, $\alpha$ is the angle through
which the disc travels, and $\theta = 2\delta$ = the angle through which the cat travels on the disc. So the angular
momentum of the disc about B at any instant is given by $L_d = \tfrac{3}{2}MR^2\dot\alpha$, and the angular momentum of the
cat about B $= L_c = (\dot\alpha - \dot\theta/2)\,4R^2m\sin^2(\theta/2)$. Since the total angular momentum before the cat began
walking was zero, the sum $L_d + L_c$ will always remain zero.

12.12 See Routh’s book. 

12.13 (i) Polhodes are intersections of two ellipsoids in $\omega$-space, one for the constant kinetic energy surface and
the other for constant angular momentum $L^2$




12.14 Can be found in the book. 

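12.15 $I_{11} = I_{22} = 2I_{33} = Mr^2$, the initial angular velocity $\boldsymbol\omega = (2\Omega/\sqrt{13}, 0, 3\Omega/\sqrt{13})$ giving
$L = 5Mr^2\Omega/(2\sqrt{13})$ and $2T = 17Mr^2\Omega^2/26$. Therefore, $\alpha_b = \sin^{-1}(2/\sqrt{13})$, $\alpha_s = \cos^{-1}(17/5\sqrt{13})$,
and $\sin\alpha_s : \sin\alpha_b = 3 : 5$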
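These numbers follow from a few lines of vector arithmetic. The sketch works in units $Mr^2 = 1$ and $\Omega = 1$, takes the body-cone half-angle as the angle between $\boldsymbol\omega$ and the symmetry axis and the space-cone half-angle as the angle between $\boldsymbol\omega$ and $\mathbf L$; the function name is illustrative.

```python
import math

def cone_angles(I1=1.0, I3=0.5, omega=(2.0, 0.0, 3.0)):
    """Return (|L|, 2T, alpha_b, alpha_s) for a symmetric top with |omega| = 1.

    Units: M r^2 = 1 and Omega = 1; omega is normalised internally.
    """
    norm = math.sqrt(sum(c * c for c in omega))
    w = [c / norm for c in omega]
    L = [I1 * w[0], I1 * w[1], I3 * w[2]]           # principal-axis components
    L_mag = math.sqrt(sum(c * c for c in L))
    two_T = sum(wi * Li for wi, Li in zip(w, L))    # 2T = omega . L
    alpha_b = math.asin(math.hypot(w[0], w[1]))     # omega vs symmetry axis
    alpha_s = math.acos(two_T / L_mag)              # omega vs L (|omega| = 1)
    return L_mag, two_T, alpha_b, alpha_s

L_mag, two_T, alpha_b, alpha_s = cone_angles()
print(L_mag, two_T, math.sin(alpha_s) / math.sin(alpha_b))
```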

12.16 Show first that the general solutions for Euler's equations of motion give $\omega_3 = Nt/C$, $\omega_1 + i\omega_2 =
K\exp(i\lambda t^2/2)$, $\lambda = N(A - C)/AC$. The initial conditions are, say, $\omega_3 = 0$, $\omega_1 = \Omega$, $\omega_2 = 0$. For the
equation for $\boldsymbol\omega$, assume $\boldsymbol\omega$ parallel to $\mathbf r$, so that $x = a\Omega\cos(\lambda t^2/2)$, $y = a\Omega\sin(\lambda t^2/2)$ and $z = aNt/C$,
a being the constant of proportionality between $\boldsymbol\omega$ and $\mathbf r$. Eliminate t and a in order to derive the desired
relation.

12.17 See section 12.18.

12.19 Start with Euler’s equation. 

12.20 A variation of the symmetric top problem. 

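12.21 The effective potential due to a rotating spheroid is given by $V_{\rm eff}(\mathbf r) = V(\mathbf r) - \tfrac{1}{2}|\boldsymbol\Omega\times\mathbf r|^2$, where
$V(\mathbf r)$ is given by Eq. (12.129), that is, $V(\mathbf r) = -GM/r + (GM/2r^3)(C - A)\{3(\hat{\mathbf r}\cdot\hat{\mathbf k})^2 - 1\}$. The surface
of the earth being a geoid is defined by $V_{\rm eff}(\mathbf r)$ = const. = K, say. The const. K can be evaluated by setting
$\mathbf r = c\hat{\mathbf k}$ or $\mathbf r = a\hat{\mathbf i}$. Finally derive $g_p = -\nabla V_{\rm eff}$ evaluated at $\mathbf r = c\hat{\mathbf k}$, and similarly $g_e$ at $\mathbf r = a\hat{\mathbf i}$.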

12.23 Ans. $T = 2\pi\sqrt{(28R^5/15 - 4R^3h^2 + 11R^2h^3/3 - 5Rh^4/4 + 3h^5/20)/gh^2(R - h/2)^2}$, where h
is the central thickness of the missing flat. For h = R, that is, a hemispherical cut, $T = 2\pi\sqrt{26R/15g}$
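The hemispherical limit provides a consistency check on the general answer. The sketch below evaluates the quantity under the square root, taking the coefficient of $R^2h^3$ to be 11/3, which is the value that reproduces $26R/15g$ at h = R; the function name and the default value of g are illustrative.

```python
def period_factor(R, h, g=9.81):
    """(T / 2 pi)^2 from the quoted answer, for sphere radius R and flat thickness h."""
    numerator = (28.0 * R**5 / 15.0 - 4.0 * R**3 * h**2 + 11.0 * R**2 * h**3 / 3.0
                 - 5.0 * R * h**4 / 4.0 + 3.0 * h**5 / 20.0)
    return numerator / (g * h**2 * (R - h / 2.0) ** 2)

R = 0.3
print(period_factor(R, R), 26.0 * R / (15.0 * 9.81))
```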

12.24 P = Pi + P 2 , Pi oc Mgv(a - o/) 2 /io, Pj « Mv 3 /a. The constants of proportionalities ari and 
ai for Pi and Pj can be taken as 0.05 and 0.1 respectively. So P becomes minimum for a ~ 0.7 m. 

12.25 For a spinning ball (spin axis transverse to the motion of the ball, forward spinning = spin axis going
from right to left), the change in the translational kinetic energy $\Delta T_t = -2m(6v_0^2 - 5R\omega_0v_0 - R^2\omega_0^2)/49$
and the change in the rotational kinetic energy $\Delta T_r = m(5v_0^2 + 4R\omega_0v_0 - 9R^2\omega_0^2)/49$. Obviously, for the
forward spinning case, $\omega_0 > 0$ and for the backward spinning case, $\omega_0 < 0$. The changes are bigger for the
bounce of backspinning balls off the level ground.

12.26 (i) Ans. $(\beta + 2\sqrt{2})L \simeq 2\beta L$, as $\beta \simeq 3$

(ii) Maximum height for pole vault $H_{pv} = v_0^2/2g + h_0 \simeq 6$ m

12.27 (i) Total force of reaction (= body weight) will produce more pressure if it has to act over a smaller 
surface area. During diving, the force of reaction is due to the drag force which depends on area of contact 
and velocity. For a given speed, the drag force is minimum if the area of cross-section with water is minimum. 

(ii) Internal forces can produce a net torque as well as net angular momentum, without violating the torque- 
angular momentum relation. 

(iii) Two identical pendula coupled by the shoulder link oscillate in exactly opposite phase in their normal
mode of oscillation. 

(iv) Given a volume, the surface area is minimum for a sphere. More round the configuration, less must be 
the enclosing surface area. 

12.28 PE $= V = -\tfrac{1}{4}\mu g(L^2 + 2Lx - x^2)$, x being the vertical drop of the free end of the chain from its
original level, $\mu$ the mass per unit length, L the total length of the chain. KE $= \tfrac{1}{4}\mu(L - x)\dot x^2$. Now show
that $\ddot x = g + \dot x^2/2(L - x) > g$. The physical reason is that there is some amount of tension in the chain
that pulls the free end down in addition to gravity.

12.29 See Am. J. Phys. 57, p. 40 (1989).

12.30 See Proc. Royal Society of London A405, p265 (1986). 

Chapter 13 

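13.1 $dl_i = \delta x_i + (\partial u_i/\partial x_j)\delta x_j = (\delta_{ij} + \partial u_i/\partial x_j)\delta x_j$ and hence from the Lagrangian definition of $e_{ij}$, we
have $e_{ij} = (\partial u_i/\partial x_j + \partial u_j/\partial x_i)/2 + (\partial u_k/\partial x_i)(\partial u_k/\partial x_j)/2$. Similarly, from the Eulerian definition of $e'_{ij}$,
$dl_i = \delta x_i = (\delta_{ij} - \partial u'_i/\partial x'_j)\delta x'_j$, and hence, $e'_{ij} = (\partial u'_i/\partial x'_j + \partial u'_j/\partial x'_i)/2 - (\partial u'_k/\partial x'_i)(\partial u'_k/\partial x'_j)/2$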




13.2 Take the definition of $e_{ij}$ from Eq. (13.6), differentiate twice and prove the relations. The given strains
are not compatible. 


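13.3 Use $u_x = u_r\cos\phi - u_\phi\sin\phi$, $u_y = u_r\sin\phi + u_\phi\cos\phi$, $u_z = u_z$. $e_{rr} = \partial u_r/\partial r$, $e_{\phi\phi} = u_r/r +
r^{-1}\partial u_\phi/\partial\phi$, $e_{zz} = \partial u_z/\partial z$, $e_{r\phi} = \tfrac{1}{2}(r^{-1}\partial u_r/\partial\phi + \partial u_\phi/\partial r - u_\phi/r)$, $e_{zr} = \tfrac{1}{2}(\partial u_r/\partial z + \partial u_z/\partial r)$, $e_{z\phi} =
\tfrac{1}{2}(r^{-1}\partial u_z/\partial\phi + \partial u_\phi/\partial z)$. For the given problem, $e_{rr} = a - b/r^2$, $e_{\phi\phi} = a + b/r^2$, $e_{zz} = e$, $e_{r\phi} = 0$,
$e_{zr} = 0$, $e_{z\phi} = cr/2$; the dilation $\Delta = e_{rr} + e_{\phi\phi} + e_{zz} = 2a + e$; the components of the rotation vector
$\omega_r = -cr/2$, $\omega_\phi = 0$, $\omega_z = cz$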


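13.4 $e_{ij} = (e_{11}, e_{22}, e_{33}, e_{12}, e_{13}, e_{23}) = \epsilon(1,1,2,0,2,2)$, dilation $\Delta = 4\epsilon$, rotation $\boldsymbol\omega = \epsilon(2\hat{\mathbf i} + \hat{\mathbf j} - 2\hat{\mathbf k})$, and
shear: about x-axis by an angle $4\epsilon$, and about y-axis by an angle $4\epsilon$. The principal strains $\lambda_1 = \epsilon$ along
$\mathbf r_1 = \hat{\mathbf i} - \hat{\mathbf j}$, $\lambda_{2,3} = (3 \pm \sqrt{33})\epsilon/2$ along $\mathbf r_{2,3} = \hat{\mathbf i} + \hat{\mathbf j} \pm 8\hat{\mathbf k}/(\sqrt{33} \mp 1)$.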

13.5 The principal strains are $0, \pm\sqrt{a^2 + b^2}$ and the corresponding principal axes are defined by $\hat{\mathbf k}$, $(a \pm
\sqrt{a^2 + b^2})\hat{\mathbf i}/b + \hat{\mathbf j}$. If we rotate by an angle $\theta$ about the z-axis, the strain tensor $e_{ij}$ transforms to $e'_{11} =
a\cos\theta + b\sin\theta = -e'_{22}$, $e'_{12} = -a\sin\theta + b\cos\theta = e'_{21}$, $e'_{13} = e'_{31} = e'_{23} = e'_{32} = 0$. Thus maximum positive
extension occurs along the angles $\theta = \tan^{-1}(b/a)$ and maximum negative extension along the directions
$\theta = \tan^{-1}(-a/b)$ with respect to the x-axis, all lying in the x-y plane. The maximum of shear of magnitude
$2\tan^{-1}\{b/(1 + a)\}$ takes place about the z-axis with $\theta = \tfrac{1}{2}\tan^{-1}(-a/b)$ with respect to the x-axis lying in
the x-y plane. There is no strain (that is, the length is preserved) along $\pm\hat{\mathbf k}$. Similarly, the directions are
preserved with non-zero strains along $\pm(a\hat{\mathbf i} + b\hat{\mathbf j})$, $\pm(b\hat{\mathbf i} - a\hat{\mathbf j})$

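13.6 (i) $e_{11} = \{\sigma_{11} - \lambda(\sigma_{11} + \sigma_{22})/(3\lambda + 2\mu)\}/2\mu$, $e_{22} = \{\sigma_{22} - \lambda(\sigma_{11} + \sigma_{22})/(3\lambda + 2\mu)\}/2\mu$, $e_{33} = -\lambda(\sigma_{11} +
\sigma_{22})/2\mu(3\lambda + 2\mu)$

(ii) $e_{11} = \sigma_{11}(\lambda + 2\mu)/4\mu(\lambda + \mu)$, $e_{33} = -\sigma_{11}\lambda/4\mu(\lambda + \mu)$, $\sigma_{22} = \lambda\sigma_{11}/2(\lambda + \mu)$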

13.7 Use the translational equilibrium equations (13.26), the stress-strain relations (13.43) and solve for the
strain tensor fields, namely $e_{ij} = e_{ij}(x_1,x_2,x_3)$ with appropriate boundary conditions.

13.8 Calculate the bending moment through a section of the strut at a height x to be given by $N =
Ya^4/12R$, R = radius of curvature of the strut at height x. Neglect dy/dx compared to 1, and set up the
differential equation, knowing that N must also be equal to the torque of the weight about the point at x, given
by $N = W(y_0 - y)$, where $y_0$ = projected horizontal extension or the horizontal coordinate of the point at x.
Then show that the solution will have the form $y = y_0(1 - \cos\omega x)$ with the auxiliary condition $y_0\cos\omega l = 0$,
where $\omega^2 = 12W/Ya^4$. So for $\omega l < \pi/2$, $y_0$ must be zero, that is, no deviation of the strut from its vertical
orientation. But as soon as $\omega l$ becomes equal to $\pi/2$ due to increase in W, the above condition for instability
can be approached.

13.9 When the chimney is falling as a whole, it rotates by an angle $\theta(t)$ about its base point satisfying
$ML^2\ddot\theta/3 = (MgL/2)\sin\theta$. If the chimney breaks at a distance x from the base, its lower end satisfies an
equation $Mx^3\ddot\theta/3L = (Mgx^2/2L)\sin\theta + xF - T$, where F is the force of shear developed at the breaking point
and T is the internal flexion torque at the point x. Similarly, the rotation of the upper broken portion about
its own cm satisfies an equation $[M(L - x)^3/12L]\ddot\theta = [L - x]F/2 + T$. So using these three equations one can
solve for F and T, as functions of x, the distance to the breaking point. For a thin chimney T $\gg$ product of F
with the width of the chimney, and then T is maximum at x = L/3. (The last two problems are taken from
the Chicago university problem book.)

13.10 Irrotational plane waves belong to the compressional longitudinal mode. The phase factor $kx_1 - \omega t$
suggests that the direction of propagation is along the $x_1$-axis with $\omega^2/k^2 = (\lambda + 2\mu)/\rho$, $e_{11} = \partial u_1/\partial x_1 =
Ak\cos(kx_1 - \omega t)$, all other $e_{ij} = 0$. Thus, $\sigma_{11} = (A\omega^2\rho/k)\cos(kx_1 - \omega t)$, $\sigma_{22} = \sigma_{33} = \lambda e_{11}$, and $\sigma_{22}/\sigma_{11} = k^2\lambda/\rho\omega^2$

13.11 (i) Distance d =1975 km (ii) Angular radius = 114°. 


Chapter 14 

14.1 Ans. 1.4 m/s, height of the jet h = y/v 1 ^ - h„ 

14.2 Ans. v = uhy/2lh — 1 




14.3 Since the liquid is incompressible by nature, the pressure inside the bubble cannot
pressure at various levels assume the following values: P 0 + 2 pgh at the top, P 0 + p{ ) 
and P„ + pgh at the level of the cup, H being the total height of the cylinder. 

14.4 (i) Since $\mathbf{v}$ is planar, $\boldsymbol{\omega}$ is perpendicular to the plane, and the result follows from the Kelvin-Helmholtz theorem.

(ii) p = p 0 exp{ ^y a ( A - }, the density thus not falling to zero at infinity. 

(iii) Start with $\nabla \times \mathbf{v} = 0$.

(iv) Start with the Kelvin-Helmholtz theorem. 

(v) Take the curl of both sides of Eq. (14.113).

(vi) Proceed similar to the derivation of Kelvin’s theorem with Euler’s equation replaced by Navier-Stokes.

14.5 Draw the flow patterns for visualisation. 

14.6 Ans. The thickness of the flow $d = (3\eta Q/\rho g b \sin\theta)^{1/3}$
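
As a hedged illustration, the sketch below evaluates this thickness assuming the answer reads $d = (3\eta Q/\rho g b\sin\theta)^{1/3}$, with $Q$ taken as the volume flow rate of a steady laminar film of width $b$ on a plane inclined at angle $\theta$. The function names and the sample numbers (roughly water-like) are assumptions made for the example.

```python
import math

def film_thickness(eta, Q, rho, b, theta, g=9.81):
    """Thickness d of the film, assuming d = (3*eta*Q / (rho*g*b*sin(theta)))**(1/3)."""
    return (3.0 * eta * Q / (rho * g * b * math.sin(theta))) ** (1.0 / 3.0)

def flow_rate(eta, d, rho, b, theta, g=9.81):
    """Inverse relation: volume flow rate carried by a film of thickness d."""
    return rho * g * math.sin(theta) * b * d ** 3 / (3.0 * eta)

# Hypothetical sample values in SI units.
eta, rho, b, theta = 1.0e-3, 1.0e3, 0.5, math.radians(10.0)
d = film_thickness(eta, 2.0e-5, rho, b, theta)
print(d)                                  # film thickness in metres
print(flow_rate(eta, d, rho, b, theta))   # recovers the input flow rate
```

Because of the cube root, doubling the flow rate thickens the film by only a factor of $2^{1/3}$.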

14.7 Start from the right hand side and apply Green’s theorem. 

14.8 Use f' = (- u,xr, and find its curl and circulation. 

14.9 Use the expressions for divergence of vectors in orthogonal curvilinear coordinates (see section A2.8). 

14.10 Motion of the fluid near the boundary ought to be normal to Vg. For moving boundaries, - 0, 

where » is the velocity of the boundary at the given point. 

14.11 Since u is given, ( = «xr. Find Vx|, which vanishes only if r*/(«*)= const. 

14.12 Ans. $y(x) = D(1 + 2gx/v^2)^{-1/4}$, $D$ = diameter of the mouth of the faucet.
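
The narrowing of the stream can be checked against conservation of volume flux. The sketch below assumes the answer reads $y(x) = D(1 + 2gx/v^2)^{-1/4}$, with $v$ the speed at the faucet mouth and $x$ the distance fallen, and that the water accelerates freely under gravity. The function names and sample values are illustrative.

```python
import math

def stream_diameter(x, D, v, g=9.81):
    """Diameter of the falling stream a distance x below the faucet mouth."""
    return D * (1.0 + 2.0 * g * x / v ** 2) ** -0.25

def stream_speed(x, v, g=9.81):
    """Speed after falling a distance x, starting from speed v."""
    return math.sqrt(v ** 2 + 2.0 * g * x)

# Hypothetical sample values: 2 cm mouth, 1 m/s exit speed, 0.5 m of fall.
D, v, x = 0.02, 1.0, 0.5
y = stream_diameter(x, D, v)
print(y)                                          # diameter at x, in metres
print(y ** 2 * stream_speed(x, v), D ** 2 * v)    # both proportional to the same flux
```

The product of cross-sectional area and speed stays constant along the stream, which is exactly what the exponent $-1/4$ encodes.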

14.13 Find the conditions for instability. This can be a good research problem for graduate students.

14.14 $g = -\sqrt{8\pi\rho_c GRT}\,\tanh(z\sqrt{2\pi\rho_c G/RT})$, $\rho = \rho_c\,\mathrm{sech}^2(z\sqrt{2\pi\rho_c G/RT})$, $\rho_c$ = central density, $R$ = gas
constant, and $T$ = temperature of the star (assumed to be isothermal).
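
A numerical consistency check of this answer is sketched below. It assumes the profile $\rho = \rho_c\,\mathrm{sech}^2(z/H)$ and $g = -\sqrt{8\pi\rho_c GRT}\tanh(z/H)$ with $H = \sqrt{RT/2\pi G\rho_c}$, takes $R$ as the gas constant per unit mass (so that $P = \rho RT$), and tests the two equilibrium conditions $RT\,d\rho/dz = \rho g$ and $dg/dz = -4\pi G\rho$ by central differences. All names and sample values are assumptions made for the example.

```python
import math

G = 6.67259e-11  # gravitational constant as listed in Appendix D, m^3 kg^-1 s^-2

def scale_height(rho_c, RT):
    return math.sqrt(RT / (2.0 * math.pi * G * rho_c))

def density(z, rho_c, RT):
    return rho_c / math.cosh(z / scale_height(rho_c, RT)) ** 2

def gravity(z, rho_c, RT):
    amplitude = math.sqrt(8.0 * math.pi * rho_c * G * RT)
    return -amplitude * math.tanh(z / scale_height(rho_c, RT))

def residuals(z, rho_c, RT, dz):
    """Relative residuals of hydrostatic balance and of the gravity equation at height z."""
    drho = (density(z + dz, rho_c, RT) - density(z - dz, rho_c, RT)) / (2.0 * dz)
    dg = (gravity(z + dz, rho_c, RT) - gravity(z - dz, rho_c, RT)) / (2.0 * dz)
    rho, g = density(z, rho_c, RT), gravity(z, rho_c, RT)
    hydrostatic = RT * drho / (rho * g) - 1.0
    poisson = dg / (-4.0 * math.pi * G * rho) - 1.0
    return hydrostatic, poisson

# Hypothetical sample values: central density in kg/m^3, RT in m^2/s^2.
rho_c, RT = 1.0e-18, 1.0e6
H = scale_height(rho_c, RT)
print(residuals(0.7 * H, rho_c, RT, 1.0e-4 * H))  # both entries should be tiny
```

Small residuals indicate that the assumed density and gravity profiles satisfy both equations simultaneously.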

14.15 Ans. $F = \pi\rho R^4\omega^2/4$, $\rho$ = density of air outside, $R$ = radius of the discs, and $\omega$ = angular velocity of


14.16 Define the equivalent resistance (electrical type) of the Poiseuille flow in a tube of diameter $D$ and
length $L$ by $R = \Delta P/Q = (128\eta/\pi)L/D^4$, $\eta$ being the viscosity of the fluid and $Q$ the flow rate. Let the
flow rate through the original artery drop to $Q_a = \Delta P/(R_0 + R_a)$, and after the graft of the bypass the flow
rate becomes $Q = \Delta P[R_g^{-1} + (R_0 + R_a)^{-1}]$, $R_0$ being the resistance of the obstruction developed and $R_g$
the resistance of the graft. Thus $Q/Q_a = 1 + (R_0 + R_a)/R_g$ = the improvement ratio of the flows after and
before the graft = function of only $L_a, D_a, L_g, D_g, L_0$ and $D_0$. The ratio can be about 28 for a choice of
$L_0/L_a = 0.2$, $D_0/D_a = 0.5$, $L_g/L_a = 0.4$ and $D_g/D_a = 2$, for example.





APPENDIX D 


Physical Constants 


Universal Constants

Quantity                                              Symbol           Value                    Unit

speed of light in vacuum                              $c$              299 792 458              m s$^{-1}$
permeability of vacuum                                $\mu_0$          $4\pi \times 10^{-7}$    N A$^{-2}$
                                                                       = 12.566 370 614...      $10^{-7}$ N A$^{-2}$
permittivity of vacuum, $1/\mu_0 c^2$                 $\epsilon_0$     8.854 187 817...         $10^{-12}$ F m$^{-1}$
Newtonian constant of gravitation                     $G$              6.672 59(85)             $10^{-11}$ m$^3$ kg$^{-1}$ s$^{-2}$
Planck constant                                       $h$              6.626 075 5(40)          $10^{-34}$ J s
  in electron volts, $h/e$                                             4.135 669 2(12)          $10^{-15}$ eV s
$h/2\pi$                                              $\hbar$          1.054 572 66(63)         $10^{-34}$ J s
  in electron volts, $\hbar/e$                                         6.582 122 0(20)          $10^{-16}$ eV s


Atomic and Nuclear Constants

Quantity                                              Symbol           Value                    Unit

elementary charge                                     $e$              1.602 177 33(49)         $10^{-19}$ C
fine-structure constant, $\mu_0 c e^2/2h$             $\alpha$         7.297 353 08(33)         $10^{-3}$
Bohr radius, $\alpha/4\pi R_\infty$                   $a_0$            0.529 177 249(24)        $10^{-10}$ m
electron mass                                         $m_e$            9.109 389 7(54)          $10^{-31}$ kg
                                                                       5.485 799 03(13)         $10^{-4}$ amu
  in electron volts, $m_e c^2/e$                                       0.510 999 06(15)         MeV
Compton wavelength, $h/m_e c$                         $\lambda_C$      2.426 310 58(22)         $10^{-12}$ m
  $\lambda_C/2\pi = \alpha a_0 = \alpha^2/4\pi R_\infty$               3.861 593 23(35)         $10^{-13}$ m
classical electron radius, $\alpha^2 a_0$             $r_e$            2.817 940 92(38)         $10^{-15}$ m
Thomson cross-section, $(8\pi/3) r_e^2$               $\sigma_e$       0.665 246 16(18)         $10^{-28}$ m$^2$
electron magnetic moment                              $\mu_e$          928.477 01(31)           $10^{-26}$ J T$^{-1}$
  in Bohr magnetons                                   $\mu_e/\mu_B$    1.001 159 652 193(10)
proton mass                                           $m_p$            1.672 623 1(10)          $10^{-27}$ kg
                                                                       1.007 276 470(12)        amu
  in electron volts, $m_p c^2/e$                                       938.272 31(28)           MeV
neutron mass                                          $m_n$            1.674 928 6(10)          $10^{-27}$ kg
                                                                       1.008 664 904(14)        amu
  in electron volts, $m_n c^2/e$                                       939.565 63(28)           MeV
Avogadro’s constant                                   $N_A, L$         6.022 136 7(36)          $10^{23}$ mol$^{-1}$
atomic mass constant, $m_u = \frac{1}{12} m(^{12}\mathrm{C})$   $m_u$  1.660 540 2(10)          $10^{-27}$ kg
  in electron volts, $m_u c^2/e$                                       931.494 32(28)           MeV
molar gas constant                                    $R$              8.314 510(70)            J mol$^{-1}$ K$^{-1}$
Boltzmann constant, $R/N_A$                           $k$              1.380 658(12)            $10^{-23}$ J K$^{-1}$
  in electron volts, $k/e$                                             8.617 385(73)            $10^{-5}$ eV K$^{-1}$
  in hertz, $k/h$                                                      2.083 674(18)            $10^{10}$ Hz K$^{-1}$
  in wavenumbers, $k/hc$                                               69.503 87(59)            m$^{-1}$ K$^{-1}$
Stefan-Boltzmann constant, $(\pi^2/60) k^4/\hbar^3 c^2$   $\sigma$     5.670 51(19)             $10^{-8}$ W m$^{-2}$ K$^{-4}$


Astronomical Constants

Quantity                                              Symbol           Value                    Unit

heliocentric gravitational constant                   $GM_\odot$       1.327 124 38             $10^{20}$ m$^3$ s$^{-2}$
geocentric gravitational constant                     $GM_\oplus$      3.986 004 48             $10^{14}$ m$^3$ s$^{-2}$
Astronomical unit                                     1 AU             1.495 978 706 6          $10^{11}$ m
equatorial radius of the sun                          $R_\odot$        6.959 9                  $10^{8}$ m
equatorial radius of the earth                        $R_\oplus$       6.378 137                $10^{6}$ m
angular velocity of the earth                         $\omega_\oplus$  7.292 115 146 7          $10^{-5}$ s$^{-1}$
mass ratio of the earth and the moon                  $M_\oplus/M_{\mathrm{moon}}$   81.300 813
radius of the moon                                    $R_{\mathrm{moon}}$   1.738 2             $10^{6}$ m
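
Several entries of the table are defined in terms of others, so the tabulated values can be cross-checked. The short sketch below recomputes a few derived constants from the primary ones using the defining relations given in the table; the variable names and the tolerances are choices made for this example only.

```python
import math

# Primary values copied from the table, in SI units.
c = 299792458.0          # speed of light in vacuum, m/s
mu0 = 4.0e-7 * math.pi   # permeability of vacuum, N/A^2
h = 6.6260755e-34        # Planck constant, J s
e = 1.60217733e-19       # elementary charge, C
m_e = 9.1093897e-31      # electron mass, kg
R = 8.314510             # molar gas constant, J/(mol K)
N_A = 6.0221367e23       # Avogadro's constant, 1/mol

# Derived quantities, following the relations printed beside each entry.
eps0 = 1.0 / (mu0 * c ** 2)                 # permittivity of vacuum
hbar = h / (2.0 * math.pi)                  # h / 2 pi
alpha = mu0 * c * e ** 2 / (2.0 * h)        # fine-structure constant
k = R / N_A                                 # Boltzmann constant
sigma = (math.pi ** 2 / 60.0) * k ** 4 / (hbar ** 3 * c ** 2)  # Stefan-Boltzmann
electron_rest_energy_MeV = m_e * c ** 2 / e / 1.0e6

def rel_diff(computed, tabulated):
    """Relative difference between a recomputed and a tabulated value."""
    return abs(computed - tabulated) / abs(tabulated)

for name, computed, tabulated in [
    ("eps0", eps0, 8.854187817e-12),
    ("hbar", hbar, 1.05457266e-34),
    ("alpha", alpha, 7.29735308e-3),
    ("k", k, 1.380658e-23),
    ("sigma", sigma, 5.67051e-8),
    ("m_e c^2 / e [MeV]", electron_rest_energy_MeV, 0.51099906),
]:
    print(name, computed, rel_diff(computed, tabulated))
```

The relative differences should be no larger than the quoted uncertainties of the inputs, which is a useful sanity check when transcribing such a table.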





Index 


Acceleration 6, 9, 10, 544, 545 
Acceleration of table fan blades 113, 116 
Acceleration due to gravity 129 
Acrobatics 438 
Action-angle variables 292 
completely degenerate 296 
for 1-D harmonic oscillator 295 
for Kepler’s problem 295
Action variables as adiabatic invariants 
299, 301 

Adiabatic fluid flow 484 
Adiabatic invariants 299 
for 1-D harmonic oscillator 301 
for a charged particle in a magnetic field 
302 

Admissible coordinate transformations 548 
improper 552 
proper 552 

Aerodynamic lifting force 
Affine transform 136 
Amontons G 16 
Analytic functions 502 
Cauchy-Riemann conditions for 503 
Angle of attack of an aerofoil 509 
Angle of scattering in CM frame 160 
in lab frame 167 
Angular momentum 349 
equivalence with moment of momentum 
350 

Angular momentum 81 
measured in a rotating frame 190 


Angular momentum in lab and CM frame 
366 

Angular velocities addition of 391 
Aphelion 132 
Apoastron 132 
Apocenter 132 
Apogee 132 
Appell, Paul 32, 45
Appell’s equation of motion 92 
Apsidal line (apsis) 132 
Areal velocity 546 
Areal velocity 127 
vector 131 
Aristotle 8 

Aristotle’s law of motion 6 
Arnold, V I 298

Artificial satellites, orbits of 142
Hohmann transfer orbit for 178
geostationary orbit 143 
geosynchronous orbit 142 
launching of 143 
Astronomical unit AU 135 
Asymmetric top 355 
Athlete, physical statistics of 425 
Atwood’s machine
Atwood’s oscillator 333
Avempace 6 

Azimuthal quantum number 308 
Azimuthal symmetry 161 
Bernoulli, Daniel 222, 476 
Bernoulli, Jacob 222
Bernoulli, Jean 31, 198, 222
Bernoulli equation 476, 485 
Bernoulli’s theorem 486, 490 




applications of 492, 496 
interpretation of 490 
special cases of 489 
Bhatnagar, P L 301 
Biharmonic equation 469 
Body cone 381 

direct and retrograde motions of 384 
relative motion with space cone 382 
Body force 10, 455 
Body waves 496 
Bohr model of atom 302 
Bohr-Sommerfeld quantisation rule 307 
Bondi, Hermann 446
Boomerang 414 
Born, Max 306 
Brachistochrones 223 
for uniform force 223 
inside a gravitating sphere 230 
on a surface of a cylinder 235 
Bradwardine, Thomas 6
Bulk modulus of elasticity 463 
Bulk strain 452 
Bunsen’s pump 493 
Burgers, J M 276, 299 
Candela 4 

Canonical equations of motion (see Hamilton’s equations)

Canonical momentum 21, 10 
Theorem on the conservation 16 
Canonical transformations (CT) 236, 238 
generating functions for 238, 241, 243 
Maxwell-like relations for 243 
as contact transformations 244 
extended 239 
univalent 239 
valence of 239 

Canonical transformations, properties of 

245,242 

examples of 248, 252 
generated by Hamilton’s principal function 254
group of 246 


of anharmonic oscillator Hamiltonian 
261 

preservation of volume in phase space 

9-45 

to the free particle Hamiltonian 252 
Canonicality conditions in terms of PB 
and LB 272,273 
Canonicality, conditions for 243 
of Galilean transformation 252 
of Lagrangian gauge transformation 249 
of electromagnetic gauge transformations 248

of infinitesimal coordinate transformations 250

of infinitesimal evolution 251 
of infinitesimal rotation 250 
of transformation to rotating frame 252 
Cantor, Georg 4 
Capillary waves 499 
Cartwheels rolling on an incline 86 
Cauchy, Augustin Louis 357, 447, 476
Cauchy’s stress quadric 459
Celestial equator 135 
latitude 135 
longitude 135 
Central force 59,118 

conservation of orbital angular momentum in 120, 122
definition 119 

equation of the orbit in 120, 124 
formal solution of 126 
planarity of 120 
properties of 119 

Central force constants required to specify 
an orbit 120 

expression for the energy integral 120 
scattering in 158, 171 
Central force, stability of orbits in 124 
condition for closure of orbits 125 
integrable power laws of 126 
Central force two-body problem 120 
condition for bounded motion 123 
effective potential for radial motion in 
122 




Centre of force 119 
Centre of mass 17 
theorems on 442 
Centrifugal force 2, 8, 21,99 
Centrifugal potential 100 
Centripetal force 154 
Chandler wobbling of the earth 319 
Chaos 63, 476 

Charged particle in a constant electric field 
116 

Charged particle in magnetic field 544 
Chasles’ theorem on rigid displacements 
339 

Christmas tree toy 413 
Circulation 509 
Kelvin’s theorem on 510 
Classical-quantum analogies 262, 302, 308 
Classification of crystals 464 
Classification of equilibria 311 
Closed system 11 

additive constants of motion for 32 
number of independent integrals for 94 
Coefficient of shearing viscosity 516
of bulk viscosity 516 
of sliding friction 16 
of static friction 16 
of viscous drag 434, 438, 520 
values for motion of vehicles through air 
520 

Collision of elastic bodies 8 
Commutator relations 305 
Comoving time derivative 479 
Compliance constants 466 
Compound pendula 325 
Compressibility of a fluid 477 
Compressional elastic wave 470 
speed of 470 

Conditionally periodic motion in phase 
space 298 

Conditions for integrability of a Hamiltonian system 298
Configuration space 60,137 
extended 60,137 
trajectory in 60 


Conic sections 132 
Conical pendulum 58 
Conservation of linear momentum 11 
Conservation of angular momentum of 
planets 127 

Conservative systems 67 
energy integral for 67 
Constraint equations 32,53 
classification of 33 
properties of 33 
Constraint forces 32, 34 
properties of 32 
work done by 34 
Constraint forces 21 
basic problem with 40 
Constraints 33 
bilateral 33, 35 
conservative 33, 35, 37 
definition 32 
dissipative 33,37 
examples of 35, 38 
holonomic 33, 35, 37 
nonholonomic 33, 41, 86 
rheonomic 33, 37, 50 
scleronomic 33, 35,37 
unilateral 33, 37,50 

Construction of new constants of motion 
using PBs 265, 266 
Contravariant basis vectors 531, 552 
coordinates 531 
Controlling body weights 439 
Coordinate approximation to inertial 
frames 527 
curve 548 

curvilinear 528, 548 
cylindrical polar 529 
frames 525 
lines 525 

nonorthogonal parabolic 532 
oblique cartesian 527, 531 
orthogonal 525, 530 
orthogonal parabolic 530 
orthogonal prolate spheroidal 530 
position 5, 530 




rectangular cartesian 525 
spherical polar 527 
surface 548 

Coordinate frames, handedness of 534, 535 
Copernicus, Nicholas I 
Coriolis, Gustave Gaspard de 36
Coriolis deflection of a projectile 106 
Gantmakher’s formula for 105 
Reich’s experiment for 106
Coriolis effect on river flows 101 
on cyclones 103 
on projectile motion 104 
on trade winds 103 
Coriolis force 21, 39 

as a gyroscopic force 113 
effects of 101, 108 
Cornu’s spiral 442 
Coulomb’s law of friction 16 
Covariant basis vectors 531, 551 
coordinates 531 
Curl of a vector function 537 
in orthogonal curvilinear coordinates 550 
Curvilinear coordinates 525 
Cycling 434 
Cycloids 223 

as a family of tesserals 228 
as a tautochrone 226 
equations of 225 
Cylindrical coordinates 529 
D’Alembert, Jean le Rond 32, 311, 476 
D’Alembertian system 48 
D’Alembert’s principle 47,61 
some applications of 49 
Darwin, G N 150 
De Broglie hypothesis 306 
Deep sea waves 498 
Deformable bodies 26 
Degenerate systems 282 
Degrees of freedom 57 
Delaunay, Charles 216 
Density (fluid) 477

distribution under external field 485 
Descartes, Rene 7 


Development of mechanics up to Newton 

6,8 

Dilation (see also bulk strain) 451 
DOF for 452 
Dirac, Paul 262
Direction cosines 527, 546 
Displacement vector field 448 
for isotropic body 469 
in a plane irrotational elastic wave 475 
Dissipative forces 67 

Euler-Lagrange equations for 68 
Double pendulum 90 
Drag force, quadratic law of 438 
Dynamic equilibrium 312 
Dynamics 2 
Eccentric anomaly 136 
Eccentricity of planetary orbits 132, 135 
relation with conic sections 133 
relation with specific energy 133 
vector 132 
Ecliptic 135, 416 

angle of inclination with 135 
Effective mass of electron (hole) 195 
Ehrenfest, Paul 262 
Einstein, Albert 447
Einstein and de Haas experiment 443 
Einstein summation convention 61, 195, 
204, 239, 534 

Elastic bodies, condition for translational 
equilibrium 457 

conditions for rotational equilibrium 458 
work done on, due to infinitesimal deformation 460

differential of the free energy of 460 
differential of the Gibbs function for 461 
Elastic moduli 462 

interrelation between 468 
Elastic waves in isotropic media 469, 473 
nature of plane wave solutions to 471 
longitudinal 472 
transverse 472 
Electric dipole moment 3 
multipole moments 3 
Electromagnetic wave equation 303 




Electron Paramagnetic Resonance EPR 
332 

Ellipsoid of inertia (see inertial ellipsoid) 
Elliptic integrals 121 

Energy of a particle in rotating frame 100 
Enthalpy function 484 
Epicycloids, equations for 232 
Epoch of the perihelion passage 135 
Epstein, P 226

Equation of a planetary orbit 132 
in velocity space 141 
Equation of continuity 480, 524 
application to Liouville’s theorem 482 
Equation of state 418 
Equatorial quantum number, see magnetic 
quantum number 
Equilibrium state 311 
dynamic 312 
metastable 312 
stable 312 
static 311 
unstable 312 

Equilibrium state of a fluid 480 
Equivalence principle 48 
Euclidean metric tensor 551 
determinant of 552 
Euclidean space 3, 525 
Euler-Lagrange equation from Hamilton’s 
principle 209 

Euler-Lagrange equations 65 
invariance under generalised coordinate 
transformations 23 

Euler, Leonhard 2, 23, 31, 198, 335, 426 
Euler force 99 
Eulerian angles 392 
angular velocities for 395 
line of node 392 
rotation matrix for 394 
Eulerian rotations 111 
Eulerian time derivative 429 
Euler’s analytical method for 328 
motion of angular velocity vector for 380 
of a top 388 

Euler’s equation of fluid motion 483 


Euler’s equation of rigid body motion 321 
modified 322 

Euler’s theorem on rigid displacements 338 
on inertial velocity of a rigid body 339 
Evolute and involute 228, 543 
Extended objects, properties of 3 
Fermat, Pierre de 8, 199
Fermat’s principle for the propagation of 
light rays 8 

Fermat’s principle of least time 199, 222 

Fermi, Enrico 333 

Feynman, Richard 302

First law of thermodynamics 51 

Flow through a pipe 512, 521 

Fluid 412 

imperfect or non-ideal 422 
in equilibrium 472, 480 
perfect or ideal 422 
compressible 422 
barotropic 428 
incompressible 427 
Fluid dynamical variables 422 
Fluid dynamics, chapter 14 476 
central problem of 478 
Flux density 158 
Foucault, Leon 108 
Foucault’s pendulum 108, 112 
earth’s period of rotation from 109 
Force 9

Force field 118 
central 119 

Force laws, examples of 14,16 
Forced vibration 322 
equation of motion for 328, 329 
average potential and kinetic energies of 329, 330

energy dissipated by damping force 330 
expressed in terms of normal coordinates 

328 

Frame of reference 3 
Free rotation of a rigid body 372 
Frenet-Serret formulae 542
Galilean law of inertia 2 
Galilean transformation 81 




Lagrangian gauge function for 82 
change of energy and momentum under 

94 

Galileo Galilei 2 

Galileo’s laws of falling bodies 2,7 
Gas 417 

Gas flow through a nozzle 493, 496 
sonic 496 
supersonic 496 
Gauss, Carl Friedrich 106, 536
Gauss constant of gravitation 135 
Gauss divergence theorem 539 
General solids, elastic properties of 464, 

466 

Generalised Hamiltonian, Dirac’s formulation 220

Generalised coordinates 59 
cyclic 25

Generalised force 62 
Generalised momentum 20 
for a charged particle in magnetic field 

12 

Generalised potential function 05 
for charged particle in em field 20 
relation with gyroscopic forces 69 
Generalised velocity 60 
kinetic energy in terms of 63 
Geodesics 218 
Geographical latitude 102 
Geoid 104 
oblateness of 104 
Gibbs, Josiah Willard 45, 534 
Gibbs’ potential 484 
Gibbs-Appell principle 45 
Gradient of a scalar function 536 
in orthogonal curvilinear coordinates 550 
Gravity waves 495 
critical wavelength of 498 
dependence on wavelength 498 
group velocity of 499 
speed of 492 
Green’s theorem 538 
Group velocity 192 


Guglielmini of Bologna 106
Gyroscope (Foucault’s) 419 
Gyroscopic forces 62 
work done by 22 
HJ equation for projectiles 309 
for a particle in the field of a dipole 310 
Hall, E H 106
Halley, Edmond 2 

Hamilton, Sir William Rowan 2, 23, 55,
132, 180, 185, 236 

Hamilton-Jacobi equation (time dependent) 215, 226
complete integral of 226 
connection with canonical transformation 254, 229

first and second integrals 222
for Kepler’s problem 285 
for central or axisymmetric forces 282 
for damped harmonic oscillator 289 
for simple harmonic oscillator 283

for swinging Atwood’s machine 282
in parabolic coordinates 286 
in parabolic cylindrical coordinates 288 
method of solving dynamical problems 
using 228 

necessary and sufficient condition for 
separability of coordinates in 282 
procedure to find complete integral of 
281 

time independent 215, 229 
Hamiltonian 183 
as a constant of motion 184 
for relativistic particles and light rays 
191 

importance for quantisation 185 
properties of 184 
Hamiltonian flows 256 

area conservation property of 255 
equivalence with canonical transformations 257

Hamilton’s characteristic function 180, 214 
Hamilton’s equations of motion 180, 183 
from Hamilton’s principle 210 




symmetry w.r.t. $q_i$ and $p_i$ 185
restriction to holonomic systems 185 
Hamilton’s principal function 212 
Hamilton’s principle 198, 206 
conditions for validity 202 
invariance under generalised coordinate 
transformations 211 
significance of 218 
Harmonic functions 502 
Harmonic oscillator (isotropic 2-D) 265
Runge-Lenz tensor for 266 
Harmonic oscillator 1-D 253 
HJ equation for 283 
action-angle variables for 295 
adiabatic invariants for 301 
damped 289 
1-D relativistic 334 
1-D, equation of motion 314 
Hatzfeld, Johann von 31
Heisenberg, Werner 262, 306
Helmholtz, Hermann von 476
Helmholtz’s vorticity theorem 512 
application of 512 
Herschel, William 335 
Hertz, Heinrich 53 

Hertz’s principle of least curvature 53 
Hess’s integral 329 
Heytesbury, William 6 
Hogen, J G 102

Homogeneity and isotropy of space 9, 216, 
217 

Homogeneity of space 28 
and conservation of linear momentum 
79,212 

Homogeneity of time 9, 78,216 
and conservation of energy 217, 19 
Hooke, Robert 8, 31,442 
Hooke’s law of elasticity 8, 126, 465 
Hurricanes 498 

Huygens, Christiaan 8, 208, 222, 223
Huygens’ theory of light propagation 8 
Hydrostatics 8 
Hypocycloid 230
equations for 231 


Impact parameter 159 
Impetus 6 

Imperfect fluids 515 
Navier-Stokes’ equation for 517 
rate of strain tensor for 516 
Impulse 88 

generalised component of 88 
instantaneous 88 
Incline, acceleration of 49 
Incompressible fluid flows around simple 
shapes 507 

around a rectangular corner 505 
around an angle 505 
around axially symmetric shapes 508 
around infinite plane sheet 506 
Incompressible fluids, steady irrotational 
flow in 2-D 503, 523, 509 
examples of 505, 506 
Inertial ellipsoid 357 
principal axes of 357 
Inertial forces 20, 96, 384
electromagnetic analogy of 100 
Inertial frames 9, 69, 78,81 
Infinitesimal rotations 115 
Integrable systems 297 
behaviour under perturbation 298 
Integrals of motion 26 
Internal forces 12 

Interrelation of stress and strain tensors 

468 

Invariable plane for a rotating rigid body 

325 

Inverse mapping theorem 247 
Inverse mass tensor 194 
Inverse square law of force 118, 126, 128 
outside and inside of a spherically symmetric body 126
Inverted pendulum 334 
Isentropic fluid flow 484 
Isochronal motion along a cycloid 8 
Isochrone (see tautochrone) 

Isothermal bulk modulus 467 
Isothermal fluid flow 484 
Isothermal speed of sound 471 




Isotropic bodies, conditions for translational equilibrium of
interrelations between elastic constants 
for 467 

interrelations between stress and strain 
tensors for 468 

isothermal bulk modulus of 467 
limits on Poisson’s ratio for 467 
Isotropic bodies, propagation of elastic 
waves in 469, 473 
Isotropic harmonic oscillator 196 
Isotropic solids 461 
elastic moduli for 462 
elastic properties of 466, 469 
forms of free energy and stress tensor for 
461 

Isotropy of space 78 

and conservation of angular momentum 

80, 217 

Jacobi, Carl Gustav Jacob 2, 23, 236, 262, 
276,476 

contribution to HJ formalism 214 
Jacobi integral 71, 83
Jacobi-Poisson theorem (see Poisson’s second theorem)

Jacobi’s identity for PBs 263 
Jacobi’s theorem 277 
proof of 278 
Jordan, Pascal 306
KAM theorem 298 
curves 299 
surfaces 299 
Kane, T R 45
Kelvin, Lord 476 

Kelvin’s theorem on circulation 510 
Kepler, Johannes 7 
laws of planetary motion 2, 7,127 
Kepler’s equation for elliptic orbits 138 
for hyperbolic and parabolic orbits 138 
Kepler’s problem of planetary motion 2, 
130 

in velocity space 140 
HJ equation and its solution for 285 
Kinematic viscosity 517 


Kinematics 2 

derivation of dynamics from 127 
Kinetic energy 18 
Kinetic energy function 63 
Kinetic energy of a particle 18, 545 
of acceleration 45 
Kolmogorov, A N 298 
Kothari, D S 301
Kowalevski’s integral 379 
Kronecker delta symbol 534 
Lab and CM frames, relation between scattering variables 167
Lagrange bracket (LB) 211 
relation with PB 272 

Lagrange, Joseph Louis 12, 23, 43, 55, 198,
262, 311, 335, 476

Lagrange’s equations of motion 61, 62 
for impulsive forces 88 
for nonholonomic systems 85 
Lagrange’s equations of the first kind 41, 

43 

Lagrange’s principle of least action 199, 

205 

conditions for validity 205
Lagrange’s theorem on stable equilibrium 
313 

Lagrange’s undetermined multiplier 43,85 
Lagrangian 65 
for a projectile 76 
gauge function for 72 
properties of 69 

Lagrangian for a pendulum bob 93 
for anharmonic oscillator 196 
for central force two body problem 121 
for free particle in various coordinates 

83, 84 

for particle in a rotating frame 99 
for relativistic particles and light rays 
189 

for relativistic pendulum 93 
for various instances of dynamical system 92

Lagrangian invariance under Galilean transformation 217




Lagrangian near stable equilibrium 314
Lagrangian time derivative 479 
Lambert 31 

Lame’s coefficients 462 
Laplace’s equation 303 
Latitudal quantum number 308 
Law of equipartition of energy 02 
harmonic forces 126 
Least constraint function 45 
Legendre’s dual transformation 181 
application to thermodynamic potentials 
182 

connecting generating functions for CTs 
241 

extension to include passive variables 
182 

Leibniz, Gottfried 19, 31, 198, 223
Lengthening of day 131 
Levi-Civita, T 226
Levi-Civita symbols 535 
antisymmetric 535 
connection with Kronecker delta 535 
Liapounoff 311 
Libration 292 

Liouville’s theorem 254, 482 
Liquid 422 

incompressibility of 422 
Lissajous’ figure 294 
Long jump, maximum range of 430 
Longitudinal mass of a particle 193 
Lorentz force 15, 296 
Lorentz gamma factor 192 
Lorentz invariance 212 
Magnetic quantum number 308 
Magnus force of lift 438 
Marsilius of Inghen 2 
Mass tensor for a nonrelativistic system 
194 

Maupertuis, Pierre de 199 
Maupertuis’ principle of least action 199, 

220 

comparison with Fermat’s principle 208 
Maxwell’s electrodynamical equations 302 
Mean anomaly 136 


Mean speed theorem 6 
Metastable equilibrium 312 
Modulus of rigidity 464 
Moment of inertia 3, 172, 355 
about any arbitrary direction 355 
Moment of inertia tensor 348 
expressions for the elements of 351 
Moment of inertia tensor of a homogeneous pyramid 358
experimental determination of 362 
of a homogeneous ellipsoid 360 
of earth 361 
table of 363 - 366 

Moment of inertia tensor, products of inertia 352

changes under translation 352 
parallel axes theorem 353 
perpendicular axes theorem 353 
principal axes transformation 354 
principal moments of inertia 354 
Moment of inertia, theorems on 442 
Moment of momentum 17, 81, 545 
Momental ellipsoid 357 
Moments of inertia of human body 426 
Moser 298 

Mossbauer spectroscopy 332 
Motion of a motorcycle 415 
Napier, John 6 

Navier-Stokes’ equation 517, 523 
Neap tides 156 

Newton, Sir Isaac 1, 8, 198, 208
Newton’s equations of motion 2, 8, 14, 131
for variable mass 19, 20
Newton’s law of gravitation 14, 127, 129
of causality 9, 122
of inertia 9
of reciprocity 10
of superposition 11
application to the system of particles 16, 18

different interpretations of 12 
Nicole de Oresme 6 
Nodal line, node 135 
ascending 135 




descending 135 
longitude of 135 
Noether, Emmy 199
Noether’s theorem 215 
Nonpotential forces 65, 86 
with linear dependence on generalised 
velocities 51 

Normal coordinates, nondegenerate case 
324 

degenerate case 325 
Normal modes of oscillation 319 
amplitudes of 319, 320 
characteristic equation for 320 
degeneracy of 321 
eigenfrequencies for 321 
orthonormality of 322 
Nuclear Magnetic Resonance NMR 332 
Null circulation theorem 509 
Observer 3 

Orbit construction of 139 
geometry of 139 

stability of (for a central force) 124 
Orbit of a particle 540 
binormal to 541 
principal normal to 541 
tangent to 540 
torsion of 542 
Orbit, see also trajectory 3 
construction of trajectory, see also orbit 
3 

of a simple pendulum in phase space 188 
of a system in configuration space 199, 
201 

of a system in phase space 181 
planetary 133 
Orbital elements 135 
construction of an orbit from 139 
Ordinary potential function 64 
Orthogonally decomposable systems 292 
Oscillator isotropic 130 
P and S waves 472 
shadow zone for P waves 473 
Parabolic ballistics 8,15 
Paraboloid of revolution 514 


Particle 3 

a quantum particle 3 
differential displacement of 550 
mass of 3 

ontological status of 3 
spin of 3 

Particle motion 543 

kinematics in spherical polar coordinates 
546 

kinematics of 543 
Pascal, Blaise 8

Pendulum with variable length 36 
Periastron 132 
Pericenter 132 
Perigee 132 
Perihelion 132 
longitude of 135 

Periodic systems in phase space 292 
completely degenerate 294 
degenerate 294 
Permutation symbols 535 
Pfaffian 259 
Phase fluid 188, 482 
Phase space 187 
natural motion of 254 
Planck units 23 
Planck’s law 304 
Playing ball games 438 
Poincare, Henri 311
Poincare integral 379 
Poincare’s recurrence theorem 298 
Poinsot’s geometrical construction 373 
Point transformations 74, 237 
extended 238, 248 
Point vortex 514 
Poiseuille 476 

Poiseuille’s formula 517, 518 
application to blood flow 524 
Poisson, Simeon Denis 23, 262 
Poisson bracket (PB) 259, 262 
anticommutative property 263 
connection with rotations 269 
distributive property 263 
elementary 264 




identities satisfied by 263 
invariance under canonical transformations 267

involving angular momentum 268 
Poisson’s ratio for elastic bodies 463 
Poisson’s second theorem on PBs 265 
Poisson’s theorem 264 
Polhodes and herpolhodes 316 
Potential energy 18 
Power 18 

Power consumption in human activities 
427 

Precession of a flywheel 116 
of angular velocity vector 386 
of a freely rotating body 384 
of earth’s axis of rotation 416
of the perihelia of planetary orbits 144 
in Hall potential 178 
in Yukawa potential 178 
of earth’s satellites 147 
of Mercury 148
Pressure 457 
in a fluid 411 
on river banks 492 
Pressure potential of a fluid 483 
Principal oscillations 324 
Principal quantum number 368 
Principle of least action 198, 199 
least constraint 45 
virtual work 39 
Problem of isoperimetry 198 
Products of inertia 352 
Pseudoforces (see inertial forces) 
Pseudoscalars 537 
Ptolemy, Claudius 7 
Ptolemy’s epicyclic model 129 
Q factor 330, 332 

Quasi-generalised coordinates 60, 86 
Quasi-periodic motion (see conditionally periodic motion)

Rabi, I 333
Race walking 428 
Radial quantum number 368 
Radial velocity of a planet 146 


Rate of strain tensor 516 
Rayleigh dissipation function 68 
Recoil angle 167 

Refractive index of an optical medium 192 
Relativistic mass tensors 192 
principal axes transformation for 193 
Resonance 330 
displacement 330 

full-width at half-maximum for 332 
velocity 331 

Resonant frequency 330 
displacement 330 
velocity 331 

Reverse force of inertia 48 
Reynolds, Osborne 476 
Reynolds’ number 519 
Rheonomic systems 59, 66, 93 
Ricatti 31 

Riemann-Christoffel symbols 221 
Rigid body, DOF of 336 
generalised coordinates for 331 
independent constant of motion for 337 
instantaneous angular velocity of 343 
moment of inertia tensor for 338 
velocity vector, screw motion view of 
343 

Euler’s equations of motion for 371 
rotational kinetic energy time variation 
for 372 

rotation of 372 
arbitrary rotations of 390 
constraints 35 

instantaneous axis of rotation of 344 
kinetic energy of 347 
body frame for 345 
frames of reference for 345 
instantaneous inertial velocity of a particle in 346

translational and rotational 348 
stability conditions for motion wrt rotating frames
Ripples 498 
speed of 499 
Rizzetti, Giovanni 31 




Roche limit 157, 179
Roll, pitch and yaw 390 
Rolle’s theorem 313 
Rolling without sliding 21 
Rotating frames 96 

relation with velocity and acceleration in 
a fixed frame 39 
time derivative of a vector in 98
Rotation 292 
Routh, Edward John 336 
Routhian 185 

effective reduction in DOF 186 
Runge-Lenz vector 131, 179 
modified 274 
specific 146 

Running, maximum speed of 429 
Rutherford’s formula 162 
SI units 4 

Scalar product of vectors 535 
Scalar velocity potential 486, 503 
Scalars 554 
Cartesian 554 
Lorentzian 554 
Scattering 158, 171 
by inverse square law force 161 
conservation of linear momentum 164 
conserved quantities in 164 
elastic 165 

energy conservation in 165 
Scattering cross-section differential 159 
enhancement of 163 
in lab and CM frames 170 
total 158 

Scattering momentum and energy transfer 
in 165 

of a spacecraft by Jupiter 166 
Scattering of protons through matter 170 
Schrodinger’s equation 305 
Schwarzschild, Karl 276
Schwarzschild metric 220 
Scleronomic systems 59, 66 
Seismic waves 472, 475 
Semi-latus rectum of planetary orbits 132, 
134 


Shallow water waves 500 
Shearing strain (shear) 452 
DOF for 452 

general, as a combination of three simple 
shears 454 
Shock front 496 
Signal to noise ratio 332 
Similarity transformation 324 
Simple pendula isochronous motion of 7 
Simple pendulum 36, 44, 58 
Simple pendulum in a moving lift 197 
Simple pendulum phase space trajectory of 
189 

Sliding friction, work energy relation for 

50, 53 

Small oscillations about equilibrium configuration 315 
kinetic energy for 319 
of a massless spring 316 
of a mercury column in a U tube 316 
of diatomic molecules 135 
positivity of 318, 319 
potential energy for 318 
study of, using generalised coordinates 
317, 327 

Snell, Willebrord 7 
Snell’s law of refraction 8, 298 
Solid angle 546 
Solid body 441 
deformation of 441 
virtual deformation 459 
Solid tides 156 
Sommerfeld, Arnold 335 
Space cone 382 
Space-time continuum 5 
Special theory of relativity 3, 22 
Specific angular momentum 159 
Specific energy 132 
gain in 134 

of colliding particle 159 
Spectral lines 398 
Spherical pendulum 49, 59, 334 
Spherical polar coordinates 527 
Spherical top 355 




Spontaneous action 11 
Spring tides 156 
Stability of circular orbits 124 
in screened coulomb potential 176 
Stable equilibrium 312 
Stackel, P 226 
Stark effect 310 
State space 182 
Static equilibrium 311 
Stationary flow of a fluid 480, 482 
Steady flow of a fluid 480, 481 
Stiffness constants 
Voigt’s notation for 465 
for a cubic crystal 466 
Stokes, George 476 

Stokes’ law of viscous drag 15, 68, 521 
Stokes’ theorem 540 
Strain ellipsoid 453 
Strain energy 459 
Strain, homogeneous 449 
Strain potential 453 
Strain tensor 449 
connection with rotation 450 
symmetric and antisymmetric parts of 
450 

Strain tensor in cylindrical polar coordinates 474 

compatibility conditions for 474 
Stream function 489, 505 
Streaming and shooting flows 501, 503, 524 
Streamlines 487 
tubes 487 
Stress 455 

external and internal 455 
negative stress (pressure) 457 
normal 455 
shearing 455 
traction 457 
Stress ellipsoid 459 
Stress tensor 455, 457 
Strolling 427 

Substantial time derivative 479 
Sucking effects 492 
Supernova 1987A 178 


Surface force 455 
Surface tension 498 
Swimming 435 
Swinehead, Richard 6 
Swinging Atwoods machine, HJ equation 
for 287 

Symmetric strain tensor 450 
diagonalisation of 451 
independent elements of 350 
invariance of its trace under rotation 451 
Symmetric top 355 

by Lagrangian method 396 
by Eulerian method 397 
effect of friction on 409, 411 
nutation 408 
rise and fall of 401 
rising top 409 
sleeping top 402, 404 
steady precession of 386, 390, 405 
free rotation of 388 
stability analysis of 407 
Tautochrones 223 
in centrifugal force field 232 
inside a gravitating sphere 230 
Taylor, Brook 31 
Tensor field 563 
Tensors 559 
Cartesian 561 
importance of 562 
properties of 560 
rank of 561 
Tesserals 228, 232 
Throwing 432 
a discus 434 
a javelin 433 
a shot put 433 
Tidal bulge 150 

Tidal forces on the earth due to the moon 
152 

work done by 155 
Tidal heights 156 
Tidal lag 155 
Tidal torque 157 
dissipation 157 




Tides 150, 151 
in a day 155 
Time 3 

as parameter for a path in phase space 
201 

continuity of 4 

end point variations in principle of least 
action 204, 202 

in extended configuration space 137 
instant of 4 

of passage through periapsis 135 
Time rates of change of quantities, types 
of 478 

Time reversal transformation 275 
Tippe top 411 

Torque relation with angular momentum 
308 

Torricelli, Evangelista 8 
Torsion 453 

Torsional elastic wave 470 
speed of 471 

Total energy function 18 
Total time derivative 479 
Traction 457 

Trajectory of a light ray through atmosphere 197 

Transverse mass of a particle 193 
Transverse velocity of a planet 140 
True anomaly of planetary orbits 132, 136 
Tshapliguine’s integral 379 
Turbulence 476, 520 
Tycho Brahe 2 
Uncertainty principle 304 
Units of measurement 4, 554 
Unstable equilibrium 312 
Vector field, divergence of 536 
curl of 536 
flux of 539 
Vectors 534 
axial 535 
polar 535 


in orthogonal curvilinear coordinate systems 549, 552 

in general curvilinear coordinate systems 
552, 554 
Cartesian 555 

addition and multiplication of 558 
connection with orthogonal transformations 556 

covariant and contravariant 557 
ordinary differentiation of 537 
integration of 538 
partial differentiation of 536 
Velocity 544, 545 

Velocity dependent potential function 65 
Velocity vector field in a fluid 477 
Velocity vector potential 486 
Venturimeter 493 
Vernal equinox 135 

Vertical jump, maximum height of 431 
Virial of a system 171 

connection with potential energy 172 
of charged particles in a magnetic field 
of ideal gas 173 
Virial theorem 171, 175 
application to stability of a star 174 
Virtual displacement 38, 53 
Virtual work 38 

done by the impulsive forces 89 
due to constraint forces conditions for 
vanishing 46 
Viscosity 476 
Vortex filament 514 
Vortex lines 487 
tubes 487 

Vorticity vector 486 

Wallis, John 29, 31 

Wave packet 304 

Weierstrass 325 

William of Ockham 2 

Women in sports 439 

Work done 18 

Wren, Sir Christopher 8 

Young’s modulus of elasticity 462 

Zeeman effect 296 







Nicholas Copernicus 
1473-1543 


Galileo Galilei 
1564-1642 




Johannes Kepler 
1571-1630 



Pierre-Simon Marquis de Laplace 
1749-1827 

















CLASSICAL MECHANICS 


The book presents a lucid treatment of classical mechanics with an emphasis on 
the understanding of the fundamentals. It develops an appreciation of the 
versatility of practically all the fundamental principles of physics. The book 
incorporates the recent developments in classical mechanics over the past four 
decades, and discusses in detail such topics as constrained systems, 
Lagrangian and Hamiltonian systems, canonical transformations, Hamilton-Jacobi 
theory, small oscillations, rigid body dynamics, central force problems, 
elasticity and fluid mechanics. 

Salient Features 

• Presents the history of development of classical mechanics. 

• Gives short biographies of scientists in the introduction to almost every chapter. 

• Includes over 200 problems with hints and answers, and a bank of short 
questions. 

• Gives real-life applications like launching geostationary satellites, scattering a 
spaceship off Jupiter, etc. 

• Analyses everyday experiences like riding a motor cycle, swimming, walking, 
running, and also discusses games and sports, toys of various kinds, and so on. 

• Includes qualitative analyses of the more advanced topics like Hamiltonian 
dynamics in phase space, integrable Hamiltonian systems, KAM theorem, 
origin of tides, precession of planetary orbits, microscopic origin of friction, etc. 

With its up-to-date and comprehensive coverage, this book would ideally meet the 
requirements of both undergraduate and postgraduate students of physics and 
engineering. 

Narayan Chandra Rana, a graduate from the University of Calcutta, has been with 
the Theoretical Astrophysics Group, Tata Institute of Fundamental Research, 
Bombay since 1977, and received his Ph D in physics from the University of 
Bombay in 1983. His research interests include the origin of the Cosmic 
Microwave Background Radiation, origin of light elements in the Big Bang, origin 
and distribution of heavy elements in the galaxies, rotation of the earth, positional 
astronomy, and history of astronomy. He received the INSA Young Scientist 
Award for the year 1983 and the year's best thesis award in the School of Physics, 
TIFR. He is actively involved in teaching physics and popularising astronomy. He 
has published over 50 scientific papers and 100 articles in various journals and 
magazines, and co-authored another book Our Solar System with Prof. A.W. 
Joshi. Presently he is at the Inter-University Centre for Astronomy and 
Astrophysics, Pune. 

Pramod Sharadchandra Joag is presently a senior lecturer in the Department of 
Physics, University of Poona. He received his Ph D in experimental solid state 
physics in 1982 from the same university. He has been teaching basic courses in 
physics at the post-graduate level for the last eleven years. His present research 
interests include the dynamical systems theory, chaos, growth problems, neural 
networks and differential equations. 



Tata McGraw-Hill 


Publishing Company Limited 

7 West Patel Nagar, New Delhi 110 008 



ISBN-13: 978-0-07-460315-4 
ISBN-10: 0-07-460315-9 












